Gene Information

Name : ECP_4555 (ECP_4555)
Accession : YP_672392.1
Strain : Escherichia coli 536
Genome accession : NC_008253
Putative virulence/resistance : Virulence
Product : hemolysin A
Function : -
COG functional category : Q : Secondary metabolites biosynthesis, transport and catabolism
COG ID : COG2931
EC number : -
Position : 4763630 - 4766704 bp
Length : 3075 bp
Strand : -
Note : -
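
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, an assumption that agrees with the listed length. The function name `feature_length` is hypothetical and is not part of any database tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values copied from the Position and Length fields of this record.
start, end, listed_length = 4763630, 4766704, 3075

computed = feature_length(start, end)
print(computed == listed_length)  # coordinates agree with the Length field
print(computed % 3 == 0)          # length is a whole number of codons
```

Both checks hold for this record: 4766704 - 4763630 + 1 = 3075 bp, which is 1025 codons (1024 amino acids plus a stop codon).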

DNA sequence :
ATGCCAACAATAACCACTGCACAAATTAAAAGCACACTGCAGTCTGCAAAGCAATCCGCTGCAAATAAATTGCACTCAGC
AGGACAAAGCACGAAAGATGCATTAAAAAAAGCAGCAGAGCAAACCCGCAATGCGGGAAACAGACTCATTTTACTTATCC
CTAAAGATTATAAAGGACAGGGTTCAAGCCTTAATGACCTTGTCAGGACGGCAGATGAACTGGGAATTGAAGTCCAGTAT
GATGAAAAGAATGGCACGGCGATTACTAAACAGGTATTCGGCACAGCAGAGAAACTCATTGGCCTCACCGAACGGGGAGT
GACTATCTTTGCACCACAATTAGACAAATTACTGCAAAAGTATCAAAAAGCGGGTAATAAATTAGGCGGCAGTGCTGAAA
ATATAGGTGATAACTTAGGAAAGGCAGGCAGTGTACTGTCAACGTTTCAAAATTTTCTGGGTACTGCACTTTCCTCAATG
AAAATAGACGAACTGATAAAGAAACAAAAATCTGGTGGCAATGTCAGTTCTTCTGAACTGGCAAAAGCGAGTATTGAGCT
AATCAACCAACTCGTGGACACAGCTGCCAGCCTTAATAATAATGTTAACTCATTTTCTCAACAACTCAATAAGCTGGGAA
GTGTATTATCCAATACAAAGCACCTGAACGGTGTTGGTAATAAGTTACAGAATTTACCTAACCTTGATAATATCGGTGCA
GGGTTAGATACTGTATCGGGTATTTTATCTGCGATTTCAGCAAGTTTCATTCTGAGCAATGCAGATGCAGATACCGGAAC
TAAAGCTGCAGCAGGTGTTGAATTAACAACGAAAGTACTGGGTAATGTTGGAAAAGGTATTTCTCAATATATTATCGCAC
AGCGTGCAGCACAGGGGTTATCTACATCTGCTGCTGCTGCCGGTTTAATTGCTTCTGTTGTGACACTGGCAATTAGTCCA
CTCTCATTCCTGTCCATTGCCGATAAGTTTAAACGTGCCAATAAAATAGAGGAGTATTCACAACGATTCAAAAAACTTGG
ATACGATGGTGACAGTTTACTTGCTGCTTTCCACAAAGAAACAGGAGCTATTGATGCATCGTTAACAACGATAAGCACTG
TTCTGGCTTCAGTATCTTCAGGTATTAGTGCTGCTGCAACGACATCTCTGGTTGGTGCACCGGTAAGCGCGCTGGTAGGG
GCTGTTACGGGGATAATTTCAGGCATCCTTGAGGCTTCAAAACAGGCAATGTTTGAACATGTCGCCAGTAAAATGGCCGA
TGTTATTGCTGAATGGGAGAAAAAACACGGCAAAAATTACTTTGAAAATGGATATGATGCCCGCCATGCTGCATTTTTAG
AGGATAACTTTGAAATATTATCTCAGTATAATAAAGAGTATTCTGTTGAAAGATCTGTCCTCATTACCCAGCAACATTGG
GATACGCTGATAGGTGAGTTAGCTGGTGTCACCAGAAATGGAGACAAAACACTCAGTGGTAAAAGTTATATTGACTATTA
TGAAGAAGGAAAACGACTGGAGAAAGAACCGGATGAATTCCAGAAGCAAGTCTTTGACCCATTAAAAGGGAATATTGACC
TGTCTGTAATTAAGTCATCTACTTTGTTAAAGTTTATTACGCCATTATTGACTCCTGGCAAGGAAATTCGTGAAAGGAGG
CAGTCCGGAAAATACGAGTATATTACCGAACTATTAGTCAAGGGGGTTGATAAATGGACGGTGAAGGGGGTTCAGGACAA
GGGGTCTGTGTATGATTACTCTAACCTGATTCAGCATGCATCTGTCGGTAATAACCAGTATCGGGAAATTCGTATTGAGT
CACACCTGGGAGACGGGGATGATAAGGTCTTTTTATCTGCCGGCTCAGCCAATATCTACGCAGGAAAGGGACATGATGTT
GTTTATTATGATAAAACAGACACCGGTTATCTGACCATTGATGGCACAAAAGCAACTGAAGCCGGTAATTACACGGTAAC
ACGTGTACTTGGTGGCGATGTTAAGGTTTTACAGGAAGTTGTGAAGGAGCAGGAGGTTTCAGTTGGCAAAAGGACTGAAA
AAACACAATATCGAAGCTATGAATTCACTCATATTAATGGCACAGACTTAACCGAGACAGATAACTTATACTCCGTGGAG
GAGCTCATCGGAACCAACCGTGCTGATAAGTTTTTTGGCAGCAAATTTACAGATATCTTCCATGGCGCGGATGGTGATGA
CCACATAGAAGGAAATGATGGGAATGACCGCTTATATGGTGATAAAGGTAATGACACACTGAGGGGCGGAAACGGGGATG
ACCAGCTCTATGGCGGTGATGGTAACGATAAGCTAACCGGAGGTGTGGGTAATAACTACCTTAATGGCGGAGACGGGGAT
GATGAGCTTCAGGTTCAGGGTAATTCTCTTGCTAAAAATGTATTATCCGGTGGAAAAGGTAATGACAAGCTGTACGGCAG
TGAGGGGGCAGATCTGCTTGATGGCGGAGAAGGGAATGATCTCCTGAAGGGGGGGTATGGTAATGATATTTATCGTTATC
TTTCAGGATATGGCCATCATATTATTGACGATGATGGGGGAAAAGACGATAAACTCAGTTTGGCTGATATTGATTTCCGG
GACGTTGCCTTTAAGCGAGAAGGAAATGACCTCATCATGTATAAAGCTGAAGGTAATGTTCTTTCCATTGGTCATAAAAA
TGGTATTACATTCAGGAACTGGTTTGAAAAAGAGTCAGGTGATATCTCTAATCACCAGATAGAGCAGATTTTTGATAAAG
ATGGCCGGGTCATCACACCAGATTCCCTTAAAAAAGCATTTGAATATCAGCAGAGTAATAATCAAGCCAATTATGTGTAT
GGAGAGTATGCATCAACTTATGCAGACCTGGATAATCTGAATCCATTAATTAATGAAATCAGCAAAATTATTTCAGCTGC
AGGTAACTTCGATGTTAAGGAGGAAAGATCTGCCGCTTCTTTATTGCAGTTGTCCGGTAATGCCAGTGATTTTTCATATG
GACGGAACTCAATAACTTTGACCGCATCAGCATAA

Protein sequence :
MPTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQGSSLNDLVRTADELGIEVQY
DEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQKYQKAGNKLGGSAENIGDNLGKAGSVLSTFQNFLGTALSSM
KIDELIKKQKSGGNVSSSELAKASIELINQLVDTAASLNNNVNSFSQQLNKLGSVLSNTKHLNGVGNKLQNLPNLDNIGA
GLDTVSGILSAISASFILSNADADTGTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIASVVTLAISP
LSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVG
AVTGIISGILEASKQAMFEHVASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFEILSQYNKEYSVERSVLITQQHW
DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKEPDEFQKQVFDPLKGNIDLSVIKSSTLLKFITPLLTPGKEIRERR
QSGKYEYITELLVKGVDKWTVKGVQDKGSVYDYSNLIQHASVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDV
VYYDKTDTGYLTIDGTKATEAGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGTDLTETDNLYSVE
ELIGTNRADKFFGSKFTDIFHGADGDDHIEGNDGNDRLYGDKGNDTLRGGNGDDQLYGGDGNDKLTGGVGNNYLNGGDGD
DELQVQGNSLAKNVLSGGKGNDKLYGSEGADLLDGGEGNDLLKGGYGNDIYRYLSGYGHHIIDDDGGKDDKLSLADIDFR
DVAFKREGNDLIMYKAEGNVLSIGHKNGITFRNWFEKESGDISNHQIEQIFDKDGRVITPDSLKKAFEYQQSNNQANYVY
GEYASTYADLDNLNPLINEISKIISAAGNFDVKEERSAASLLQLSGNASDFSYGRNSITLTASA
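
The relationship between the DNA and protein sequences above can be illustrated with a minimal translation sketch. It assumes the DNA is listed in coding orientation (it begins with ATG and ends with TAA, although the gene is on the minus strand) and uses the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the DNA sequence in this record.
print(translate("ATGCCAACAATAACCACTGCACAAATTAAA"))  # MPTITTAQIK
```

Applied to the full 3075 bp sequence, the same function should yield a 1024-residue protein, which is the length of the protein sequence listed in this record.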

• Homologs from PAI DB

Gene  GenBank Accn  Product       Virulence or Resistance  PAI or REI    Alignment Type  E-val  Identity (%)
hlyA  CAD33759.1    hemolysin A   Virulence                PAI I 536     Protein         0.0    99
hlyA  CAD42039.1    HlyA protein  Virulence                PAI II 536    Protein         0.0    99
hlyA  NP_755445.1   hemolysin A   Virulence                PAI I CFT073  Protein         0.0    98

• Homologs from VFDB (virulence genes)

Gene      GenBank Accn  Product      ID of source DB  Alignment Type  E-val  Identity (%)
ECP_4555  YP_672392.1   hemolysin A  VFG1500          Protein         0.0    99
ECP_4555  YP_672392.1   hemolysin A  VFG1558          Protein         0.0    99
ECP_4555  YP_672392.1   hemolysin A  VFG0906          Protein         0.0    98
ECP_4555  YP_672392.1   hemolysin A  VFG0840          Protein         0.0    62