Gene Information

Name : ECP_3827 (ECP_3827)
Accession : YP_671699.1
Strain : Escherichia coli 536
Genome accession: NC_008253
Putative virulence/resistance : Virulence
Product : hemolysin A
Function : -
COG functional category : Q : Secondary metabolites biosynthesis, transport and catabolism
COG ID : COG2931
EC number : -
Position : 3993570 - 3996644 bp
Length : 3075 bp
Strand : +
Note : -
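
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the stated length of 3075 bp, and that the final codon of the coding sequence is a stop codon. The function names are illustrative and are not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS of cds_bp bases, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates taken from the Position field of this record.
length = feature_length(3993570, 3996644)
print(length)                  # 3075
print(protein_length(length))  # 1024
```

Both values match the record: the DNA sequence below is 3075 bases long and the protein sequence is 1024 residues long.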

DNA sequence :
ATGCCAACAATAACCACTGCACAAATTAAAAGCACACTGCAGTCTGCAAAGCAATCCGCTGCAAATAAATTGCACTCAGC
AGGACAAAGCACGAAAGATGCATTAAAAAAAGCAGCAGAGCAAACCCGCAATGCGGGAAACAGACTCATTTTACTTATCC
CTAAAGATTATAAAGGACAGGGTTCAAGCCTTAATGACCTTGTCAGGACGGCAGATGAACTGGGAATTGAAGTCCAGTAT
GATGAAAAGAATGGCACGGCGATTACTAAACAGGTATTCGGCACAGCAGAGAAACTCATTGGCCTCACCGAACGGGGAGT
GACTATCTTTGCACCACAATTAGACAAATTACTGCAAAAGTATCAAAAAGCGGGTAATAAATTAGGCGGCAGTGCTGAAA
ATATAGGTGATAACTTAGGAAAGGCAGGCAGTGTACTGTCAACGTTTCAAAATTTTCTGGGTACTGCACTTTCCTCAATG
AAAATAGACGAACTGATAAAGAAACAAAAATCTGGTAGCAATGTCAGTTCTTCTGAACTGGCAAAAGCGAGTATTGAGCT
AATCAACCAACTCGTGGACACAGCTGCCAGCATTAATAATAATGTTAACTCATTTTCTCAACAACTCAATAAGCTGGGAA
GTGTATTATCCAATACAAAGCACCTGAACGGTGTTGGTAATAAGTTACAGAATTTACCTAACCTTGATAATATCGGTGCA
GGGTTAGATACTGTATCGGGTATTTTATCTGTGATTTCAGCAAGTTTCATTCTGAGCAATGCAGATGCAGATACCGGAAC
TAAAGCTGCAGCAGGTGTTGAATTAACAACGAAAGTACTGGGTAATGTTGGAAAAGGTATTTCTCAATATATTATCGCAC
AGCGTGCAGCACAGGGGTTATCTACATCTGCTGCTGCTGCCGGTTTAATTGCTTCTGCTGTGACACTGGCAATTAGTCCC
CTCTCATTCCTGTCCATTGCCGATAAGTTTAAACGTGCAAATAAAATAGAGGAGTATTCACAACGATTCAAAAAACTTGG
ATACGATGGTGACAGTTTACTTGCTGCTTTCCACAAAGAAACAGGAGCTATTGATGCATCATTAACAACGATAAGCACTG
TACTGGCTTCAGTATCCTCAGGTATTAGTGCTGCTGCAACGACATCTCTTGTTGGTGCACCGGTAAGCGCACTGGTAGGT
GCTGTTACGGGGATAATTTCAGGTATCCTTGAGGCTTCAAAACAGGCAATGTTTGAACATGTCGCCAGTAAAATGGCCGA
TGTTATTGCTGAATGGGAGAAAAAACACGGCAAAAATTACTTTGAAAATGGATATGATGCCCGCCATGCTGCATTTTTAG
AAGATAACTTTAAAATATTATCTCAGTATAATAAAGAGTATTCTGTTGAAAGATCAGTCCTCATTACTCAGCAACATTGG
GATACGCTGATAGGTGAGTTAGCTGGTGTCACCAGAAATGGAGACAAAACACTCAGTGGTAAAAGTTATATTGACTATTA
TGAAGAAGGAAAACGTCTGGAGAAAAAACCGGATGAATTCCAGAAGCAAGTCTTTGACCCATTGAAAGGAAATATTGACC
TTTCTGACAGCAAATCTTCTACGTTATTGAAATTTGTTACACCATTGTTAACTCCCGGTGAGGAAATTCGTGAAAGGAGG
CAGTCCGGAAAATATGAATATATTACCGAGTTATTAGTCAAGGGTGTTGATAAATGGACGGTGAAGGGGGTTCAGGATAA
GGGGTCTGTATATGATTACTCTAACCTGATTCAGCATGCATCAGTCGGTAATAACCAGTATCGGGAAATTCGTATTGAGT
CACACCTGGGAGACGGGGATGATAAGGTCTTTTTAGCTGCCGGCTCAGCCAATATCTACGCAGGAAAGGGGCATGATGTT
GTTTACTATGATAAAACAGATACCGGTTATCTGACCATCGATGGTACAAAAGCAACTGAAGCCGGTAATTACACGGTAAC
ACGTGTACTTGGTGGCGATGTTAAGGTTTTACAGGAAGTTGTGAAGGAGCAGGAGGTTTCAGTTGGCAAAAGGACTGAAA
AAACACAATATCGAAGCTATGAATTCACTCATATTAATGGCACAGACTTAACCGAGACAGATAACTTATACTCCGTGGAG
GAGCTCATCGGAACCAACCGTGCTGATAAGTTTTTTGGCAGCAAATTTACAGATATCTTCCATGGCGCGGATGGTGATGA
CCACATAGAAGGAAATGATGGGAATGACCGCTTATATGGTGATAAAGGTAATGACACACTGAGGGGCGGAAACGGGGATG
ACCAGCTCTATGGCGGTGATGGTAACGATAAGCTAACCGGAGGTGTGGGTAATAACTACCTTAATGGCGGAGACGGGGAT
GATGAGCTTCAGGTTCAGGGTAATTCTCTTGCTAAAAATGTATTATCCGGTGGAAAAGGTAATGACAAGCTGTACGGCAG
TGAGGGGGCAGATCTGCTTGATGGCGGAGAAGGGAATGATCTCCTGAAGGGGGGGTATGGTAATGATATTTATCGTTATC
TTTCAGGATATGGCCATCATATTATTGACGATGATGGGGGAAAAGACGATAAACTCAGTTTGGCTGATATTGATTTCCGG
GACGTTGCCTTTAAGCGAGAAGGAAATGACCTCATCATGTATAAAGCTGAAGGTAATGTTCTTTCCATTGGTCATAAAAA
TGGTATTACATTCAGGAACTGGTTTGAAAAAGAGTCAGGTGATATCTCTAATCACCAGATAGAGCAGATTTTTGATAAAG
ATGGCCGGGTCATCACACCAGATTCCCTTAAAAAAGCATTTGAATATCAGCAGAGTAATAATCAAGCCAATTATGTGTAT
GGAGAGTATGCATCAACTTATGCAGACCTGGATAATCTGAATCCATTAATTAATGAAATCAGCAAAATTATTTCAGCTGC
AGGTAACTTCGATGTTAAGGAGGAAAGATCTGCCGCTTCTTTATTGCAGTTGTCCGGTAATGCCAGTGATTTTTCATATG
GACGGAACTCAATAACTTTGACCGCATCAGCATAA

Protein sequence :
MPTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQGSSLNDLVRTADELGIEVQY
DEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQKYQKAGNKLGGSAENIGDNLGKAGSVLSTFQNFLGTALSSM
KIDELIKKQKSGSNVSSSELAKASIELINQLVDTAASINNNVNSFSQQLNKLGSVLSNTKHLNGVGNKLQNLPNLDNIGA
GLDTVSGILSVISASFILSNADADTGTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTLAISP
LSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVG
AVTGIISGILEASKQAMFEHVASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW
DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKKPDEFQKQVFDPLKGNIDLSDSKSSTLLKFVTPLLTPGEEIRERR
QSGKYEYITELLVKGVDKWTVKGVQDKGSVYDYSNLIQHASVGNNQYREIRIESHLGDGDDKVFLAAGSANIYAGKGHDV
VYYDKTDTGYLTIDGTKATEAGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGTDLTETDNLYSVE
ELIGTNRADKFFGSKFTDIFHGADGDDHIEGNDGNDRLYGDKGNDTLRGGNGDDQLYGGDGNDKLTGGVGNNYLNGGDGD
DELQVQGNSLAKNVLSGGKGNDKLYGSEGADLLDGGEGNDLLKGGYGNDIYRYLSGYGHHIIDDDGGKDDKLSLADIDFR
DVAFKREGNDLIMYKAEGNVLSIGHKNGITFRNWFEKESGDISNHQIEQIFDKDGRVITPDSLKKAFEYQQSNNQANYVY
GEYASTYADLDNLNPLINEISKIISAAGNFDVKEERSAASLLQLSGNASDFSYGRNSITLTASA

• Homologs from PAI DB

Gene | GenBank Accn | Product      | Virulence or Resistance | PAI or REI   | Alignment Type | E-val | Identity (%)
hlyA | CAD42039.1   | HlyA protein | Virulence               | PAI II 536   | Protein        | 0.0   | 99
hlyA | CAD33759.1   | hemolysin A  | Virulence               | PAI I 536    | Protein        | 0.0   | 99
hlyA | NP_755445.1  | hemolysin A  | Virulence               | PAI I CFT073 | Protein        | 0.0   | 98

• Homologs from VFDB (virulence genes)

Gene     | GenBank Accn | Product     | ID of source DB | Alignment Type | E-val | Identity (%)
ECP_3827 | YP_671699.1  | hemolysin A | VFG1500         | Protein        | 0.0   | 99
ECP_3827 | YP_671699.1  | hemolysin A | VFG1558         | Protein        | 0.0   | 99
ECP_3827 | YP_671699.1  | hemolysin A | VFG0906         | Protein        | 0.0   | 98
ECP_3827 | YP_671699.1  | hemolysin A | VFG0840         | Protein        | 0.0   | 62