Gene Information

Name : eae (ECO103_3609)
Accession : YP_003223466.1
Strain : Escherichia coli 12009
Genome accession: NC_013353
Putative virulence/resistance : Virulence
Product : intimin epsilon
Function : -
COG functional category : N : Cell motility
COG ID : COG5492
EC number : -
Position : 3686375 - 3689221 bp
Length : 2847 bp
Strand : -
Note : Integrative element ECO103_IE03

DNA sequence :
ATGATTACTCATGGTTTTTATACCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTTATTATGCTTAGTGCTGGTTT
AGGATTGTTTTTTTATGTTAACCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGAGTTCAGATTCAAAACTGT
TAACTCAAAATGCCGCTCAGGATCGCCTTTTTTATACGTTAAAAACAGGTGAAACTGTTGCCAATATTTCTAAATCACAG
GGGATCAGTTTATCGGTAATTTGGTCACTGAATAAACATTTATACAGCTCCGAAAGCGAAATGATGAAGGCTGGGCCTGG
TCAGCAGATCATTTTGCCACTCAAAAAACTGTCTGTTGAATATAGTGCCTTACCTGTCTTAGGTTCGGCACCTGTTGTTG
CTGCAGGTGGTGTCGCTGGTCATACGAATAAAATGACTAAAATGTCCCCGGACGCGACTAAAAGCAACACGACCGATGAC
AAGGCTCTAAATTATGCGGCACAACAGGCCGCGAGCCTTGGTAGCCAGCTCCAGTCGCGCTCACTGAACGGCGATTACGC
GAAAGATACCGCTCTTGGTATGGCCAGCAGCCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG
AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATTACCGTTCTATGATTCCGAAAACATG
CTGGCATTTGGTCAGGTCGGGGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCTGGCCAGCGTTTTTTCCT
TCCTGAAAATATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG
AATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA
GACTATGATGAGCGCCCGGCAAATGGTTTTGATATCCGCTTTAATGGCTATTTACCATCATATCCGGCATTAGGCGCCAA
ACTGATGTACGAACAGTATTATGGTGATAATGTTGCTTTGTTTAATTCCGATAAGTTGCAGTCGAATCCTGGCGCGGCGA
CCGTTGGTGTAAACTACACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT
CTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGATAAACCGTGGTCTCAGCAAATCGAGCCACAGTATGTTAACGAGTT
AAGAACATTATCGGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAAAAGCAGGATATTC
TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGATTCAATTGATCGTTAAGAGCAAATAT
GGTCTGGATCGTATCGTCTGGGATGATAGTGCATTACGCAGTCAGGGCGGTCAGATTCAGCATAGCGGAAGCCAAAGCGC
ACAAGACTACCAGGCTATTTTGCCTGCTTATGTGCAAGGTGGCAGCAATATTTATAAAGTGACCGCTCGCGCCTATGACC
GAAATGGTAATAGTTCTAATAATGTACAGCTCACTATTACCGTTTTACCGAATGGGCAGGTTGTGGACCAGGTTGGGGTA
ACGGACTTTACGGCTGATAAGACATCGGCTAAAGCGGATAACGTTGATACCATTACTTATACCGCGACGGTTAAAAAGAA
TGGTGTAGCTCAGGCTAATGCCCCTGTAACATTTAGTATTGTATCCGGGACTGCAACTCTCGGGGCAAATAGTGCCAAAA
CGGATGGTAACGGTAAGGCAACCGTAACGTTGAAGTCGGGTACGCCAGGGCAGGTCGTCGTGTCTGCTAAAACCGCGGAG
ATGACTTCGCCACTTAATGCCAGTGCGGTTATATTTGTTGATCAAACCAAGGCCAGCATTACTGAGATTAAGGCTGATAA
AACAACAGCGAAGGCAAATGGTTCTGATGCGATTACCTATATTGTTAAAGTAATGAAGAATAACCAACCAGAAGCAAACC
ATTCTGTTACATTCTCAACGAACTTTGGTAATCTGGGGGGAAATTCTAATACCCAAATTGTGAAAACGGATAAAGATGGT
AGGGCTACGGTAAAACTGACATCTGGCGTTGCAGGTAATGCTGTTGTTAGTGCAAAAGTCAGCGAAGTTAATACAGAGGT
TAAGGCTCCTGAGGTAAAATTCTTCTCAGTTCTGAGCATTGATAGTAATGTGAGTATTATTGGAACCTCCGCTAATGGCG
CTTTACCTAATATTTGGTTGCGATATGGTCAGTTTAAGCTGACAGCCAAAGGTGGCGATGGGAAATATCAATGGCGCTCT
CAAGATCCAAGTGTTGCATCAGTTGATGCTTTAACTGGTCGAGTTACTTTGCTGAAGAAAGGAACAACAACAATTGAAGT
TGTATCGGGTGATAACCAAACAGCAATGTATACAATTAATACACCTACAAAATTTATATCTGTGGAGACACAAAATAAAG
TAGTCTATAGTGATGCTGAGGCAACATGTAGAATGAATAATGCACGCTTGCCGTCATCTACGAGTGAGCTAAAGGATGTG
TATAATAAATGGGGCGCCGCCAATAGTTATGAAGGCTATAAAGGTAAAAAAACAATAACAGCATGGACACAGCAAACTGA
GGATGATAAACAAAAAGGTTGGACTAGTACATTTGACATAGTTACAAAAAATGAAATCCCTAGTAATGGCAGTAATAGTA
AAGTCCACGTGAATAAAGCTAACGCTTTTGCCGTCTGTGTAAGATGA

Protein sequence :
MITHGFYTRTRHKHKLKKTFIMLSAGLGLFFYVNQNSFANGENYFKLSSDSKLLTQNAAQDRLFYTLKTGETVANISKSQ
GISLSVIWSLNKHLYSSESEMMKAGPGQQIILPLKKLSVEYSALPVLGSAPVVAAGGVAGHTNKMTKMSPDATKSNTTDD
KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGMASSQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSENM
LAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLPNGQVVDQVGV
TDFTADKTSAKADNVDTITYTATVKKNGVAQANAPVTFSIVSGTATLGANSAKTDGNGKATVTLKSGTPGQVVVSAKTAE
MTSPLNASAVIFVDQTKASITEIKADKTTAKANGSDAITYIVKVMKNNQPEANHSVTFSTNFGNLGGNSNTQIVKTDKDG
RATVKLTSGVAGNAVVSAKVSEVNTEVKAPEVKFFSVLSIDSNVSIIGTSANGALPNIWLRYGQFKLTAKGGDGKYQWRS
QDPSVASVDALTGRVTLLKKGTTTIEVVSGDNQTAMYTINTPTKFISVETQNKVVYSDAEATCRMNNARLPSSTSELKDV
YNKWGAANSYEGYKGKKTITAWTQQTEDDKQKGWTSTFDIVTKNEIPSNGSNSKVHVNKANAFAVCVR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
eae CAI43865.1 intimin epsilon Virulence LEE Protein 0.0 100
eae YP_003223466.1 intimin epsilon Not tested LEE Protein 0.0 100
eae AAK26724.1 intimin Virulence LEE Protein 0.0 85
eae AAL57551.1 Eae Virulence LEE Protein 0.0 85
eae CAC81871.1 Intimin Not tested LEE II Protein 0.0 85
eae YP_003232162.1 intimin Not tested LEE Protein 0.0 85
eae YP_003236079.1 theta intimin Not tested LEE Protein 0.0 83
ECs4559 NP_312586.1 gamma intimin Virulence LEE Protein 0.0 82
eaeA AAC31504.1 L0025 Virulence LEE Protein 0.0 82
unnamed ACU09449.1 gamma intimin Virulence LEE Protein 0.0 82
eae NP_290259.1 intimin adherence protein Virulence LEE Protein 0.0 82
eae AAC38392.1 intimin Virulence LEE Protein 0.0 81
unnamed AAL06378.1 intimin Virulence LEE Protein 0.0 78

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
eae YP_003223466.1 intimin epsilon VFG0803 Protein 0.0 82
eae YP_003223466.1 intimin epsilon VFG0739 Protein 0.0 81