Gene Information

Name : eae (ECO26_5280)
Accession : YP_003232162.1
Strain : Escherichia coli 11368
Genome accession: NC_013361
Putative virulence/resistance : Virulence
Product : intimin
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 5354665 - 5357484 bp
Length : 2820 bp
Strand : +
Note : Integrative element ECO26_IE08

DNA sequence :
ATGATTACTCATGGTTTTTATGCCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTTATTATGCTTAGTGCTGGTTT
AGGATTGTTTTTTTATGTTAACCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGAGTTCAGATTCAAAACTGT
TAACTCAAAATGCCGCTCAGGATCGCCTTTTTTATACGTTAAAAACAGGTGAAACTGTTGCCAATATTTCTAAATCACAG
GGTATCAGTTTATCGGTAATTTGGTCACTGAATAAACATTTATACAGTTCCGAAAGCGAAATGATGAAGGCTGGACCTGG
TCAGCAGATCATTTTGCCACTCAAAAAACTGTCTGTTGAATATAGTGCCTTACCTGTCTTAGGTTCGGCACCTGTTGTTG
CTGCAGGTGGTGTCGCTGGTCATACGAATAAAATGACTAAAATGTCCCCGGACGCGACTAAAAGCAACACGACCGATGAC
AAGGCTCTAAATTATGCGGCACAACAGGCCGCGAGCCTTGGTAGCCAGCTCCAGTCGCGCTCACTGAACGGCGATTACGC
GAAAGATACCGCTCTTGGTATGGCCAGCAGCCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG
AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATTACCGTTCTATGATTCCGAAAACATG
CTGGCATTTGGTCAGGTCGGTGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCTGGCCAGCGTTTTTTCCT
TCCTGAAAATATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG
AATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA
GACTATGATGAGCGCCCGGCAAATGGTTTTGATATCCGCTTTAATGGCTATTTACCATCATATCCGGCATTAGGCGCCAA
ACTGATGTACGAACAGTATTATGGTGATAATGTTGCTTTGTTTAATTCCGATAAGTTGCAGTCGAATCCTGGCGCGGCGA
CCGTTGGTGTAAACTACACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT
CTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGATAAACCGTGGTCTCAGCAAATCGAGCCACAGTATGTTAACGAGTT
AAGAACATTATCGGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAAAAGCAGGATATTC
TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGATTCAATTGATCGTTAAGAGCAAATAC
GGTCTGGATCGTATCGTCTGGGATGATAGCGCATTACGCAGTCAGGGCGGTCAGATTCAGCATGGCGGAAGCCAAAGCGC
ACAAGACTACCAGGCTATTTTGCCTGCTTATGTGCAAGGCGGCAGCAATATTTATAAAGTGACCGCTCGCGCCTATGACC
GAAATGGTAATAGTTCTAATAATGTACAGCTCACTATTACCGTTTTACCGAATGGGCAGGTTGTGGACCAGGTTGGGGTA
ACGGACTTTACGGCTGATAAAACATCGGCTAAAGCGGATGGCATAGAAGCTATTACCTATACCGCGACGGTTAAAAAGAA
TGGTGTAGCTCAGGCTAATGTCCCTGTAACATTTAGTATTGTATCCGGGACTGCAACTCTTGGGGCAAATAGTGCCAGAA
CGGATGGTAACGGTAAGGCGACCGTAACGCTGAAGTCGGCTACGCCAGGACAGGTCGTCGTGTCTGCTAAAACCGCGGAG
ATGACTTCGCCACTTAATGCCAGCGCGGTTATATTTGTTGATCAAACCAAGGCCAGTATTACTGAGATTAAGGCTGATAA
AACAACAGCGAAGGCAGATGGTTCTGATGCGATTACCTATACTGTCAGAGTGATGAAGGAGGGGGCACCCGTAGTAGATC
AGAAAGTGACCTTTTCTAAGGATTTTGGGACCCTGAATAAGACTGAAGCAACAACCGATCAGAATGGTTATGCTACTGTA
AAATTATCATCCAATACTCCTGGCAAGGCCATTGTTAGTGCAAAAGTGAGTGGAGTAGGTACAGAAGTTAAGGCTACTAC
CGTTGAGTTTTTTGCCCCGTTGAGTATTGATGGTGATAAAGTGACCGTAATTGGTACTGGTATCACGGGGGCTCTGCCAA
AGAACTGGTTACAGTATGGTCAGGTTAAGCTACAGGCAACAGGGGGCAATGGAAAATACACATGGAAATCCAGTAATACT
AAAATTGCTTCTGTTGATAACTCGGGAGTGATAACCTTAAATGAAAAAGGGAGTGCCACAATTACTGTAGTATCTGGTGA
TAATCAGAGTGCGACATACACAATTAATGCACCGGGTAGTATTGTAATTGCTGTGGATAAAAATACTCGAGTTACGTATT
TTGATGCCGAAAACAAATGTAAGACAAATAGCGCAAATTTAGCACAGTCAAAAGAACTATTGGCCAATATCTATTCAACA
TGGGGTGCTGCAAATAAATATCCTTACTATTCTGGTTCTAAATCATTGACTGCTTGGATTAAACAATCCTCTTCTGAACA
GTCATCAGGTGTATCAAGCACATATGATTTGGTTACGAAGAACCAGTTGATCAATGTTGGAGTAAACAATAAGAATGCTT
TTTCTGTTTGTGTAAAATAA

Protein sequence :
MITHGFYARTRHKHKLKKTFIMLSAGLGLFFYVNQNSFANGENYFKLSSDSKLLTQNAAQDRLFYTLKTGETVANISKSQ
GISLSVIWSLNKHLYSSESEMMKAGPGQQIILPLKKLSVEYSALPVLGSAPVVAAGGVAGHTNKMTKMSPDATKSNTTDD
KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGMASSQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSENM
LAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHGGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLPNGQVVDQVGV
TDFTADKTSAKADGIEAITYTATVKKNGVAQANVPVTFSIVSGTATLGANSARTDGNGKATVTLKSATPGQVVVSAKTAE
MTSPLNASAVIFVDQTKASITEIKADKTTAKADGSDAITYTVRVMKEGAPVVDQKVTFSKDFGTLNKTEATTDQNGYATV
KLSSNTPGKAIVSAKVSGVGTEVKATTVEFFAPLSIDGDKVTVIGTGITGALPKNWLQYGQVKLQATGGNGKYTWKSSNT
KIASVDNSGVITLNEKGSATITVVSGDNQSATYTINAPGSIVIAVDKNTRVTYFDAENKCKTNSANLAQSKELLANIYST
WGAANKYPYYSGSKSLTAWIKQSSSEQSSGVSSTYDLVTKNQLINVGVNNKNAFSVCVK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
eae AAK26724.1 intimin Virulence LEE Protein 0.0 100
eae AAL57551.1 Eae Virulence LEE Protein 0.0 100
eae YP_003232162.1 intimin Not tested LEE Protein 0.0 100
eae CAC81871.1 Intimin Not tested LEE II Protein 0.0 99
unnamed AAL06378.1 intimin Virulence LEE Protein 0.0 87
eae YP_003223466.1 intimin epsilon Not tested LEE Protein 0.0 85
eae CAI43865.1 intimin epsilon Virulence LEE Protein 0.0 85
ECs4559 NP_312586.1 gamma intimin Virulence LEE Protein 0.0 83
eaeA AAC31504.1 L0025 Virulence LEE Protein 0.0 83
unnamed ACU09449.1 gamma intimin Virulence LEE Protein 0.0 83
eae NP_290259.1 intimin adherence protein Virulence LEE Protein 0.0 83
eae YP_003236079.1 theta intimin Not tested LEE Protein 0.0 83
eae AAC38392.1 intimin Virulence LEE Protein 0.0 83

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
eae YP_003232162.1 intimin VFG0803 Protein 0.0 83
eae YP_003232162.1 intimin VFG0739 Protein 0.0 83