Gene Information

Name : eaeA (E2348C_3939)
Accession : YP_002331401.1
Strain : Escherichia coli E2348/69
Genome accession: NC_011601
Putative virulence/resistance : Virulence
Product : intimin EaeA
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4110343 - 4113162 bp
Length : 2820 bp
Strand : -
Note : -

DNA sequence :
ATGATTACTCATGGTTTTTATGCCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTTATTATGCTTAGTGCTGGTTT
AGGATTGTTTTTTTATGTTAATCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGGGTTCGGATTCAAAACTGT
TAACTCATAATAGCTATCAGAATCGCCTTTTTTATACGTTGAAAACAGGTGAAACTGTTGCCGATCTTTCTAAATCGCAA
GATATTAATTTATCGACGATTTGGTCGTTGAATAAGCATTTATACAGTTCTGAAAGCGAAATGATGAAGGCCGCGCCTGG
TCAGCAGATCATTTTGCCACTCAAAAAACTTCCCTTTGAATACAGTGCCTTACCACTTTTAGGTTCGGCACCTCTTGTTG
CTGCAGGTGGTGTCGCTGGTCATACAAATAAACTGACTAAAATGTCCCCGGACGTGACCAAAAGCAACATGACCGATGAC
AAGGCATTAAATTATGCGGCACAACAGGCGGCGAGTCTCGGTAGCCAGCTTCAGTCGCGATCTCTGAACGGCGATTACGC
GAAAGATACCGCTCTTGGTATCGCTGGTAACCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG
AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATTACCGTTCTATGATTCCGAAAAAATG
CTGGCATTTGGTCAGGTCGGAGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCGGGTCAGCGTTTTTTCCT
TCCTGAAAATATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG
AATACTGGCGAGACTATTTCAAAAGTAGTGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA
GACTATGATGAGCGCCCAGCAAATGGCTTCGATATCCGTTTTAATGGCTATCTGCCATCATACCCGGCATTAGGTGCCAA
GCTGATGTATGAGCAGTATTATGGTGATAATGTTGCTTTGTTTAATTCTGATAAGCTGCAGTCGAATCCTGGTGCGGCGA
CCGTTGGTGTAAACTATACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT
CTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGATAAACCGTGGTCTCAGCAAATTGAGCCACAATATGTTAACGAGTT
AAGAACATTATCAGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAAAAGCAGGATATTC
TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACGCAGTACGCAGAAGATTCAATTGATCGTTAAGAGCAAATAC
GGTCTGGATCGTATCGTCTGGGATGATAGTGCATTACGTAGCCAGGGCGGCCAGATTCAGCATAGCGGAAGCCAAAGCGC
ACAAGATTACCAGGCTATTTTGCCTGCTTATGTGCAAGGTGGTAGCAATGTTTATAAAGTGACGGCTCGCGCCTATGACC
GTAATGGCAATAGCTCTAACAATGTACTGCTTACTATTACCGTTCTGTCGAATGGTCAGGTGGTCGACCAGGTTGGGGTA
ACGGACTTTACGGCTGATAAGACTTCGGCTAAAGCGGATGGCACCGAAGCAATTACTTATACTGCGACGGTGAAAAAGAA
TGGGGTAGCTCAGGCTAATGTCCCTGTTTCATTTAATATTGTTTCAGGAACTGCAGTTTTAAGTGCCAATAGTGCCAATA
CCAATGGTAGCGGTAAGGCGACTGTAACCCTGAAATCGGATAAACCAGGCCAGGTCGTCGTGTCTGCTAAAACAGCAGAG
ATGACTTCAGCGCTTAATGCCAATGCAGTTATATTTGTTGATCAAACCAAGGCCAGCATTACTGAGATTAAGGCTGATAA
AACAACGGCAGTAGCAAATGGTCAGGATGCTATTACATACACTGTTAAAGTGATGAAGGGGGATAAGCCTGTATCTAATC
AGGAAGTGACCTTTACGACGACCTTAGGTAAGTTAAGTAATTCCACTGAAAAAACGGATACGAATGGCTATGCCAAAGTA
ACATTAACATCGACAACTCCAGGAAAATCACTCGTTAGTGCCCGTGTTAGCGATGTCGCCGTTGATGTCAAAGCACCTGA
AGTTGAATTTTTTACAACGCTTACAATTGATGACGGTAATATTGAAATTGTTGGAACCGGAGTTAAAGGGAAGTTACCCA
CTGTATGGTTGCAATATGGTCAAGTTAATCTGAAAGCCAGCGGAGGTAACGGAAAATATACATGGCGCTCAGCAAATCCA
GCAATTGCTTCGGTGGATGCTTCTTCTGGTCAGGTCACCTTAAAAGAGAAGGGAACTACAACTATTTCCGTTATCTCAAG
TGATAATCAAACTGCAACTTATACTATTGCAACACCTAATAGTCTGATTGTTCCTAATATGAGCAAGCGTGTGACCTATA
ATGATGCTGTGAATACATGTAAGAATTTTGGAGGAAAGTTGCCGTCTTCTCAGAATGAACTGGAAAATGTCTTTAAAGCA
TGGGGGGCTGCAAATAAATATGAATATTATAAGTCTAGTCAGACTATAATTTCATGGGTACAACAAACAGCTCAAGATGC
GAAGAGTGGTGTTGCAAGTACATACGATTTAGTTAAACAAAACCCTCTGAATAATATTAAGGCTAGTGAATCTAATGCTT
ATGCCACTTGTGTAAAATAA

Protein sequence :
MITHGFYARTRHKHKLKKTFIMLSAGLGLFFYVNQNSFANGENYFKLGSDSKLLTHNSYQNRLFYTLKTGETVADLSKSQ
DINLSTIWSLNKHLYSSESEMMKAAPGQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDD
KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSEKM
LAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGV
TDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAE
MTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKV
TLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANP
AIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKA
WGAANKYEYYKSSQTIISWVQQTAQDAKSGVASTYDLVKQNPLNNIKASESNAYATCVK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
eae AAC38392.1 intimin Virulence LEE Protein 0.0 99
eaeA AAC31504.1 L0025 Virulence LEE Protein 0.0 83
unnamed ACU09449.1 gamma intimin Virulence LEE Protein 0.0 83
eae NP_290259.1 intimin adherence protein Virulence LEE Protein 0.0 83
ECs4559 NP_312586.1 gamma intimin Virulence LEE Protein 0.0 83
eae AAL57551.1 Eae Virulence LEE Protein 0.0 83
eae CAC81871.1 Intimin Not tested LEE II Protein 0.0 83
eae AAK26724.1 intimin Virulence LEE Protein 0.0 83
eae YP_003232162.1 intimin Not tested LEE Protein 0.0 83
eae YP_003236079.1 theta intimin Not tested LEE Protein 0.0 81
eae CAI43865.1 intimin epsilon Virulence LEE Protein 0.0 81
eae YP_003223466.1 intimin epsilon Not tested LEE Protein 0.0 81
unnamed AAL06378.1 intimin Virulence LEE Protein 0.0 79
eae AFO66294.1 intimin-like protein Not tested SESS LEE Protein 0.0 59
eae AFO66392.1 intimin-like protein Virulence SESS LEE Protein 0.0 59

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
eaeA YP_002331401.1 intimin EaeA VFG0739 Protein 0.0 99
eaeA YP_002331401.1 intimin EaeA VFG0803 Protein 0.0 83