Gene Information

Name : ECO55CA74_21185 (ECO55CA74_21185)
Accession : YP_006161342.1
Strain : Escherichia coli RM12579
Genome accession: NC_017656
Putative virulence/resistance : Virulence
Product : Gamma intimin
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4392401 - 4395205 bp
Length : 2805 bp
Strand : -
Note : -
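
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (so length = end - start + 1) and that the last codon is a stop codon; the variable and function names are illustrative only and are not part of the database.

```python
# Coordinates and length as reported in the record above.
start, end = 4392401, 4395205
reported_length = 2805


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


gene_length = span_length(start, end)

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed gene length agrees with the reported 2805 bp, and the expected protein length is 934 residues.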

DNA sequence :
ATGATTACTCATGGTTGTTATACCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTGATTATGCTTAGTGCTGGTTT
AGGATTGTTTTTTTATGTTAATCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGGGTTCGGATTCAAAACTGT
TAACTCATGATAGCTATCAGAATCGCCTTTTTTATACGTTGAAAACTGGTGAAACTGTTGCCGATCTTTCTAAATCGCAA
GATATTAATTTATCGACGATTTGGTCGTTGAATAAGCATTTATACAGTTCTGAAAGCGAAATGATGAAGGCCGCGCCTGG
TCAGCAGATCATTTTGCCACTCAAAAAACTTCCCTTTGAATACAGTGCACTACCACTTTTAGGTTCGGCACCTCTTGTTG
CTGCAGGTGGTGTTGCTGGTCACACGAATAAACTGACTAAAATGTCCCCGGACGTGACCAAAAGCAACATGACCGATGAC
AAGGCATTAAATTATGCGGCACAACAGGCGGCGAGTCTCGGTAGCCAGCTTCAGTCGCGATCTCTGAACGGCGATTACGC
GAAAGATACCGCTCTTGGTATCGCTGGTAACCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG
AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATTACCGTTCTATGATTCCGAAAAAATG
CTGGCATTTGGTCAGGTCGGAGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCGGGTCAGCGTTTTTTCCT
TCCTGCAAACATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG
AATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA
GACTATGATGAGCGCCCAGCAAATGGCTTCGATATCCGTTTTAATGGCTATCTACCGTCATATCCGGCATTAGGCGCCAA
GCTGATATATGAGCAGTATTATGGTGATAATGTTGCTTTGTTTAATTCTGATAAGCTGCAGTCGAATCCTGGTGCGGCGA
CCGTTGGTGTAAACTATACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT
CTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGATAAATCGTGGTCTCAGCAAATTGAACCACAGTATGTTAACGAGTT
AAGAACATTATCAGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAGAAGCAGGATATTC
TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGATTCAGTTGATCGTTAAGAGCAAATAC
GGTCTGGATCGTATCGTCTGGGATGATAGTGCATTACGCAGTCAGGGCGGTCAGATTCAGCATAGCGGAAGCCAAAGCGC
ACAAGACTACCAGGCTATTTTGCCTGCTTATGTGCAAGGTGGCAGCAATATTTATAAAGTGACGGCTCGCGCCTATGACC
GTAATGGCAATAGCTCTAACAATGTACAGCTTACTATTACCGTTCTGTCGAATGGTCAAGTTGTCGACCAGGTTGGGGTA
ACGGACTTTACGGCGGATAAGACTTCGGCTAAAGCGGATAACGCCGATACCATTACTTATACCGCGACGGTGAAAAAGAA
TGGGGTAGCTCAGGCTAATGTCCCTGTTTCATTTAATATTGTTTCAGGAACTGCAACTCTTGGGGCAAATAGTGCCAAAA
CGGATGCTAACGGTAAGGCAACCGTAACGTTGAAGTCGAGTACGCCAGGACAGGTCGTCGTGTCTGCTAAAACCGCGGAG
ATGACTTCAGCACTTAATGCCAGTGCGGTTATATTTTTTGATCAAACCAAGGCCAGCATTACTGAGATTAAGGCTGATAA
GACAACTGCAGTAGCAAATGGTAAGGATGCTATTAAATATACTGTAAAAGTTATGAAAAACGGTCAGCCAGTTAATAATC
AATCCGTTACATTCTCAACAAACTTTGGGATGTTCAACGGTAAGTCTCAAACGCAAGCAACCACGGGAAATGATGGTCGT
GCGACGATAACACTAACTTCCAGTTCCGCCGGTAAAGCGACTGTTAGTGCGACAGTCAGTGATGGGGCTGAGGTTAAAGC
GACTGAGGTCACTTTTTTTGATGAACTGAAAATTGACAACAAGGTTGATATTATTGGTAACAATGTCAGAGGCGAGTTGC
CTAATATTTGGCTGCAATATGGTCAGTTTAAACTGAAAGCAAGCGGTGGTGATGGTACATATTCATGGTATTCAGAAAAT
ACCAGTATCGCGACTGTCGATGCATCAGGGAAAGTCACTTTGAATGGTAAAGGCAGTGTCGTAATTAAAGCCACATCTGG
TGATAAGCAAACAGTAAGTTACACTATAAAAGCACCGTCGTATATGATAAAAGTGGATAAGCAAGCCTATTATGCTGATG
CTATGTCCATTTGCAAAAATTTATTACCATCCACACAGACGGTATTGTCAGATATTTATGACTCATGGGGGGCTGCAAAT
AAATATAGCCATTATAGTTCTATGAACTCAATAACTGCTTGGATTAAACAGACATCTAGTGAGCAGAGTTCTGGAGTATC
AAGCACTTATAACCTAATAACACAAAACCCTCTTCCTAGGGTTAATGTTAATACTCCAAATGTCTATGCGGTTTGTGTAG
AATAA

Protein sequence :
MITHGCYTRTRHKHKLKKTLIMLSAGLGLFFYVNQNSFANGENYFKLGSDSKLLTHDSYQNRLFYTLKTGETVADLSKSQ
DINLSTIWSLNKHLYSSESEMMKAAPGQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDD
KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSEKM
LAFGQVGARYIDSRFTANLGAGQRFFLPANMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLIYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQFRYQFDKSWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLSNGQVVDQVGV
TDFTADKTSAKADNADTITYTATVKKNGVAQANVPVSFNIVSGTATLGANSAKTDANGKATVTLKSSTPGQVVVSAKTAE
MTSALNASAVIFFDQTKASITEIKADKTTAVANGKDAIKYTVKVMKNGQPVNNQSVTFSTNFGMFNGKSQTQATTGNDGR
ATITLTSSSAGKATVSATVSDGAEVKATEVTFFDELKIDNKVDIIGNNVRGELPNIWLQYGQFKLKASGGDGTYSWYSEN
TSIATVDASGKVTLNGKGSVVIKATSGDKQTVSYTIKAPSYMIKVDKQAYYADAMSICKNLLPSTQTVLSDIYDSWGAAN
KYSHYSSMNSITAWIKQTSSEQSSGVSSTYNLITQNPLPRVNVNTPNVYAVCVE
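
The relationship between the DNA and protein sequences above can be checked by translating the DNA codon by codon. The following is a minimal sketch, not the database's own procedure: it uses the standard codon assignments (which the bacterial genetic code shares), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration. Only the first 30 and last 15 nucleotides of the record are used as inputs.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 15 nucleotides of the DNA sequence in this record.
first = translate("ATGATTACTCATGGTTGTTATACCCGGACC")
last = translate("GTTTGTGTAGAATAA")

print(first)  # MITHGCYTRT
print(last)   # VCVE*
```

Both fragments match the start (MITHGCYTRT) and end (VCVE) of the listed protein sequence, with the final TAA codon acting as the stop.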

• Homologs from PAI DB

Gene     GenBank Accn    Product                    Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
eae      NP_290259.1     intimin adherence protein  Virulence                LEE         Protein         0.0    99
ECs4559  NP_312586.1     gamma intimin              Virulence                LEE         Protein         0.0    99
eaeA     AAC31504.1      L0025                      Virulence                LEE         Protein         0.0    99
unnamed  ACU09449.1      gamma intimin              Virulence                LEE         Protein         0.0    99
eae      YP_003236079.1  theta intimin              Not tested               LEE         Protein         0.0    90
eae      AAK26724.1      intimin                    Virulence                LEE         Protein         0.0    83
eae      AAL57551.1      Eae                        Virulence                LEE         Protein         0.0    83
eae      YP_003232162.1  intimin                    Not tested               LEE         Protein         0.0    83
eae      CAC81871.1      Intimin                    Not tested               LEE II      Protein         0.0    83
eae      AAC38392.1      intimin                    Virulence                LEE         Protein         0.0    83
eae      YP_003223466.1  intimin epsilon            Not tested               LEE         Protein         0.0    82
eae      CAI43865.1      intimin epsilon            Virulence                LEE         Protein         0.0    82
unnamed  AAL06378.1      intimin                    Virulence                LEE         Protein         0.0    78
eae      AFO66392.1      intimin-like protein       Virulence                SESS LEE    Protein         0.0    60
eae      AFO66294.1      intimin-like protein       Not tested               SESS LEE    Protein         0.0    59

• Homologs from VFDB (virulence genes)

Gene             GenBank Accn    Product        ID of source DB  Alignment Type  E-val  Identity (%)
ECO55CA74_21185  YP_006161342.1  Gamma intimin  VFG0803          Protein         0.0    99
ECO55CA74_21185  YP_006161342.1  Gamma intimin  VFG0739          Protein         0.0    83