Gene Information

Name : CDCO157_4293 (CDCO157_4293)
Accession : YP_006315159.1
Strain : Escherichia coli Xuzhou21
Genome accession: NC_017906
Putative virulence/resistance : Virulence
Product : gamma intimin
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4523434 - 4526238 bp
Length : 2805 bp
Strand : -
Note : -

DNA sequence :
ATGATTACTCATGGTTGTTATACCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTGATTATGCTTAGTGCTGGTTT
AGGATTGTTTTTTTATGTTAATCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGGGTTCGGATTCAAAACTGT
TAACTCATGATAGCTATCAGAATCGCCTTTTTTATACGTTGAAAACTGGTGAAACTGTTGCCGATCTTTCTAAATCGCAA
GATATTAATTTATCGACGATTTGGTCGTTGAATAAGCATTTATACAGTTCTGAAAGCGAAATGATGAAGGCCGCGCCTGG
TCAGCAGATCATTTTGCCACTCAAAAAACTTCCCTTTGAATACAGTGCACTACCACTTTTAGGTTCGGCACCTCTTGTTG
CTGCAGGTGGTGTTGCTGGTCACACGAATAAACTGACTAAAATGTCCCCGGACGTGACCAAAAGCAACATGACCGATGAC
AAGGCATTAAATTATGCGGCACAACAGGCGGCGAGTCTCGGTAGCCAGCTTCAGTCGCGATCTCTGAACGGCGATTACGC
GAAAGATACCGCTCTTGGTATCGCTGGTAACCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG
AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATTACCGTTCTATGATTCCGAAAAAATG
CTGGCATTTGGTCAGGTCGGAGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCGGGTCAGCGTTTTTTCCT
TCCTGCAAACATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG
AATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA
GACTATGATGAGCGCCCAGCAAATGGCTTCGATATCCGTTTTAATGGCTATCTACCGTCATATCCGGCATTAGGCGCCAA
GCTGATATATGAGCAGTATTATGGTGATAATGTTGCTTTGTTTAATTCTGATAAGCTGCAGTCGAATCCTGGTGCGGCGA
CCGTTGGTGTAAACTATACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT
CTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGATAAATCGTGGTCTCAGCAAATTGAACCACAGTATGTTAACGAGTT
AAGAACATTATCAGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAGAAGCAGGATATTC
TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGATTCAGTTGATCGTTAAGAGCAAATAC
GGTCTGGATCGTATCGTCTGGGATGATAGTGCATTACGCAGTCAGGGCGGTCAGATTCAGCATAGCGGAAGCCAAAGCGC
ACAAGACTACCAGGCTATTTTGCCTGCTTATGTGCAAGGTGGCAGCAATATTTATAAAGTGACGGCTCGCGCCTATGACC
GTAATGGCAATAGCTCTAACAATGTACAGCTTACTATTACCGTTCTGTCGAATGGTCAAGTTGTCGACCAGGTTGGGGTA
ACGGACTTTACGGCGGATAAGACTTCGGCTAAAGCGGATAACGCCGATACCATTACTTATACCGCGACGGTGAAAAAGAA
TGGGGTAGCTCAGGCTAATGTCCCTGTTTCATTTAATATTGTTTCAGGAACTGCAACTCTTGGGGCAAATAGTGCCAAAA
CGGATGCTAACGGTAAGGCAACCGTAACGTTGAAGTCGAGTACGCCAGGACAGGTCGTCGTGTCTGCTAAAACCGCGGAG
ATGACTTCAGCACTTAATGCCAGTGCGGTTATATTTTTTGATCAAACCAAGGCCAGCATTACTGAGATTAAGGCTGATAA
GACAACTGCAGTAGCAAATGGTAAGGATGCTATTAAATATACTGTAAAAGTTATGAAAAACGGTCAGCCAGTTAATAATC
AATCCGTTACATTCTCAACAAACTTTGGGATGTTCAACGGTAAGTCTCAAACGCAAGCAACCACGGGAAATGATGGTCGT
GCGACGATAACACTAACTTCCAGTTCCGCCGGTAAAGCGACTGTTAGTGCGACAGTCAGTGATGGGGCTGAGGTTAAAGC
GACTGAGGTCACTTTTTTTGATGAACTGAAAATTGACAACAAGGTTGATATTATTGGTAACAATGTCAGAGGCGAGTTGC
CTAATATTTGGCTGCAATATGGTCAGTTTAAACTGAAAGCAAGCGGTGGTGATGGTACATATTCATGGTATTCAGAAAAT
ACCAGTATCGCGACTGTCGATGCATCAGGGAAAGTCACTTTGAATGGTAAAGGCAGTGTCGTAATTAAAGCCACATCTGG
TGATAAGCAAACAGTAAGTTACACTATAAAAGCACCGTCGTATATGATAAAAGTGGATAAGCAAGCCTATTATGCTGATG
CTATGTCCATTTGCAAAAATTTATTACCATCCACACAGACGGTATTGTCAGATATTTATGACTCATGGGGGGCTGCAAAT
AAATATAGCCATTATAGTTCTATGAACTCAATAACTGCTTGGATTAAACAGACATCTAGTGAGCAGCGTTCTGGAGTATC
AAGCACTTATAACCTAATAACACAAAACCCTCTTCCTGGGGTTAATGTTAATACTCCAAATGTCTATGCGGTTTGTGTAG
AATAA

Protein sequence :
MITHGCYTRTRHKHKLKKTLIMLSAGLGLFFYVNQNSFANGENYFKLGSDSKLLTHDSYQNRLFYTLKTGETVADLSKSQ
DINLSTIWSLNKHLYSSESEMMKAAPGQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDD
KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSEKM
LAFGQVGARYIDSRFTANLGAGQRFFLPANMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLIYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQFRYQFDKSWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLSNGQVVDQVGV
TDFTADKTSAKADNADTITYTATVKKNGVAQANVPVSFNIVSGTATLGANSAKTDANGKATVTLKSSTPGQVVVSAKTAE
MTSALNASAVIFFDQTKASITEIKADKTTAVANGKDAIKYTVKVMKNGQPVNNQSVTFSTNFGMFNGKSQTQATTGNDGR
ATITLTSSSAGKATVSATVSDGAEVKATEVTFFDELKIDNKVDIIGNNVRGELPNIWLQYGQFKLKASGGDGTYSWYSEN
TSIATVDASGKVTLNGKGSVVIKATSGDKQTVSYTIKAPSYMIKVDKQAYYADAMSICKNLLPSTQTVLSDIYDSWGAAN
KYSHYSSMNSITAWIKQTSSEQRSGVSSTYNLITQNPLPGVNVNTPNVYAVCVE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
eae NP_290259.1 intimin adherence protein Virulence LEE Protein 0.0 100
ECs4559 NP_312586.1 gamma intimin Virulence LEE Protein 0.0 100
eaeA AAC31504.1 L0025 Virulence LEE Protein 0.0 100
unnamed ACU09449.1 gamma intimin Virulence LEE Protein 0.0 100
eae YP_003236079.1 theta intimin Not tested LEE Protein 0.0 90
eae AAK26724.1 intimin Virulence LEE Protein 0.0 83
eae AAL57551.1 Eae Virulence LEE Protein 0.0 83
eae YP_003232162.1 intimin Not tested LEE Protein 0.0 83
eae CAC81871.1 Intimin Not tested LEE II Protein 0.0 83
eae AAC38392.1 intimin Virulence LEE Protein 0.0 83
eae YP_003223466.1 intimin epsilon Not tested LEE Protein 0.0 82
eae CAI43865.1 intimin epsilon Virulence LEE Protein 0.0 82
unnamed AAL06378.1 intimin Virulence LEE Protein 0.0 78
eae AFO66392.1 intimin-like protein Virulence SESS LEE Protein 0.0 60
eae AFO66294.1 intimin-like protein Not tested SESS LEE Protein 0.0 59

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
CDCO157_4293 YP_006315159.1 gamma intimin VFG0803 Protein 0.0 100
CDCO157_4293 YP_006315159.1 gamma intimin VFG0739 Protein 0.0 83