Gene Information

Name : ECIAI1_0547 (ECIAI1_0547)
Accession : YP_002386021.1
Strain : Escherichia coli IAI1
Genome accession: NC_011741
Putative virulence/resistance : Unknown
Product : Rhs core protein with extension
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 589204 - 594003 bp
Length : 4800 bp
Strand : -
Note : Evidence 2b : Function of strongly homologous gene; PubMedId : 1766878, 2644231, 2403547, 7934896

DNA sequence :
ATGAGTGAAGGACCAGGCGGGCCACAGGGAGCGACCGCAGGCGGTACGCTGGCAATGCGAATGCTGTCACAGCAGGCGAT
GGTCGCCAGCCAGATGAAACGGGCAGCCAACGACAAAGCCATTGCACAGATGCTGGCAGCAAAGAAGTCCGGCCCACCTG
CCGCCAGGCTGGGCGATGAAATTCAGCATAAAAGTTTTCTGGGGGCACTGGCAGGGGCCGTGCTGGGGGCGATAGTGACC
ATCGCAGAAGGTTGCCTGATTATGGCCGCCTGTGCCACCGGCCCTTATGCGCTGGTTCTGGTGCCTGCGCTGATGTATGC
CAGCTATAAGGCGAGTGATTATGTGGAGGAGAAACAGAACCAGCTTGAGTCATGGATAAACAGCTTTTGTGACACGGACG
GCGCCATCAATACCGGTTCTGAAAATGTAAACATTAACGGAAAGCCCGCTGCCAGAGCCGCCGTCACCCTTCCCCCTCCT
CCCCCACCTGGAGCAATACCTGAAGTCCCACAGGGGGAACCCTCATGGGGTGATATTGCCACTGACCTGCTTGAATCGGC
AGCGGAAAAAGCAGTACCACTGGCGAAGGCCTGGGGGAACGCTGTTATCACCCTGACGGACAGCAATGCCGGTTTTATGG
ATCGCTTATGTGCAGGCACATCGCTTCTGTTTCCCGCCGGTCCGGTATTAATGGAGTTTGCCACCATGGTGGGCGGGCGT
GGCGAAATCAAAAAAGACGTGGATTTCCCGGAAGCCGGTGAGGACACGGCGCTCTGCGACAAGGAGAACAAACCACCGAG
GATAGCCCAGGGCAGCAGCAACGTCTTTATCAACAATCAGCCTGCCGCGCGCAAGGGCGACAAACTGGAGTGCAGCGCGG
CAATCGTGGAAGGTTCGCCGGACGTCTTTATTGGGGGTGAGCAGGTCACCTATCTGGATATCCAGCCGGAGTTCCCGCCA
TGGCAGAGAATGATCCTGGGAGGAATAACGATAGCCAGCTATCTTCTGCCGCCAGCAGGACTGCTGGGAAAACTGGGGAA
TCTGGCGAAACTGGGCAAACTGGGAAACCTGCTGGGGAAAAGCGGGAAGCTGCTGGGTGCAAAACTCGGCGCATTGCTGA
GCAGAACAAAAAATGCCCTGAAAAATACGTATAACGTCCTGAAAAAGTTTATCAAAGATCCGGTTGACCCGGTAACCGGC
GCGTACTGCGACGAACGTACTGACTTCACCCTGGGCCAGACACTCCCCCTCTCCTTCACCCGCTTCCACTGTTCGGTACT
GCCACTGCATGGCCTGACCGGCGTGGGCTGGAGCGATTCCTGGAGCGAATACGCCTGGGTGCGTGAACAGGGAAAGCGGG
TGGATATCATCAGCCTGGGAGCCACGCTGAACTTCGCCTTCGACGGTGAAAGTGATACGGCGGTTAACCCGTATCACGCC
CAGTACATCCTGCGCCGCCGTGATGATTACCTGGAGCTGTTCGACCGTGATGCCCTGAGCAGCCGCTTCTTTTATGACGC
CTTTCCGGGAATGCGCCTGCGCCACCCGGTGACTGACGATACCAGCGATGACCGCCTGGCACACAGCCCCGCAGACCGGA
TGTACATGCTGGGCGGGATGAGCGACACCGCCAGCAACCGCATCACGTTTGAGCGCGACAGCCAGTACCGGATCACAGGT
GTCAGTCACACCGACGGGATCCGGCTTAAACTGACGTACCATGCCAGCGGCTACCTGAAAGCCATTCACCGTACGGATAA
CGGCATACAGACGCTGGCGACCTACGAACAGGATGCGCGGGGGCGGCTGACAGAAGCGGATGCGCGGCTGGACTACCACC
TGTTTTATGAGTACGACGCTGCGGACCGGATCATCCGCTGGTCCGATAACGACCAGACGTGGAGCCGTTTCACCTACGAT
GAACAGGGCCGGTGCGTGACCGTCACCGGGGCGGAGGGCTATTACAACGCCACGCTGGACTATGGTGACGGCTGCACCAC
CCTGACGGACGGCAAGGGCACTCACCGTTATTACTATGATCCTGACGGCAATATTCTGCGGGAAGAAGCGCCGGACGGCA
GCACCACCACGTATGAATGGGATGAATTCCATCACCTGCTGGCCCGCCACTCCCCTGCCGGGCGGGTGGAGAAATTTGAA
TACAACGCCGCACACGGCCAGCTAAGCCGTTACACGGCAGCAGACGGTGCGGAGTGGCAGTACCGCTATGATGAGCGCGG
CCTGCTCAGCAACATCACCGACCCTGCCGGGCAGACGTGGACACAGCAGTGTGATGAACGCGGCCTGCCGGTAAGCCTGG
TGTCGCCACAGGGCGAAGAGACCCGGCTGGCATACACCGCTCAGGGGCTGCTGTCGGGAATATTCCGCCAGGATGAACGG
CGTCTGGGCATAGAGTACGACCACCACAACCGGCCGGAAACACTCACCGACGTGATGGGCCGTGAACACCACACCGAATA
CAGCGGTCACGACCTGCCGGTGAAGATGCGCGGCCCCGGCGGTCAGTCAGTGCGGTTGCAGTGGCAGCAGCACCATAAAC
TGAGCGGCCTTGAGCGCACTGGCACCGGCGCGGAAGGATTCCGCTATGACCGCCACGGCAACCTGCTGGCCTGGACGGAC
GGCAACGGTGTTGTCTGGACGATGGAATACGGTCCGTTCGATTTGCCGGTGGCGCGAACGGACGGTGAAGGCCACCGCTG
GCAGTATCGCTACGATAAAGACACGCTGCAACTGACAGAAGTCATTAACCCGCAGGGCGAGTCTTATCTTTATATTCTCG
ATAACTGTGGCCGGGTGACGGAAGAGCGTGACTGGGGCGGCGTGGTCTGGCGTTACCGCTATGACGCCGATGGCCTCTGT
ACCGCCAGGGTCAACGGACTGGAGGAAACCATCCTCTACAGCCGGGACGCCGCAGGCCGCCTGGCAGAAGTCATCACGCC
GGAAGGCAAAACGCAGTATGCCTATGACAAATCCGGCAGGCTGACGGGTATCTTCAGCCCGGACGGTACATCACAGCGCA
CCGGCTATGACGAACGCGGGCGGGTGAATGTCACCACTCAGGGCCGACGGGCCATTGAATACCACTACCCCGATGAACAC
ACCGTTATCCGCTGTATCCTGCCACCGGAAGATGAACGCGACAGACACCCCGATGAATCCCTGCTGAAAACCACGTACCG
TTATAACGCCGCCGGAGAACTGACGGAGGTCATTCTGCCGGGGGATGAGACGCTGACGTTCAGCCGTGATGAGGCGGGAC
GTGAAGTGTTCCGGCACAGTAACCGGGGTTTTGCCTGTGAGCAGGGCTGGAATGCAGCCAGCCAGCTTGTCACCCAGCGC
GCCGGATTTTTTCCGGAAGAAGCCACATGGGGCGGGCTGCTCCCCTCACTGGTACGGGAGTACCGTTACGACAGCGCGGG
CAATGTGTCGGCTGTCACCAGCCGGGAAGATTACGGACGGGAAACACGGCGGGAATACCGGCTGGACCGGAACGGTCAGG
TCACGGCGGTGACAGCCTCAGGCACCGGGCTGGGCTATGGCGAAGGCGATGAGTCCTATGGCTATGACAGTTGTGGCTAC
CTGAAGGCGCAGTCTGCGGGCAGGCACCGGATAAGTGAAGAGACTGACCAGTATGCCGGAGGCCACCGGCTGAAACAGGC
CGGAAACACGCAGTATGACTATGACGCCGCAGGCCGGATGGTCAGCCGGACAAAACACCGTGACGGCTACCGCCCGGAAA
CAGAGCGGTTCCGGTGGGACAGCCGGGACCAGCTGACCGGGTATTGCAGCGCACAGGGTGAGCTGTGGGAATACCGCCAC
GACGCCAGCGGCAGACGAACAGAGAAACGCTGCGACCGGAAGAAAATCCGCTTCACGTATCTGTGGGACGGCGACAGTAT
TGCGGAAATCCGGGAATACCGCGATGATAAACTGTACAGCGTACGGCACCTGGTGTTTAACGGCTTTGAGCTGATAAGCC
AGCAGTTCAGCCGGGTACGGCAGCCGCATCCGTCCGTGGCCCCGCAGTGGGTGACGCGAACGAATTATGCGGTGAGCGAC
CTGACGGGCCGTCCGCTGATGCTCTTTAACAGTGAAGGTAAAACCGTCTGGCGACCGGGGCAGACCAGCCTGTGGGGGCT
GGCACTCAGTCTGCCCGCAGACACAGACTACCCGGACCCGCGCGGGGAACTGGACCCGGAAGCCGCCCCCGGCCTGCTGT
ATGCGGGACAGTGGCAGGATGCGGAATCCGGGCTGTGCTATAACCGGTTCCGGTATTACGAGCCGGAAACCGGAATGTAC
CTGGTGAGTGACCCGCTGGGGTTGCAGGGGGGTGAGCAGACTTACCGGTATGTGCCGAATCCACTGGGGTATATTGATCC
GCTGGGGCTGGCAAAAACTTCTGTTCCCGCAGAAAAAATAAGCCTTTCGGATAAAGCAAGAGATCTTTTTAGACAAGGAA
AAGTAAGAGAAGCATTAGATGTTCATTATGAGGATCTTGTACGAAGAAAGCTTGGTGGTATATCTCAAGAAATTGCCGGG
CGTGAATATGATGTTGTTACTGACAAGATCATAGCGCAGGTCAAGAGGACTTATAGCTCTATAGATAATCCAAAAAACTT
CTTAAGTAAGTCTACTCGGACACAAATTAAGAAGACAATTGAGCTGGCTGAAGAACAGGGGAAAGAGGCTCAGTTTTGGT
TTAAATATGGAGTAAGCCCTAAAGTAAGAGAATATATAGAAAGCAAAGGTGGGAAAGTTATATTGGGTATGGGGAATTAA

Protein sequence :
MSEGPGGPQGATAGGTLAMRMLSQQAMVASQMKRAANDKAIAQMLAAKKSGPPAARLGDEIQHKSFLGALAGAVLGAIVT
IAEGCLIMAACATGPYALVLVPALMYASYKASDYVEEKQNQLESWINSFCDTDGAINTGSENVNINGKPAARAAVTLPPP
PPPGAIPEVPQGEPSWGDIATDLLESAAEKAVPLAKAWGNAVITLTDSNAGFMDRLCAGTSLLFPAGPVLMEFATMVGGR
GEIKKDVDFPEAGEDTALCDKENKPPRIAQGSSNVFINNQPAARKGDKLECSAAIVEGSPDVFIGGEQVTYLDIQPEFPP
WQRMILGGITIASYLLPPAGLLGKLGNLAKLGKLGNLLGKSGKLLGAKLGALLSRTKNALKNTYNVLKKFIKDPVDPVTG
AYCDERTDFTLGQTLPLSFTRFHCSVLPLHGLTGVGWSDSWSEYAWVREQGKRVDIISLGATLNFAFDGESDTAVNPYHA
QYILRRRDDYLELFDRDALSSRFFYDAFPGMRLRHPVTDDTSDDRLAHSPADRMYMLGGMSDTASNRITFERDSQYRITG
VSHTDGIRLKLTYHASGYLKAIHRTDNGIQTLATYEQDARGRLTEADARLDYHLFYEYDAADRIIRWSDNDQTWSRFTYD
EQGRCVTVTGAEGYYNATLDYGDGCTTLTDGKGTHRYYYDPDGNILREEAPDGSTTTYEWDEFHHLLARHSPAGRVEKFE
YNAAHGQLSRYTAADGAEWQYRYDERGLLSNITDPAGQTWTQQCDERGLPVSLVSPQGEETRLAYTAQGLLSGIFRQDER
RLGIEYDHHNRPETLTDVMGREHHTEYSGHDLPVKMRGPGGQSVRLQWQQHHKLSGLERTGTGAEGFRYDRHGNLLAWTD
GNGVVWTMEYGPFDLPVARTDGEGHRWQYRYDKDTLQLTEVINPQGESYLYILDNCGRVTEERDWGGVVWRYRYDADGLC
TARVNGLEETILYSRDAAGRLAEVITPEGKTQYAYDKSGRLTGIFSPDGTSQRTGYDERGRVNVTTQGRRAIEYHYPDEH
TVIRCILPPEDERDRHPDESLLKTTYRYNAAGELTEVILPGDETLTFSRDEAGREVFRHSNRGFACEQGWNAASQLVTQR
AGFFPEEATWGGLLPSLVREYRYDSAGNVSAVTSREDYGRETRREYRLDRNGQVTAVTASGTGLGYGEGDESYGYDSCGY
LKAQSAGRHRISEETDQYAGGHRLKQAGNTQYDYDAAGRMVSRTKHRDGYRPETERFRWDSRDQLTGYCSAQGELWEYRH
DASGRRTEKRCDRKKIRFTYLWDGDSIAEIREYRDDKLYSVRHLVFNGFELISQQFSRVRQPHPSVAPQWVTRTNYAVSD
LTGRPLMLFNSEGKTVWRPGQTSLWGLALSLPADTDYPDPRGELDPEAAPGLLYAGQWQDAESGLCYNRFRYYEPETGMY
LVSDPLGLQGGEQTYRYVPNPLGYIDPLGLAKTSVPAEKISLSDKARDLFRQGKVREALDVHYEDLVRRKLGGISQEIAG
REYDVVTDKIIAQVKRTYSSIDNPKNFLSKSTRTQIKKTIELAEEQGKEAQFWFKYGVSPKVREYIESKGGKVILGMGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
rhs-core AAN64198.1 Rhs Not tested macrophage toxin pathogenicity island Protein 3e-176 41