Name : ECIAI1_0547 (ECIAI1_0547)
Accession : YP_002386021.1
Strain : Escherichia coli IAI1
Genome accession : NC_011741
Putative virulence/resistance : Unknown
Product : Rhs core protein with extension
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 589204 - 594003 bp
Length : 4800 bp
Strand : -
Note : Evidence 2b : Function of strongly homologous gene; PubMedId : 1766878, 2644231, 2403547, 7934896
DNA sequence :
ATGAGTGAAGGACCAGGCGGGCCACAGGGAGCGACCGCAGGCGGTACGCTGGCAATGCGAATGCTGTCACAGCAGGCGAT GGTCGCCAGCCAGATGAAACGGGCAGCCAACGACAAAGCCATTGCACAGATGCTGGCAGCAAAGAAGTCCGGCCCACCTG CCGCCAGGCTGGGCGATGAAATTCAGCATAAAAGTTTTCTGGGGGCACTGGCAGGGGCCGTGCTGGGGGCGATAGTGACC ATCGCAGAAGGTTGCCTGATTATGGCCGCCTGTGCCACCGGCCCTTATGCGCTGGTTCTGGTGCCTGCGCTGATGTATGC CAGCTATAAGGCGAGTGATTATGTGGAGGAGAAACAGAACCAGCTTGAGTCATGGATAAACAGCTTTTGTGACACGGACG GCGCCATCAATACCGGTTCTGAAAATGTAAACATTAACGGAAAGCCCGCTGCCAGAGCCGCCGTCACCCTTCCCCCTCCT CCCCCACCTGGAGCAATACCTGAAGTCCCACAGGGGGAACCCTCATGGGGTGATATTGCCACTGACCTGCTTGAATCGGC AGCGGAAAAAGCAGTACCACTGGCGAAGGCCTGGGGGAACGCTGTTATCACCCTGACGGACAGCAATGCCGGTTTTATGG ATCGCTTATGTGCAGGCACATCGCTTCTGTTTCCCGCCGGTCCGGTATTAATGGAGTTTGCCACCATGGTGGGCGGGCGT GGCGAAATCAAAAAAGACGTGGATTTCCCGGAAGCCGGTGAGGACACGGCGCTCTGCGACAAGGAGAACAAACCACCGAG GATAGCCCAGGGCAGCAGCAACGTCTTTATCAACAATCAGCCTGCCGCGCGCAAGGGCGACAAACTGGAGTGCAGCGCGG CAATCGTGGAAGGTTCGCCGGACGTCTTTATTGGGGGTGAGCAGGTCACCTATCTGGATATCCAGCCGGAGTTCCCGCCA TGGCAGAGAATGATCCTGGGAGGAATAACGATAGCCAGCTATCTTCTGCCGCCAGCAGGACTGCTGGGAAAACTGGGGAA TCTGGCGAAACTGGGCAAACTGGGAAACCTGCTGGGGAAAAGCGGGAAGCTGCTGGGTGCAAAACTCGGCGCATTGCTGA GCAGAACAAAAAATGCCCTGAAAAATACGTATAACGTCCTGAAAAAGTTTATCAAAGATCCGGTTGACCCGGTAACCGGC GCGTACTGCGACGAACGTACTGACTTCACCCTGGGCCAGACACTCCCCCTCTCCTTCACCCGCTTCCACTGTTCGGTACT GCCACTGCATGGCCTGACCGGCGTGGGCTGGAGCGATTCCTGGAGCGAATACGCCTGGGTGCGTGAACAGGGAAAGCGGG TGGATATCATCAGCCTGGGAGCCACGCTGAACTTCGCCTTCGACGGTGAAAGTGATACGGCGGTTAACCCGTATCACGCC 
CAGTACATCCTGCGCCGCCGTGATGATTACCTGGAGCTGTTCGACCGTGATGCCCTGAGCAGCCGCTTCTTTTATGACGC CTTTCCGGGAATGCGCCTGCGCCACCCGGTGACTGACGATACCAGCGATGACCGCCTGGCACACAGCCCCGCAGACCGGA TGTACATGCTGGGCGGGATGAGCGACACCGCCAGCAACCGCATCACGTTTGAGCGCGACAGCCAGTACCGGATCACAGGT GTCAGTCACACCGACGGGATCCGGCTTAAACTGACGTACCATGCCAGCGGCTACCTGAAAGCCATTCACCGTACGGATAA CGGCATACAGACGCTGGCGACCTACGAACAGGATGCGCGGGGGCGGCTGACAGAAGCGGATGCGCGGCTGGACTACCACC TGTTTTATGAGTACGACGCTGCGGACCGGATCATCCGCTGGTCCGATAACGACCAGACGTGGAGCCGTTTCACCTACGAT GAACAGGGCCGGTGCGTGACCGTCACCGGGGCGGAGGGCTATTACAACGCCACGCTGGACTATGGTGACGGCTGCACCAC CCTGACGGACGGCAAGGGCACTCACCGTTATTACTATGATCCTGACGGCAATATTCTGCGGGAAGAAGCGCCGGACGGCA GCACCACCACGTATGAATGGGATGAATTCCATCACCTGCTGGCCCGCCACTCCCCTGCCGGGCGGGTGGAGAAATTTGAA TACAACGCCGCACACGGCCAGCTAAGCCGTTACACGGCAGCAGACGGTGCGGAGTGGCAGTACCGCTATGATGAGCGCGG CCTGCTCAGCAACATCACCGACCCTGCCGGGCAGACGTGGACACAGCAGTGTGATGAACGCGGCCTGCCGGTAAGCCTGG TGTCGCCACAGGGCGAAGAGACCCGGCTGGCATACACCGCTCAGGGGCTGCTGTCGGGAATATTCCGCCAGGATGAACGG CGTCTGGGCATAGAGTACGACCACCACAACCGGCCGGAAACACTCACCGACGTGATGGGCCGTGAACACCACACCGAATA CAGCGGTCACGACCTGCCGGTGAAGATGCGCGGCCCCGGCGGTCAGTCAGTGCGGTTGCAGTGGCAGCAGCACCATAAAC TGAGCGGCCTTGAGCGCACTGGCACCGGCGCGGAAGGATTCCGCTATGACCGCCACGGCAACCTGCTGGCCTGGACGGAC GGCAACGGTGTTGTCTGGACGATGGAATACGGTCCGTTCGATTTGCCGGTGGCGCGAACGGACGGTGAAGGCCACCGCTG GCAGTATCGCTACGATAAAGACACGCTGCAACTGACAGAAGTCATTAACCCGCAGGGCGAGTCTTATCTTTATATTCTCG ATAACTGTGGCCGGGTGACGGAAGAGCGTGACTGGGGCGGCGTGGTCTGGCGTTACCGCTATGACGCCGATGGCCTCTGT ACCGCCAGGGTCAACGGACTGGAGGAAACCATCCTCTACAGCCGGGACGCCGCAGGCCGCCTGGCAGAAGTCATCACGCC GGAAGGCAAAACGCAGTATGCCTATGACAAATCCGGCAGGCTGACGGGTATCTTCAGCCCGGACGGTACATCACAGCGCA CCGGCTATGACGAACGCGGGCGGGTGAATGTCACCACTCAGGGCCGACGGGCCATTGAATACCACTACCCCGATGAACAC ACCGTTATCCGCTGTATCCTGCCACCGGAAGATGAACGCGACAGACACCCCGATGAATCCCTGCTGAAAACCACGTACCG TTATAACGCCGCCGGAGAACTGACGGAGGTCATTCTGCCGGGGGATGAGACGCTGACGTTCAGCCGTGATGAGGCGGGAC GTGAAGTGTTCCGGCACAGTAACCGGGGTTTTGCCTGTGAGCAGGGCTGGAATGCAGCCAGCCAGCTTGTCACCCAGCGC 
GCCGGATTTTTTCCGGAAGAAGCCACATGGGGCGGGCTGCTCCCCTCACTGGTACGGGAGTACCGTTACGACAGCGCGGG CAATGTGTCGGCTGTCACCAGCCGGGAAGATTACGGACGGGAAACACGGCGGGAATACCGGCTGGACCGGAACGGTCAGG TCACGGCGGTGACAGCCTCAGGCACCGGGCTGGGCTATGGCGAAGGCGATGAGTCCTATGGCTATGACAGTTGTGGCTAC CTGAAGGCGCAGTCTGCGGGCAGGCACCGGATAAGTGAAGAGACTGACCAGTATGCCGGAGGCCACCGGCTGAAACAGGC CGGAAACACGCAGTATGACTATGACGCCGCAGGCCGGATGGTCAGCCGGACAAAACACCGTGACGGCTACCGCCCGGAAA CAGAGCGGTTCCGGTGGGACAGCCGGGACCAGCTGACCGGGTATTGCAGCGCACAGGGTGAGCTGTGGGAATACCGCCAC GACGCCAGCGGCAGACGAACAGAGAAACGCTGCGACCGGAAGAAAATCCGCTTCACGTATCTGTGGGACGGCGACAGTAT TGCGGAAATCCGGGAATACCGCGATGATAAACTGTACAGCGTACGGCACCTGGTGTTTAACGGCTTTGAGCTGATAAGCC AGCAGTTCAGCCGGGTACGGCAGCCGCATCCGTCCGTGGCCCCGCAGTGGGTGACGCGAACGAATTATGCGGTGAGCGAC CTGACGGGCCGTCCGCTGATGCTCTTTAACAGTGAAGGTAAAACCGTCTGGCGACCGGGGCAGACCAGCCTGTGGGGGCT GGCACTCAGTCTGCCCGCAGACACAGACTACCCGGACCCGCGCGGGGAACTGGACCCGGAAGCCGCCCCCGGCCTGCTGT ATGCGGGACAGTGGCAGGATGCGGAATCCGGGCTGTGCTATAACCGGTTCCGGTATTACGAGCCGGAAACCGGAATGTAC CTGGTGAGTGACCCGCTGGGGTTGCAGGGGGGTGAGCAGACTTACCGGTATGTGCCGAATCCACTGGGGTATATTGATCC GCTGGGGCTGGCAAAAACTTCTGTTCCCGCAGAAAAAATAAGCCTTTCGGATAAAGCAAGAGATCTTTTTAGACAAGGAA AAGTAAGAGAAGCATTAGATGTTCATTATGAGGATCTTGTACGAAGAAAGCTTGGTGGTATATCTCAAGAAATTGCCGGG CGTGAATATGATGTTGTTACTGACAAGATCATAGCGCAGGTCAAGAGGACTTATAGCTCTATAGATAATCCAAAAAACTT CTTAAGTAAGTCTACTCGGACACAAATTAAGAAGACAATTGAGCTGGCTGAAGAACAGGGGAAAGAGGCTCAGTTTTGGT TTAAATATGGAGTAAGCCCTAAAGTAAGAGAATATATAGAAAGCAAAGGTGGGAAAGTTATATTGGGTATGGGGAATTAA Protein sequence : MSEGPGGPQGATAGGTLAMRMLSQQAMVASQMKRAANDKAIAQMLAAKKSGPPAARLGDEIQHKSFLGALAGAVLGAIVT IAEGCLIMAACATGPYALVLVPALMYASYKASDYVEEKQNQLESWINSFCDTDGAINTGSENVNINGKPAARAAVTLPPP PPPGAIPEVPQGEPSWGDIATDLLESAAEKAVPLAKAWGNAVITLTDSNAGFMDRLCAGTSLLFPAGPVLMEFATMVGGR GEIKKDVDFPEAGEDTALCDKENKPPRIAQGSSNVFINNQPAARKGDKLECSAAIVEGSPDVFIGGEQVTYLDIQPEFPP WQRMILGGITIASYLLPPAGLLGKLGNLAKLGKLGNLLGKSGKLLGAKLGALLSRTKNALKNTYNVLKKFIKDPVDPVTG AYCDERTDFTLGQTLPLSFTRFHCSVLPLHGLTGVGWSDSWSEYAWVREQGKRVDIISLGATLNFAFDGESDTAVNPYHA 
QYILRRRDDYLELFDRDALSSRFFYDAFPGMRLRHPVTDDTSDDRLAHSPADRMYMLGGMSDTASNRITFERDSQYRITG VSHTDGIRLKLTYHASGYLKAIHRTDNGIQTLATYEQDARGRLTEADARLDYHLFYEYDAADRIIRWSDNDQTWSRFTYD EQGRCVTVTGAEGYYNATLDYGDGCTTLTDGKGTHRYYYDPDGNILREEAPDGSTTTYEWDEFHHLLARHSPAGRVEKFE YNAAHGQLSRYTAADGAEWQYRYDERGLLSNITDPAGQTWTQQCDERGLPVSLVSPQGEETRLAYTAQGLLSGIFRQDER RLGIEYDHHNRPETLTDVMGREHHTEYSGHDLPVKMRGPGGQSVRLQWQQHHKLSGLERTGTGAEGFRYDRHGNLLAWTD GNGVVWTMEYGPFDLPVARTDGEGHRWQYRYDKDTLQLTEVINPQGESYLYILDNCGRVTEERDWGGVVWRYRYDADGLC TARVNGLEETILYSRDAAGRLAEVITPEGKTQYAYDKSGRLTGIFSPDGTSQRTGYDERGRVNVTTQGRRAIEYHYPDEH TVIRCILPPEDERDRHPDESLLKTTYRYNAAGELTEVILPGDETLTFSRDEAGREVFRHSNRGFACEQGWNAASQLVTQR AGFFPEEATWGGLLPSLVREYRYDSAGNVSAVTSREDYGRETRREYRLDRNGQVTAVTASGTGLGYGEGDESYGYDSCGY LKAQSAGRHRISEETDQYAGGHRLKQAGNTQYDYDAAGRMVSRTKHRDGYRPETERFRWDSRDQLTGYCSAQGELWEYRH DASGRRTEKRCDRKKIRFTYLWDGDSIAEIREYRDDKLYSVRHLVFNGFELISQQFSRVRQPHPSVAPQWVTRTNYAVSD LTGRPLMLFNSEGKTVWRPGQTSLWGLALSLPADTDYPDPRGELDPEAAPGLLYAGQWQDAESGLCYNRFRYYEPETGMY LVSDPLGLQGGEQTYRYVPNPLGYIDPLGLAKTSVPAEKISLSDKARDLFRQGKVREALDVHYEDLVRRKLGGISQEIAG REYDVVTDKIIAQVKRTYSSIDNPKNFLSKSTRTQIKKTIELAEEQGKEAQFWFKYGVSPKVREYIESKGGKVILGMGN |
Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%) |
rhs-core | AAN64198.1 | Rhs | Not tested | macrophage toxin pathogenicity island | Protein | 3e-176 | 41 |
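
The Position, Length and sequence fields of this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Position coordinates are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not translated. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database tool.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS length is a whole number of codons; the final codon
    is treated as an untranslated stop codon when has_stop_codon is True.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    # Coordinates taken from the Position field of the record above.
    length_bp = span_length(589204, 594003)
    print(length_bp)                           # 4800
    print(expected_protein_length(length_bp))  # 1599
```

Under these assumptions the coordinates give 4800 bp, matching the Length field, and 1600 codons, of which the last is the stop codon, so 1599 amino acids would be expected in the protein sequence.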