Name : EcolC_3079 (EcolC_3079) Accession : YP_001726030.1 Strain : Escherichia coli ATCC 8739 Genome accession: NC_010468 Putative virulence/resistance : Unknown Product : YD repeat-containing protein Function : - COG functional category : M : Cell wall/membrane/envelope biogenesis COG ID : COG3209 EC number : - Position : 3367048 - 3371565 bp Length : 4518 bp Strand : + Note : TIGRFAM: YD repeat protein; PFAM: YD repeat-containing protein; PAAR repeat-containing protein; KEGG: ssn:SSON_0517 rhs core protein with extension DNA sequence : ATGAGTGAAGGACCAGGCGGGCCACAGGGAGCGACCGCAGGCGGTACGCTGGCAATGCGAATGCTGTCACAGCAGGCGAT GGTCGCCAGCCAGATGAAACGGGCAGCCAACGACAAAGCCATTGCACAGATGCTGGCAGCAAAGAAGTCCGGCCCACCTG CCGCCAGGCTGGGCGATGAAATTCAGCATAAAAGTTTTCTGGGGGCACTGGCAGGGGCCGTGCTGGGGGCGATAGTGACC ATCGCAGAAGGTTGCCTGATTATGGCCGCCTGCGCCACCGGCCCTTATGCGCTGGTTCTGGTGCCTGCGCTGATGTATGC CAGCTATAAGGCGAGTGATTATGTGGAGGAGAAACAGAACCAGCTTGAATCATGGATAAACAGCTTTTGTGACACGGACG GCGCCATCAATACCGGTTCTGAAAATGTAAACATTAACGGAAAGCCCGCTGCAAGGGCCGCCGTCACCCTTCCCCCTCCT CCCCCACCTGGAGCAATACCTGAAATCCCACAGGGGGAACCCTCATGGGGTGATATTGCCACTGACCTGCTTGAATCGGC AGCGGAAAAAGCAGTACCACTGGCGAAGGCCTGGGGGAACGCTGTTATCACCCTGACGGAAAGCAATGCCGGTTTTATGG ATCGCGTCAGCGCCGGCGCATCGCTTCTGTTTCCCGCCGGTCCGGTATTAATGGAGTTTGCCACCATGGTGGGCGGGCGT GGCGAAATCAAAAAAGATGTGGATTTCCCGGAAGCCGGTGAGGACACGGCGCTCTGCGACAAGGAGAACAAACCACCGAG GATAGCCCAGGGCAGTAGCAACGTCTTTATCAACAATCAGCCTGCCGCGCGCAAGGGCGACAAACTGGAGTGCAGTGCGG CGATCGTGGAAGGTTCGCCGGACGTCTTTATTGGGGGTGAGCAGGTCACCTATCTGGATATCCAGCCGGAGTTCCCGCCA TGGCAGAGAATGATCCTGGGAGGAATAACGATAGCCAGCTATCTTCTGCCGCCAGCAGGACTGCTGGGAAAACTGGGGAA TCTGGCGAAACTGGGCAAACTGGGAAACCTGCTGGGGAAAAGCGGGAAGCTGCTGGGCGCAAAGCTCGGCGCGTTGCTGG GGAAAACAGGTAAGTCGTTAAAAAGTATTGCCAATAAAGTCATCAGATGGGTAACAGATCCTGTCGATCCGGTAACCGGC GCGTACTGCGACGAACGTACCGACTTCACCCTGGGCCAGACCCTCCCCCTCTCCTTCACCCGTTTCCACAGTTCGGTACT GCCACTGCATGGCCTGACGGGCGTGGGCTGGAGTGACTCCTGGAGCGAATACGCCTGGGTGCGTGAACAGGGAAACCGGG TGGATATCATCAGCCTGGGAGCCACGCTGAACTTCGCCTTCGACGGTGAAAGTGATACGGCGGTTAACCCGTATCACGCC CAGTACATTCTGCGCCGCCGTGATGATTATCTGGAGCTGTTCGACAGGGATGCACTGAGCAGCCGCTTCTTTTATGACGC CTTTCCGGGAATGCGTCTGCGCCACCCGGTGACTGACGATACCAGCGATGACCGCCTGGCACACAGCCCCGCAGACCGGA TGTACATGCTGGGCGGGATGAGCGACACCGCCAGCAACCGCATCACGTTTGAGCGCGACAGCCAGTACCGGATCACGGGT GTCAGTCACACCGACGGGATCCGGCTTAAACTGACGTACCACGCCAGCGGCTACCTGAAAGCCATTCACCGCACGGATAA CGGCATACAGACGCTGGCGACCTACGAACAGGATGCGCGGGGGCGGCTGACAGAAGCGGATGCGCGGCTGGACTACCACC TGTTTTATGAGTACGACGCTGCGGACCGGATCATCCGCTGGTCCGATAACGACCAGACGTGGAGCCGTTTCACCTACGAT GCACAGGGCCGGTGCGTGACCGTCACCGGGGCGGAGGGCTATTACAACGCCACGCTGGACTATGGTGACGGCTGCACCAC CGTGACGGACGGCAAGGGCATTCACTGTTATTACTATGATCCTGACGGCAATATTCTGCGGGAAGCAGCGCCGGACGGCA GTACCACCACGTATGAATGGGATGAATTCCATCACCTGCTGGCCCGCCACTCCCCTGCCGGACGGGTGGAGAAATTTGAA TACAACGCCGCACACGGTCAGTTAAGCCGTTATACGGCGGCAGACGGCGCGGAGTGGCAGTACCGCTATGATGAGCGCGG CCTGCTCAGCAACATCACCGACCCTGCCGGACAGACGTGGACACAGCAGTGCGATGAACGCGGCCTGCCGGTGAGTCTGG TATCGCCACAGGGCGAAGAGACCCGGCTGGCGTACACCGCTCAGGGGCTGCTATCGGGGATATTCCGCCAGGATGAACGG CGTCTGGGCATAGAGTACGACCACCACAACCGGCCGGAAACACTCACCGACGTGATGGGCCGTGAACACCACACCGAATA CAGCGGTCACGACCTGCCGGTGAAGATGCGCGGCCCCGGCGGTCAGTCAGTGCGGTTGCAGTGGCAGCAGCACCATAAAC TGAGTGGCATTGAGCGGGCAGAAACCGGCGCAGAAGGATTCCGCTATGACCGCCACGGCAACCTGCTGGCGTACACGGAC GGTAACGGCGTTGTCTGGACAATGGAATACGGCCCGTTCGATTTGCCGGTGGCGCGAACGGACGGTGAAGGCCACCGCTG GCAGTACCGCTACGATAAAGACACGCTGCAGCTCACAGAAGTCATTAACCCGCAGGGCGAGTCATACCGTTATATTCTGG ACAACTGTGGCCGGGTGACGGAAGAGCGTGACTGGGGCGGCGTGGTCTGGCGTTACCGCTATGACGCTGATGGCCTGTGT ACCGCCAGGGTCAACGGCCTGGAGGAAACCATCCTCTACAGCCGGGACGCCGCAGGCCGCCTGGCAGAAGTCATCACGCC GGAAGGCAAAACGCAGTATGCCTATGACAAATCCGGCAGGCTGACGGGTATCTTCAGCCCGGACGGTACATCACAGCGCA CCGGCTATGACGAACGCGGGCGGGTGAATGTCACCACTCAGGGCCGACGGGCCATTGAATACCACTACCCCGATGAACAC ACCGTTATCCGCTGTATCCTGCCACCGGAAGATGAACGCGACAGACACCCCGATGAATCCCTGCTGAAAACCACGTACCG TTATAACGCCGCCGGAGAACTGACGGAGGTCATTCTGCCGGGGGATGAGACGCTGACGTTCAGCCGTGATGAGGCGGGAC GTGAAGTGTTCCGGCACAGTAACCGGGGTTTTGCCTGTGAGCAGGGCTGGAATGCAGCCAGCCAGCTTGTCACCCAGCGC GCCGGATTTTTCCCGGAGGAAACCACATGGGGCGGGCTGCTCCCCTCACTGGTACGGGAGTACCGTTACGACAGCGCGGG CAATGTGTCGGCTGTCACCAGCCGGGAAGATTACGGACGGGAAACACGGCGGGAATACCGGCTGGACCGGAACGGTCAGG TCACGGCGGTGACAGCCTCAGGCACCGGGCTGGGCTATGGCGAAGGCGATGAGTCCTATGGCTATGACAGTTGTGGCTAC CTGAAGGCGCAGTCTGCGGGCAGGCACCGGATAAGTGAAGAGACTGAGCGGTATGCCGGAGGCCACCGGCTGAAACAGGC CGGAAACATGCAGTATGACTATGACGCCGCAGGCCGGATGGTCAGCCGGACAAAACACCGTGACGGCTACCGCCCGGAAA CAGAGCGGTTCCGGTGGGACAGCCGGGACCAGCTGACCGGGTATTGCAGCGCACAGGGTGAGCAGTGGGAATACCGCCAC GACGCCAGCGGCAGACGAACGGAAAAACGCTGCGACCGGAAGAAAATCCGTTTTACGTACCTGTGGGACGGCGACAGTAT TGGGGAAATCCGGGAATACCGCGATGATAAACTGTACAGCGTACGGCACCTGGTGTTTAACAGCTTTGAGCTGATAAGCC AGCAGTTCAGCCGGGTACGACAGCCGCACCCGTCCGTGGCCCCGCAGTGGGTGACGCGGACGAATCATGCGGTGAACGAC CTGACGGGCCGTCCGCTGATGCTCTTTAACAGTGAAGGTAAAACCGTCTGGCGACCGGGACAGACCAGCCTGTGGGGGCT GGCACTCAGCCTGCCCGCAGACACCGGCTACCCGGACCCGCGCGGGGAACTGGACCCGGAAGCCGACCCCGGCCTGCTGT ATGCGGGACAGTGGCGGGATGGAGAATCAGGGCTGTGCTATAACCGGTTCCGGTATTACGAGCCGGAAACCGGGATGTAC CTGGTGAGTGATCCACTGGGGTTGCAGGGAGGGGAGCAGACTTACCAGTATGTGCCGAATCCTTTAAGATGGATAGATCC CTTAGGATTAAATAAAGGAGCTTCATTATCTAAAATGATGAATAGCTCCAGTGATCTCATGGGGTTGAGAAGGCAGCCCC AGAACTTCTGGCGGCTATATCGCGGAAAAGACATTTAA Protein sequence : MSEGPGGPQGATAGGTLAMRMLSQQAMVASQMKRAANDKAIAQMLAAKKSGPPAARLGDEIQHKSFLGALAGAVLGAIVT IAEGCLIMAACATGPYALVLVPALMYASYKASDYVEEKQNQLESWINSFCDTDGAINTGSENVNINGKPAARAAVTLPPP PPPGAIPEIPQGEPSWGDIATDLLESAAEKAVPLAKAWGNAVITLTESNAGFMDRVSAGASLLFPAGPVLMEFATMVGGR GEIKKDVDFPEAGEDTALCDKENKPPRIAQGSSNVFINNQPAARKGDKLECSAAIVEGSPDVFIGGEQVTYLDIQPEFPP WQRMILGGITIASYLLPPAGLLGKLGNLAKLGKLGNLLGKSGKLLGAKLGALLGKTGKSLKSIANKVIRWVTDPVDPVTG AYCDERTDFTLGQTLPLSFTRFHSSVLPLHGLTGVGWSDSWSEYAWVREQGNRVDIISLGATLNFAFDGESDTAVNPYHA QYILRRRDDYLELFDRDALSSRFFYDAFPGMRLRHPVTDDTSDDRLAHSPADRMYMLGGMSDTASNRITFERDSQYRITG VSHTDGIRLKLTYHASGYLKAIHRTDNGIQTLATYEQDARGRLTEADARLDYHLFYEYDAADRIIRWSDNDQTWSRFTYD AQGRCVTVTGAEGYYNATLDYGDGCTTVTDGKGIHCYYYDPDGNILREAAPDGSTTTYEWDEFHHLLARHSPAGRVEKFE YNAAHGQLSRYTAADGAEWQYRYDERGLLSNITDPAGQTWTQQCDERGLPVSLVSPQGEETRLAYTAQGLLSGIFRQDER RLGIEYDHHNRPETLTDVMGREHHTEYSGHDLPVKMRGPGGQSVRLQWQQHHKLSGIERAETGAEGFRYDRHGNLLAYTD GNGVVWTMEYGPFDLPVARTDGEGHRWQYRYDKDTLQLTEVINPQGESYRYILDNCGRVTEERDWGGVVWRYRYDADGLC TARVNGLEETILYSRDAAGRLAEVITPEGKTQYAYDKSGRLTGIFSPDGTSQRTGYDERGRVNVTTQGRRAIEYHYPDEH TVIRCILPPEDERDRHPDESLLKTTYRYNAAGELTEVILPGDETLTFSRDEAGREVFRHSNRGFACEQGWNAASQLVTQR AGFFPEETTWGGLLPSLVREYRYDSAGNVSAVTSREDYGRETRREYRLDRNGQVTAVTASGTGLGYGEGDESYGYDSCGY LKAQSAGRHRISEETERYAGGHRLKQAGNMQYDYDAAGRMVSRTKHRDGYRPETERFRWDSRDQLTGYCSAQGEQWEYRH DASGRRTEKRCDRKKIRFTYLWDGDSIGEIREYRDDKLYSVRHLVFNSFELISQQFSRVRQPHPSVAPQWVTRTNHAVND LTGRPLMLFNSEGKTVWRPGQTSLWGLALSLPADTGYPDPRGELDPEADPGLLYAGQWRDGESGLCYNRFRYYEPETGMY LVSDPLGLQGGEQTYQYVPNPLRWIDPLGLNKGASLSKMMNSSSDLMGLRRQPQNFWRLYRGKDI |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
rhs-core | AAN64198.1 | Rhs | Not tested | macrophage toxin pathogenicity island | Protein | 9e-161 | 41 |