Gene Information

Name : SSON53_02785 (SSON53_02785)
Accession : YP_005455134.1
Strain : Shigella sonnei 53G
Genome accession: NC_016822
Putative virulence/resistance : Unknown
Product : rhs core protein with extension
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 551605 - 555036 bp
Length : 3432 bp
Strand : -
Note : COG3209 Rhs family protein

DNA sequence :
TTGCTGGGGAAAACAGGTAAGTCGTTAAAAAGTATTGCCAATAAAGTCATCAGATGGGTAACAGATCCTGTCGATCCGGT
AACCGGCGCATACTGCGACGAACGTACCGACTTCACCCTGGGCCAGACCCTCCCCCTCTCCTTCACCCGTTTCCACAGTT
CTGTACTGCCGCTGCATGGCCTGACCGGCGTGGGCTGGAGCGACTCCTGGAGCGAATACGCCTGGGTGCGTGAACAGGGA
AACCGGGTGGATATCATCAGCCAGGGAGCCACGCTGAGATTTGCCTTCGACGGTGACAGTGATACGGCGGTTAACCCGTA
TCACGCCCAGTACATTCTGCGCCGCCGCGATGATTATCTGGAGCTGTTCGACAGGGATGCACTGAGCAGCCGCTTCTTTT
ATGACGCCTTTCCGGGAATGCGTCTGCGCCACCCGGTGACTGACGATACCAGCGATGACCGCCTGGCACACAGCCCCAAT
GACCGGATGTACATGCTGGGCGGGATGAGCGACACCGCCAGCAACCGCATCACGTTTGAGCGCGACAGCCAGTACCGGAT
CACGGGTGTCAGTCACACCGACGGGATCCGGCTTAAACTGACGTACCACGCCAGCGGCTACCTGAAAGCCATTCACCGCA
CGGATAACGACATACAGACGCTGGCGACCTACGAACAGGATGCGCGGGGGCGGCTGACAGAAGCGGATGCGCGGCTGGAC
TACCACCTGTTTTATGAGTACGACGCTGCGGACCGGATCATCCGCTGGTCCGATAACGACCAGACGTGGAGCCGTTTCAC
CTACGATGAACAGGGCCGGTGCGTGAATGTCACCGGGGCGGAGGGCTATTACAACGCCACGCTGGACTATGGTGACGGCT
GCACCACCGTGACGGACGGCAAGGGCATTCACCGTTATTACTATGATCCTGACGGCAATATTCTGCGGGAAGAAGCGCCG
GACGGCAGCACCACCATGTATGAATGGGATGAATTCCATCACCTGCTGGCCCGCCACTCCCCTGCCGGGCGGGTGGAGAA
ATTTGAATACAACGCCGCACACGGTCAGTTAAGCCGTTACACGGCGGCAGACGGCGCGGAGTGGCAGTACCGCTATGATG
AGCGCGGCCTGCTCAGCAACATCACCGACCCTGCCGGGCAGACGTGGACACAGCAGTGTGATGAACGCGGCCTGCCGGTA
AGTCTGGTGTCGCCACAGGGCGAAGAGACCCGGCTGGCATACACCGCTCAGGGGCTGCTGTCGGGAATATTCCGCCAGGA
TGAACGGCGTCTGGGCATAGAGTACGACCACCACAACCGGCCGGAAACACTCACCGACGTGATGGGCCGTGAACACCACA
CCGAATACAGCGGTCACGACCTGCCAGTGAAGATGCGCGGCCCCGGCGGTCAGTCAGTGCGGTTACAGTGGCAGCAGCAC
CATAAACTGAGCGGCATTGAACGTGCTGGAACCGGCGCGGAAGGATTTCGCTATGACCGCCACGGCAACCTGCTGGCCTG
GACGGACGGCAACGGTGTTGTCTGGACGATGGAATACGGTCCGTTCGATTTTCCGGTGGCGCGAACGGACGGTGAAGGCC
ACCGCTGGCAGTACCGCTACGATAAAGACACGCTGCAACTGACAGAAGTCATCAACCCGCAGGGTGAGTCATATCTTTAT
GTTCTGGATAACTGCGGCCGGGTGACGGAAGAGCGTGACTGGGGCGGCGTGGTCTGGCGTTACCGCTATGACGCCGATGG
CCTCTGTACCGCCAGGGTTAACGGCCTGGAGGAAACCATCCTCTACAGCCGGGACGCCGCAGGCCGCCTGGCAGAAATCA
TCTCGCCGGAAGGCAAAACGCAGTATGCCTATGACAAATCCGGCAGGCTGACGGGTATCTTCAGCCCGGACGGCACATCA
CAGCGCACCGGCTATGACGAACGCGGGCGGGTGAATGTCACCACTCAGGGCCGACGGGCCATTGAATACAACTACCCCGA
TGAACACACCGTCATCCGCTGTATCCTGCCACCGGAAGATGAACGCGACAGACACCCCGGTGAATCCCTGCTGAAAACCA
CGTACCGCTACAACGCCGCCGGAGAACTGACAGAGGTCATTCTGCCGGGGGATGAGACGCTGACGTTCAGCCGTGATGAG
GCGGGACGTGAAGTGCTCCGGCACAGTAACCGGGGTTTTGCCTGTGAACAGGGCTGGAATGCAGCCGGTCAGCCTGTCAG
TCAGCGCGCCGGATTTTTTCCGGCAGAAGCCACATGGGACGGGCTGGTTCCCTCACTGGTACGGGAGTACCGTTACGACA
GCGCGGGTAACGTGTCAGGCGTCACCAGCCGGGAAGATTACGGACGGGAAACACGGCGGGAGTATCGGCTGGACCGGAAC
GGCCAGGTCACGGCGGTGACAGCCTCAGGCACCGGGCTGGGCTATGGCGAAGGCGATGAGTCCTATGGCTATGACAGCTG
CGGCTACCTGAAGGCGCAGTCTGCGGGCAGGCACCGGATAAGTGAAGAGACTGACCAGTATGCCGGAGGCCACCGGCTGA
AACAGGCCGGAAACACGCAGTATGACTATGACGCCGCAGGCCGGATGGTCAGCCGCACCAGACACCGTGACGGCTACCGA
CCGGAAACAGAGCGGTTCCGGTGGGACAGCCGGGACCAGCTGACCGGGTATTGCAGCGCACAGGGTGAGCTGTGGGAATA
CCGCCACGACGCCAGCGGCAGACGAACAGAGAAACGCTGCGACCGGAAGAAAATCCGCTTCACGTATCTGTGGGACGGCG
ACAGTATTGCGGAAATCCGGGAATACCGCGATGATAAACTGTACAGCGTAAGGCACCTGGTGTTTAACGGCTTTGAGCTG
ATAAGCCAGCAGTTCAGCCGGGTACGGCAGGCGCATCCGTCGGTGGCCCCGCAGTGGGTGACGCGGACGAATCATGCGGT
GAGCGACCTGACGGGCCGCCCGCTGATGCTCTTTAACAGTGAAGGAAAGACGGTCTGGCGACCGGGGCAGACCAGCCTGT
GGGGGCTGGCACTCAGCCTGCCCGCAGACACCGGCTACCCGGACCCGCGCGGGGAACTGGACCCGGAAGCCGACCCCGGC
CTGCTGTATGCAGGACAGTGGCAGGATGCGGAATCCGGGCTGTGCTATAATCGGTTCCGGTATTACGAGCCGGAAACCGG
AATGTACCTGGTGAGTGATCCGCTGGGGTTGCAGGGCGGGGAGCAGACTTACCGGTATGTGCCGAATCCTTGTGGGTGGG
TTGATCCGCTGGGATTGGCTGCAAGTTCTAAAATCAGCAGTTTGATGGACTATATTGGCGATGGTCGTCGTGTTAGTGGG
CATACGGGTTTCCTGGATGGGGTTCGTTTATCACGTAGTCAAATAAACAAGGTGATGCTGCCAACTTACTGA

Protein sequence :
MLGKTGKSLKSIANKVIRWVTDPVDPVTGAYCDERTDFTLGQTLPLSFTRFHSSVLPLHGLTGVGWSDSWSEYAWVREQG
NRVDIISQGATLRFAFDGDSDTAVNPYHAQYILRRRDDYLELFDRDALSSRFFYDAFPGMRLRHPVTDDTSDDRLAHSPN
DRMYMLGGMSDTASNRITFERDSQYRITGVSHTDGIRLKLTYHASGYLKAIHRTDNDIQTLATYEQDARGRLTEADARLD
YHLFYEYDAADRIIRWSDNDQTWSRFTYDEQGRCVNVTGAEGYYNATLDYGDGCTTVTDGKGIHRYYYDPDGNILREEAP
DGSTTMYEWDEFHHLLARHSPAGRVEKFEYNAAHGQLSRYTAADGAEWQYRYDERGLLSNITDPAGQTWTQQCDERGLPV
SLVSPQGEETRLAYTAQGLLSGIFRQDERRLGIEYDHHNRPETLTDVMGREHHTEYSGHDLPVKMRGPGGQSVRLQWQQH
HKLSGIERAGTGAEGFRYDRHGNLLAWTDGNGVVWTMEYGPFDFPVARTDGEGHRWQYRYDKDTLQLTEVINPQGESYLY
VLDNCGRVTEERDWGGVVWRYRYDADGLCTARVNGLEETILYSRDAAGRLAEIISPEGKTQYAYDKSGRLTGIFSPDGTS
QRTGYDERGRVNVTTQGRRAIEYNYPDEHTVIRCILPPEDERDRHPGESLLKTTYRYNAAGELTEVILPGDETLTFSRDE
AGREVLRHSNRGFACEQGWNAAGQPVSQRAGFFPAEATWDGLVPSLVREYRYDSAGNVSGVTSREDYGRETRREYRLDRN
GQVTAVTASGTGLGYGEGDESYGYDSCGYLKAQSAGRHRISEETDQYAGGHRLKQAGNTQYDYDAAGRMVSRTRHRDGYR
PETERFRWDSRDQLTGYCSAQGELWEYRHDASGRRTEKRCDRKKIRFTYLWDGDSIAEIREYRDDKLYSVRHLVFNGFEL
ISQQFSRVRQAHPSVAPQWVTRTNHAVSDLTGRPLMLFNSEGKTVWRPGQTSLWGLALSLPADTGYPDPRGELDPEADPG
LLYAGQWQDAESGLCYNRFRYYEPETGMYLVSDPLGLQGGEQTYRYVPNPCGWVDPLGLAASSKISSLMDYIGDGRRVSG
HTGFLDGVRLSRSQINKVMLPTY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
rhs-core AAN64198.1 Rhs Not tested macrophage toxin pathogenicity island Protein 2e-161 41