Gene Information

Name : STMMW_02961 (STMMW_02961)
Accession : YP_005231369.1
Strain : Salmonella enterica D23580
Genome accession: NC_016854
Putative virulence/resistance : Unknown
Product : putative Rhs family protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 332522 - 336616 bp
Length : 4095 bp
Strand : +
Note : pfam_scan;Pfam:PF03527; E()=2.6E-8;score=32.6;query 1145-1177;description=RHS; pfam_scan;Pfam:PF05593; E()=1.1E-6;score=28.0;query 518-555;description=RHS_repeat; pfam_scan;Pfam:PF05488; E()=1.5E-4;score=20.9;query 78-102;description=PAAR_motif

DNA sequence :
ATGTATGAAGCAGCCCGTGTGGATGATCCTATCTACCACACCAGCGCGCTCGCCGGGTTTCTTATCGGCGCTATCATCGG
CATCGCCATTATCGCGCTTGCCGCCTTTGCCTTCTTTAGCTGCGGTTTTCTTGCCGGGCTGATTCTGGGTTTTATGGCCG
ATCAAATAGCCTCCGGGGTATTGCAACTGGGCGAGGCCATCGGGCGCTCCATCCACCACACGGCAGGAAAAATCCTCACC
GGTTCGGAGAATGTCAGCACCAACAGTCGCCCGGCGGCGCGCGCGGTACTGAGTACGGTGAAATGCGATAACCATATCGC
AGAAAAACGCATCGCCCAAGGGTCGGAAAATATCTACATCAACAGCCAGCCCGCCGCCCGTAAGGATGACCACACCGAAT
GCGACGCGGTGATTGAAGACGGTTCGCCGAATGTGTTTCTCGGCGGCGGCACACAGACGGTACTGGAAATCAGTTCTGAA
ATTCCGGACTGGCTGCGCAAGGTGGTGGATGTATTGTTTGTCGTGGCGAGTCTGCTCGGCGGGCTGGCCGGGGCGTGGCG
GCAGGCGGCAAAGCTGGGGACGAAATTTGGCACTAAATGTGCCGCTAAGTTTATCGGCGGGGAGCTTGTCGGGATGGCCG
TGGGTGAGGCTATCAGCGGGCTGTTCAGCAATCCGGTGGATGTGACCACCGGGCAGAAAATCCTGCTGCCGGAAACGGAC
TTCACCCTGCCCGGTCGCCTGCCGGTCACCTGCTCGCGTTTTTACGCCAGCCACCTGGAAACTGTGGGACTGTTGGGACG
GGGCTGGCGGCTGAACTGGGAAACCAGCCTGCGCGATGACGATGAACACATCACGCTGACCGGCGTACAGGGGCGGGAAC
TGCGTTACCCGAAAACGATGCTGACGCCCGGCCACCAGATATTTGACCCGGAAGAACAGTTATACCTCAGCCGCCTGCAT
GACGGGCGTTACGTGCTGCATTACACCGATCGCAGCTATTACGTATTTGGTGATTTTGACAGTGACGGCATGGCATACCT
GCTGTTTATGGAGACGCCGCACCGCCAGCGCATTGTCTTCGGGCACGAAGGAGGCAGACTGGTACGGATAGCCTCCAGCA
GCGGGCATCACCTGTTACTGCACCGCACACAGACCCCGGCAGGGGAGCGGCTGTCGCGAATTGAACTGGTGCAGGGCGGC
ACCCGTGGCAATCTGGTGGAGTACCGGTATGACGATAACGGTCAACTGACCGGCGTGGTGAACCGGGCGGGAACGCAGGT
GCGTCAGTTTGCTTATGAAAACGGGCTGATGACGGCGCACAGCAATGCGACGGGGTTCACCTGCCGCTACCGCTGGCAGG
AACTCGACGGCGCGCCGCGCGTGACGGAGCACGACACCAGTGACGGCGAACATTACCGCTTTGACTATGATTTTGCCGCA
GGCACCACCACCGTCACCGGCAGGCAGGGGGAGACATGGCAGTGGTGGTACGACAGGGAAACGTATATCACCGCGCACCG
GACGCCGGGCGGTGGAATGTACCGCTTCACGTACAACGAAGACCACTTCCCTGTCAACATTGAGCTGCCCGGCGGTCGCA
CGGTGGCGTATGAATATGACATCCAGAACCGGGTGGTGAAGACGACAGATCCGGAAGGCCGGGTGACGCAGACGCAGTGG
AACGGCGAGTTCGACGAAATCACGCGCACGGCGCTGGACGATGACGCTGTCTGGAAAACGCAGTACAACGCCCACGGCCA
GTCAGTGCAGGAGACGGACCCGGAAGGGCGGGTGACGCAGTACGCTTACGATGAACAGGGGCAGATGTGCAGCCGGACGG
ATGCGGCGGGCGGCACGGTGGTGACGGCGTTCGACAGCCGGGGGCAGATGACGCGGTACACCGACTGTTCAGGGCGCAGC
ACAGGATATGACCACGATGAGGACGGCAACCTGACGCGGGTGACGGACGCGGAAGGGAAGGTGGTACGCATCAGCTACAA
CCGACTTGGGTTGCCGGAGACGGTAAACTCACCGGGGAAACAGCAGGACAGGTATACCTGGAATGCGCTGGGGCTGATGA
GCAGCCACCGGCGCATCACGGGGAGCGTGGAGAGCTGGCGGTATACGCCGCGCGGTCTGCTGGCGGCGCACACGGATGAG
GAGAAGCGCGAGACGCGCTGGCAGTACACGCCGGAAGGCCGGGTGGCAGCGCTGACCAACGGCAACGGGGCGCAGTACCG
GTTCAGTCACGATGCGGACGGCAGGCTGGTGCGTGAGGTTCGCCCGGACGGACTGAGCCGTACTTTTATCCTGGACGACA
GCGGTTATCTGACGGCGATACAGACCACGGGCACGCAGGGCGGCGTGCGGCGGGAGACGCAGCAGCGGGATGCGCTGGGC
CGTCTGTTACGGACGGAGAATGAACACGGCCAGCGGACGTTCAGCTACAACCGGCTGGACCAGATAACGGCAGTGACGCT
CACGCCCACGGAGGCGGGGCAACAGCAGCACCGGATGCAGGCCGACACGGTGCGTTTTGAGTATGACCGCAGCGGCTGGC
TGACGGCGGAGCACGCGGGGAACGGTAGCATATGTTATCAGCGCGACGCGCTGGGCAACCCGACGGACATCACGCTGCCG
GACGGGCAGCACCTGACGCATCTGTATTACGGGAGCGGGCATCTGTTACAGACGGCGCTGGACGGCCTGACGGTGAGCGA
GTATGAGCGCGACAGCCTGCACCGTCAGATAATGCGCACGCAGGGGCAGCTTGCGACGTACAGCGGCTATGACGACGACG
GGCTGCTGAGCTGGCAGCGCAGTCTGGCGTCCGGCAGTGCCCCTGTTCTTCCTGGCCAGCGCCCGGCGCGGCAGGGCTGC
GTGACGTCGAGGGACTATTACTGGAACAACCACGGCGAGGTGGGCACGATTGACGACGGCCTGCGTGGCAGCGTGGTGTA
CAGCTATGACAGAAGCGGTTACCTGACCGGGCGCTCAGGTCAGATGTATGACCATGACCGTTATTATTACGATAAGGCGG
GCAACCTGCTGGATAACGAAGGGCAGGGAGCGGTGATGAGCAACCGGCTGCCGGGCTGTGGTCGTGACCGTTACGGCTAT
AACGAGTGGGGCGAGCTGACCACGCGGCGCGACCAGCAACTGGAGTGGAACGCGCAGGGGCAGCTGACGCGGGTCATCAG
CGGCAACACGGAGACGCACTACGGCTACGATGCGCTGGGGAGGCGAACCCGCAAGGCGACGTACGGGCGGCACACGGGCC
ATACGGCGCGGAGCCGGACGGACTTTGTGTGGGAGGGGTTCAGGCTGTTGCAGGAGAACGTGCAGCAGCAGGGCTGGCGG
ACCTATCTGTACGATGCGGAACAGCCGTACACGCCGGTGGCGAGCGTGACGGGGCGGGGAGAAAGCAGGCAGGTGTGGTA
TTACCACACGGATGTGACGGGCACGCCGCAGGAGGTGACGGCGGCGGACGGAACGCTGGTGTGGGCGGGGTATATCAGGG
GGTTTGGAGAGAATGCGGCGGACATCAGCAACAGCGGGGCGTACTTTCACCAGCCGCTGCGGCTGCCGGGGCAGTATTTT
GACGACGAGACAGGGCTGCATTACAATCTGTTCAGATATTATGCACCGGAGTGTGGACGGTTTGTCAGTCAGGATCCGAT
CGGGCTGAGGGGCGGGTTAAACCTTTATCAGTATGCGCCAAATCCTCTCAAATATATAGACCCACTTGGTTTAACCGCGA
CTGTTGGGCGATGGATGGGGCCTGCGGAATATCAGCAAATGCTTGATACTGGGACAGTAGTACAAAGTTCAACAGGGACA
ACTCATGTTGCCTACCCTGCTGATATAGATGCTTTTGGTAAGCAAGCAAAAAATGGTGCTATGTATGTTGAATTTGATGT
GCCTGAAAAATCATTAGTACCTACAAATGAAGGATGGGCAAAAATAGTAGGGCCAGATTCTATCGAAGGGCGATTAGCTA
AACGCAAAGGTTTGCCTGTTCCTGAAATGCCAACAGCAGAAAACATAACTGTAAGGGGCGAGAAAATTAATGGGGAAGTT
GAAGCAAAATGCTAA

Protein sequence :
MYEAARVDDPIYHTSALAGFLIGAIIGIAIIALAAFAFFSCGFLAGLILGFMADQIASGVLQLGEAIGRSIHHTAGKILT
GSENVSTNSRPAARAVLSTVKCDNHIAEKRIAQGSENIYINSQPAARKDDHTECDAVIEDGSPNVFLGGGTQTVLEISSE
IPDWLRKVVDVLFVVASLLGGLAGAWRQAAKLGTKFGTKCAAKFIGGELVGMAVGEAISGLFSNPVDVTTGQKILLPETD
FTLPGRLPVTCSRFYASHLETVGLLGRGWRLNWETSLRDDDEHITLTGVQGRELRYPKTMLTPGHQIFDPEEQLYLSRLH
DGRYVLHYTDRSYYVFGDFDSDGMAYLLFMETPHRQRIVFGHEGGRLVRIASSSGHHLLLHRTQTPAGERLSRIELVQGG
TRGNLVEYRYDDNGQLTGVVNRAGTQVRQFAYENGLMTAHSNATGFTCRYRWQELDGAPRVTEHDTSDGEHYRFDYDFAA
GTTTVTGRQGETWQWWYDRETYITAHRTPGGGMYRFTYNEDHFPVNIELPGGRTVAYEYDIQNRVVKTTDPEGRVTQTQW
NGEFDEITRTALDDDAVWKTQYNAHGQSVQETDPEGRVTQYAYDEQGQMCSRTDAAGGTVVTAFDSRGQMTRYTDCSGRS
TGYDHDEDGNLTRVTDAEGKVVRISYNRLGLPETVNSPGKQQDRYTWNALGLMSSHRRITGSVESWRYTPRGLLAAHTDE
EKRETRWQYTPEGRVAALTNGNGAQYRFSHDADGRLVREVRPDGLSRTFILDDSGYLTAIQTTGTQGGVRRETQQRDALG
RLLRTENEHGQRTFSYNRLDQITAVTLTPTEAGQQQHRMQADTVRFEYDRSGWLTAEHAGNGSICYQRDALGNPTDITLP
DGQHLTHLYYGSGHLLQTALDGLTVSEYERDSLHRQIMRTQGQLATYSGYDDDGLLSWQRSLASGSAPVLPGQRPARQGC
VTSRDYYWNNHGEVGTIDDGLRGSVVYSYDRSGYLTGRSGQMYDHDRYYYDKAGNLLDNEGQGAVMSNRLPGCGRDRYGY
NEWGELTTRRDQQLEWNAQGQLTRVISGNTETHYGYDALGRRTRKATYGRHTGHTARSRTDFVWEGFRLLQENVQQQGWR
TYLYDAEQPYTPVASVTGRGESRQVWYYHTDVTGTPQEVTAADGTLVWAGYIRGFGENAADISNSGAYFHQPLRLPGQYF
DDETGLHYNLFRYYAPECGRFVSQDPIGLRGGLNLYQYAPNPLKYIDPLGLTATVGRWMGPAEYQQMLDTGTVVQSSTGT
THVAYPADIDAFGKQAKNGAMYVEFDVPEKSLVPTNEGWAKIVGPDSIEGRLAKRKGLPVPEMPTAENITVRGEKINGEV
EAKC

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0321 NP_454898.1 Rhs-family protein Not tested SPI-6 Protein 0.0 99