Gene Information

Name : CFSAN001921_15950 (CFSAN001921_15950)
Accession : YP_008256536.1
Strain : Salmonella enterica CFSAN001921
Genome accession: NC_021814
Putative virulence/resistance : Unknown
Product : type IV secretion protein Rhs
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3230221 - 3234315 bp
Length : 4095 bp
Strand : -
Note : Derived by automated computational analysis using gene prediction method: GeneMarkS+.

DNA sequence :
ATGTATGAAGCAGCCCGTGTGGATGATCCTATCTACCACACCAGCGCGCTCGCCGGGTTTCTTATCGGCGCTATCATCGG
CATCGCCATTATCGCGCTTGCCGCCTTTGCCTTCTTTAGCTGCGGTTTTCTTGCCGGGCTGATTCTGGGTTTTATGGCCG
ATCAAATAGCCTCCGGGGTATTGCAACTGGGCGAGGCCATCGGGCGCTCCATCCACCACACGGCAGGAAAAATCCTCACC
GGTTCGGAGAATGTCAGCACCAACAGTCGCCCGGCGGCGCGCGCGGTACTGAGTACGGTGAAATGCGATAACCATATCGC
AGAAAAACGCATCGCCCAAGGGTCGGAAAATATCTACATCAACAGCCAGCCCGCCGCCCGTAAGGATGACCACACCGAAT
GCGACGCGGTGATTGAAGACGGTTCGCCGAATGTGTTTCTCGGCGGCGGCACACAGACGGTACTGGAAATCAGTTCTGAA
ATTCCGGACTGGCTGCGCAAGGTGGTGGATGTATTGTTTGTCGTGGCGAGTCTGCTCGGCGGGCTGGCCGGGGCGTGGCG
GCAGGCGGCAAAGCTGGGGACGAAATTTGGCACTAAATGTGCCGCTAAGTTTATCGGCGGGGAGCTTGTCGGGATGGCCG
TGGGTGAGGCTATCAGCGGGCTGTTCAGCAATCCGGTGGATGTGACCACCGGGCAGAAAATCCTGCTGCCGGAAACGGAC
TTCACCCTGCCCGGTCGCCTGCCGGTCACCTGCTCGCGTTTTTACGCCAGCCACCTGGAAACTGTGGGACTGTTGGGACG
GGGCTGGCGGCTGAACTGGGAAACCAGCCTGCGCGATGACGATGAACACATCACGCTGACCGGCGTACAGGGGCGGGAAC
TGCGTTACCCGAAAACGATGCTGACGCCCGGCCACCAGATATTTGACCCGGAAGAACAGTTATACCTCAGCCGCCTGCAT
GACGGGCGTTACGTGCTGCATTACACCGATCGCAGCTATTACGTATTTGGTGATTTTGACAGTGACGGCATGGCATACCT
GCTGTTTATGGAGACGCCGCACCGCCAGCGCATTGTCTTCGGGCACGAAGGAGGCAGACTGGTACGGATAGCCTCCAGCA
GCGGGCATCACCTGTTACTGCACCGCACACAGACCCCGGCAGGGGAGCGGCTGTCGCGAATTGAACTGGTGCAGGGCGGC
ACCCGTGGCAATCTGGTGGAGTACCGGTATGACGATAACGGTCAACTGACCGGCGTGGTGAACCGGGCGGGAACGCAGGT
GCGTCAGTTTGCTTATGAAAACGGGCTGATGACGGCGCACAGCAATGCGACGGGGTTCACCTGCCGCTACCGCTGGCAGG
AACTCGACGGCGCGCCGCGCGTGACGGAGCACGACACCAGTGACGGCGAACATTACCGCTTTGACTATGATTTTGCCGCA
GGCACCACCACCGTCACCGGCAGGCAGGGGGAGACATGGCAGTGGTGGTACGACAGGGAAACGTATATCACCGCGCACCG
GACGCCGGGCGGTGGAATGTACCGCTTCACGTACAACGAAGACCACTTCCCTGTCAACATTGAGCTGCCCGGCGGTCGCA
CGGTGGCGTATGAATATGACATCCAGAACCGGGTGGTGAAGACGACAGATCCGGAAGGCCGGGTGACGCAGACGCAGTGG
AACGGCGAGTTCGACGAAATCACGCGCACGGCGCTGGACGATGACGCTGTCTGGAAAACGCAGTACAACGCCCACGGCCA
GCCAGTGCAGGAGACGGACCCGGAAGGGCGGGTGACGCAGTACGCTTACGATGAACAGGGGCAGATGTGCAGCCGGACGG
ATGCGGCGGGCGGCACGGTGGTGACGGCGTTCGACAGCCGGGGGCAGATGACGCGGTACACCGACTGTTCAGGGCGCAGC
ACAGGATATGACCACGATGAGGACGGCAACCTGACGCGGGTGACGGACGCGGAAGGGAAGGTGGTACGCATCAGCTACAA
CCGACTTGGGTTGCCGGAGACGGTAAACTCACCGGGGAAACAGCAGGACAGGTATACCTGGAATGCGCTGGGGCTGATGA
GCAGCCACCGGCGCATCACGGGGAGCGTGGAGAGCTGGCGGTATACGCCGCGCGGTCTGCTGGCGGCGCACACGGATGAG
GAGAAGCGCGAGACGCGCTGGCAGTACACGCCGGAAGGCCGGGTGGCAGCGCTGACCAACGGCAACGGGGCGCAGTACCG
GTTCAGTCACGATGCGGACGGCAGGCTGGTGCGTGAGGTTCGCCCGGACGGACTGAGCCGTACTTTTATCCTGGACGACA
GCGGTTATCTGACGGCGATACAGACCACGGGCACGCAGGGCGGCGTGCGGCGGGAGACGCAGCAGCGGGATGCGCTGGGC
CGTCTGTTACGGACGGAGAATGAACACGGCCAGCGGACGTTCAGCTACAACCGGCTGGACCAGATAACGGCAGTGACGCT
CACGCCCACGGAGGCGGGGCAACAGCAGCACCGGATGCAGGCCGACACGGTGCGTTTTGAGTATGACCGCAGCGGCTGGC
TGACGGCGGAGCACGCGGGGAACGGTAGCATATGTTATCAGCGCGACGCGCTGGGCAACCCGACGGACATCACGCTGCCG
GACGGGCAGCACCTGACGCATCTGTATTACGGGAGCGGGCATCTGTTACAGACGGCGCTGGACGGCCTGACGGTGAGCGA
GTATGAGCGCGACAGCCTGCACCGTCAGATAATGCGCACGCAGGGGCAGCTTGCGACGTACAGCGGCTATGACGACGACG
GGCTGCTGAGCTGGCAGCGCAGTCTGGCGTCCGGCAGTGCCCCTGTTCTTCCTGGCCAGCGCCCGGCGCGGCAGGGCTGC
GTGACGTCGAGGGACTATTACTGGAACAACCACGGCGAGGTGGGCACGATTGACGACGGCCTGCGTGGCAGCGTGGTGTA
CAGCTATGACAGAAGCGGTTACCTGACCGGGCGCTCAGGTCAGATGTATGACCATGACCGTTATTATTACGATAAGGCGG
GCAACCTGCTGGATAACGAAGGGCAGGGAGCGGTGATGAGCAACCGGCTGCCGGGCTGTGGTCGTGACCGTTACGGCTAT
AACGAGTGGGGCGAGCTGACCACGCGGCGCGACCAGCAACTGGAGTGGAACGCGCAGGGGCAGCTGACGCGGGTCATCAG
CGGCAACACGGAGACGCACTACGGCTACGATGCGCTGGGGAGGCGAACCCGCAAGGCGACGTACGGGCGGCACACGGGCC
ATACGGCGCGGAGCCGGACGGACTTTGTGTGGGAGGGGTTCAGGCTGTTGCAGGAGAACGTGCAGCAGCAGGGCTGGCGG
ACCTATCTGTACGATGCGGAACAGCCGTACACGCCGGTGGCGAGCGTGACGGGGCGGGGAGAAAGCAGGCAGGTGTGGTA
TTACCACACGGATGTGACGGGCACGCCGCAGGAGGTGACGGCGGCGGACGGAACGCTGGTGTGGGCGGGGTATATCAGGG
GGTTTGGAGAGAATGCGGCGGACATCAGCAACAGCGGGGCGTACTTTCACCAGCCGCTGCGGCTGCCGGGGCAGTATTTT
GACGACGAGACAGGGCTGCATTACAATCTGTTCAGATATTATGCACCGGAGTGTGGACGGTTTGTCAGTCAGGATCCGAT
CGGGCTGAGGGGCGGGTTAAACCTTTATCAGTATGCGCCAAATCCTCTCAAATATATAGACCCACTTGGTTTAACCGCGA
CTGTTGGGCGATGGATGGGGCCTGCGGAATATCAGCAAATGCTTGATACTGGGACAGTAGTACAAAGTTCAACAGGGACA
ACTCATGTTGCCTACCCTGCTGATATAGATGCTTTTGGTAAGCAAGCAAAAAATGGTGCTATGTATGTTGAATTTGATGT
GCCTGAAAAATCATTAGTACCTACAAATGAAGGATGGGCAAAAATAGTAGGGCCAGATTCTATCGAAGGGCGATTAGCTA
AACGCAAAGGTTTGCCTGTTCCTGAAATGCCAACAGCAGAAAACATAACTGTAAGGGGCGAGAAAATTAATGGGGAAGTT
GAAGCAAAATGCTAA

Protein sequence :
MYEAARVDDPIYHTSALAGFLIGAIIGIAIIALAAFAFFSCGFLAGLILGFMADQIASGVLQLGEAIGRSIHHTAGKILT
GSENVSTNSRPAARAVLSTVKCDNHIAEKRIAQGSENIYINSQPAARKDDHTECDAVIEDGSPNVFLGGGTQTVLEISSE
IPDWLRKVVDVLFVVASLLGGLAGAWRQAAKLGTKFGTKCAAKFIGGELVGMAVGEAISGLFSNPVDVTTGQKILLPETD
FTLPGRLPVTCSRFYASHLETVGLLGRGWRLNWETSLRDDDEHITLTGVQGRELRYPKTMLTPGHQIFDPEEQLYLSRLH
DGRYVLHYTDRSYYVFGDFDSDGMAYLLFMETPHRQRIVFGHEGGRLVRIASSSGHHLLLHRTQTPAGERLSRIELVQGG
TRGNLVEYRYDDNGQLTGVVNRAGTQVRQFAYENGLMTAHSNATGFTCRYRWQELDGAPRVTEHDTSDGEHYRFDYDFAA
GTTTVTGRQGETWQWWYDRETYITAHRTPGGGMYRFTYNEDHFPVNIELPGGRTVAYEYDIQNRVVKTTDPEGRVTQTQW
NGEFDEITRTALDDDAVWKTQYNAHGQPVQETDPEGRVTQYAYDEQGQMCSRTDAAGGTVVTAFDSRGQMTRYTDCSGRS
TGYDHDEDGNLTRVTDAEGKVVRISYNRLGLPETVNSPGKQQDRYTWNALGLMSSHRRITGSVESWRYTPRGLLAAHTDE
EKRETRWQYTPEGRVAALTNGNGAQYRFSHDADGRLVREVRPDGLSRTFILDDSGYLTAIQTTGTQGGVRRETQQRDALG
RLLRTENEHGQRTFSYNRLDQITAVTLTPTEAGQQQHRMQADTVRFEYDRSGWLTAEHAGNGSICYQRDALGNPTDITLP
DGQHLTHLYYGSGHLLQTALDGLTVSEYERDSLHRQIMRTQGQLATYSGYDDDGLLSWQRSLASGSAPVLPGQRPARQGC
VTSRDYYWNNHGEVGTIDDGLRGSVVYSYDRSGYLTGRSGQMYDHDRYYYDKAGNLLDNEGQGAVMSNRLPGCGRDRYGY
NEWGELTTRRDQQLEWNAQGQLTRVISGNTETHYGYDALGRRTRKATYGRHTGHTARSRTDFVWEGFRLLQENVQQQGWR
TYLYDAEQPYTPVASVTGRGESRQVWYYHTDVTGTPQEVTAADGTLVWAGYIRGFGENAADISNSGAYFHQPLRLPGQYF
DDETGLHYNLFRYYAPECGRFVSQDPIGLRGGLNLYQYAPNPLKYIDPLGLTATVGRWMGPAEYQQMLDTGTVVQSSTGT
THVAYPADIDAFGKQAKNGAMYVEFDVPEKSLVPTNEGWAKIVGPDSIEGRLAKRKGLPVPEMPTAENITVRGEKINGEV
EAKC

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0321 NP_454898.1 Rhs-family protein Not tested SPI-6 Protein 0.0 99