PAI Gene Information


Name : sopB (SC1043)
Accession : YP_216030.1
PAI name : SPI-5
PAI accession : NC_006905_P1
Strain : Salmonella enterica RSK2980
Virulence or Resistance: Virulence
Product : outer protein
Function : -
Note : similar to ipgD of Shigella
Homologs in the searched genomes : 50 hits (50 protein-level)
Publication :
    -Chiu,C.-H., Tang,P., Chu,C., Bao,Q., Hu,S., Yu,J., Chou,Y.-Y., Wang,H.-S. and Lee,Y.-S., "Direct Submission", Submitted (03-SEP-2004) Chang Gung Genomic Medical Center, No. 5, Fu-Shing St., Kweishan, Taoyuan 333, Taiwan.

    -Chiu,C.H., Tang,P., Chu,C., Hu,S., Bao,Q., Yu,J., Chou,Y.Y., Wang,H.S. and Lee,Y.S., "The genome sequence of Salmonella enterica serovar Choleraesuis, a highly invasive and resistant zoonotic pathogen", Nucleic Acids Res. 33 (5), 1690-1698 (2005), PUBMED 15781495. Erratum: Nucleic Acids Res. 2005;33(7):2351. Publication status: online only.

    -Chiu,C.H., Tang,P., Chu,C., Hu,S., Bao,Q., Yu,J., Chou,Y.Y., Wang,H.S. and Lee,Y.S., "Direct Submission", Submitted (04-APR-2005) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
ATGCAAATACAGAGCTTCTATCACTCAGCTTCACTAAAAACCCAGGAGGCTTTTAAAAGCCTACAAAAAACCTTATACAA
CGGAATGCAGATTCTCTCAGGCCAGGGCAAAGCGCCGGCTAAAGCGCCCGACGCTCGCCCGGAAATTATTGTCCTGCGAG
AACCCGGCGCGACATGGGGGAATTATCTACAGCATCAGAAGGCGTCTAACCACTCGCTGCATAACCTCTATAACTTACAG
CGCGATCTTCTTACCGTCGCGGCAACCGTTCTGGGTAAACAAGACCCGGTTCTAACGTCAATGGCAAACCAAATGGAGTT
AGCCAAAGTTAAAGCGGACCGGCCAGCAACAAAACAAGAAGAAGCCGCGGCAAAAGCATTGAAGAAAAATCTTATCGAAC
TTATTGCAGCACGCACTCAGCAGCAGGATGGCTTACCTGCAAAAGAAGCTCATCGCTTTGCGGCAGTAGCGTTTAGAGAT
GCTCAGGTCAAGCAGCTTAATAACCAGCCCTGGCAAACCATAAAAAATACACTCACGCATAACGGGCATCACTATACCAA
CACGCAGCTCCCTGCAGCAGAGATGAAAATCGGCGCAAAAGATATCTTTCCCAGTGCTTATGAGGGAAAGGGCGTATGCA
GTTGGGATACCAAGAATATTCATCACGCCAATAATTTGTGGATGTCCACGGTGAGTGTGCATGAGGACGGTAAAGATAAA
ACGCTTTTTTGCGGGATACGTCATGGCGTGCTTTCCCCCTATCATGAAAAAGATCCGCTTCTGCGTCACGTCGGCGCTGA
AAACAAAGCCAAAGAAGTATTAACTGCGGCACTTTTTAGTAAACCTGAGTTGCTTAACAAAGCCTTAGCGGGCGAGGCGG
TAAGCCTGAAACTGGTATCCGTCGGGTTACTCACCGCGTCGAATATTTTCGGCAAAGAGGGAACGATGGTCGAGGACCAA
ATGCGCGCATGGCAATCGTTGACCCAGCCGGGAAAAATGATTCATTTAAAAATCCGCAATAAAGATGGCGATCTACAGAC
GGTAAAAATAAAACCGGACGTCGCCGCATTTAATGTGGGTGTTAATGAGCTGGCGCTCAAGCTCGGCTTTGGCCTTAAGG
CATCGGATAGCTATAATGCCGAGGCGCTACATCAGTTATTAGGCAATGATTTACGCCCTGAAGCCAGACCAGGTGGCTGG
GTTGGCGAATGGCTGGCGCAATACCCGGATAATTATGAGGTCGTCAATACATTAGCGCGCCAGATTAAGGATATATGGAA
AAATAACCAACATCATAAAGATGGCGGCGAACCCTATAAACTCGCACAACGCCTTGCCATGTTAGCCCATGAAATTGACG
CGGTACCCGCCTGGAATTGTAAAAGCGGCAAAGATCGTACAGGGATGATGGATTCAGAAATCAAGCGAGAGATCATTTCC
TTACATCAGACCCATATGTTAAGTGCGCCTGGTAGTCTTCCGGATAGCGGTGGACAGAAAATTTTCCAAAAAGTATTACT
GAATAGCGGTAACCTGGAGATTCAGAAACAAAATACGGGCGGGGCGGGAAACAAAGTAATGAAAAATTTATCGCCAGAGG
TGCTCAATCTTTCCTATCAAAAACGAGTTGGGGATGAAAATATTTGGCAGTCAGTAAAAGGCATTTCTTCATTAATCACA
TCTTGA

Protein sequence :
MQIQSFYHSASLKTQEAFKSLQKTLYNGMQILSGQGKAPAKAPDARPEIIVLREPGATWGNYLQHQKASNHSLHNLYNLQ
RDLLTVAATVLGKQDPVLTSMANQMELAKVKADRPATKQEEAAAKALKKNLIELIAARTQQQDGLPAKEAHRFAAVAFRD
AQVKQLNNQPWQTIKNTLTHNGHHYTNTQLPAAEMKIGAKDIFPSAYEGKGVCSWDTKNIHHANNLWMSTVSVHEDGKDK
TLFCGIRHGVLSPYHEKDPLLRHVGAENKAKEVLTAALFSKPELLNKALAGEAVSLKLVSVGLLTASNIFGKEGTMVEDQ
MRAWQSLTQPGKMIHLKIRNKDGDLQTVKIKPDVAAFNVGVNELALKLGFGLKASDSYNAEALHQLLGNDLRPEARPGGW
VGEWLAQYPDNYEVVNTLARQIKDIWKNNQHHKDGGEPYKLAQRLAMLAHEIDAVPAWNCKSGKDRTGMMDSEIKREIIS
LHQTHMLSAPGSLPDSGGQKIFQKVLLNSGNLEIQKQNTGGAGNKVMKNLSPEVLNLSYQKRVGDENIWQSVKGISSLIT
S
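
The protein sequence above corresponds to a codon-by-codon translation of the DNA sequence. A minimal sketch of that check follows, assuming the standard genetic code (which matches the bacterial code for every codon used here); the names `CODON_TABLE` and `translate` are illustrative and not part of this record. Only the first 33 nucleotides and the final 6 nucleotides of the DNA sequence are used as input.

```python
from itertools import product

# Standard genetic code listed in TCAG order (assumption: the record
# uses the standard/bacterial codon assignments; '*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 11 codons of the DNA sequence in this record.
print(translate("ATGCAAATACAGAGCTTCTATCACTCAGCTTCA"))  # → MQIQSFYHSAS

# Last two codons: serine followed by the TGA stop codon.
print(translate("TCTTGA"))  # → S
```

Passing the full DNA sequence with line breaks removed to `translate` allows the same comparison against the complete protein sequence.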