PAI Gene Information


Name : STY0294
Accession : NP_454876.1
PAI name : SPI-6
PAI accession : NC_003198_P1
Strain : Salmonella enterica RSK2980
Virulence or Resistance: Not determined
Product : ClpB-like protein
Function : -
Note : Similar to Escherichia coli ClpB heat shock protease SW:CLPB_ECOLI (P03815) (857 aa) fasta scores: E(): 0, 37.9% id in 892 aa; Paralogue of E. coli clpB (CLPB_ECOLI); Fasta hit to CLPB_ECOLI (857 aa), 38% identity in 892 aa overlap
Homologs in the searched genomes :   1865 hits    ( 1864 protein-level,   1 DNA-level )  
Publication :
    -Parkhill,J., "Direct Submission", Submitted (25-OCT-2001) Submitted on behalf of the Salmonalla sequencing team, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.

    -Parkhill,J., Dougan,G., James,K.D., Thomson,N.R., Pickard,D., Wain,J., Churcher,C., Mungall,K.L., Bentley,S.D., Holden,M.T., Sebaihia,M., Baker,S., Basham,D., Brooks,K., Chillingworth,T., Connerton,P., Cronin,A., Davis,P., Davies,R.M., Dowd,L., White,N., , "Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18", Nature 413 (6858), 848-852 (2001) PUBMED 11677608.

    -Parkhill,J., Dougan,G., James,K.D., Thomson,N.R., Pickard,D., Wain,J., Churcher,C., Mungall,K.L., Bentley,S.D., Holden,M.T., Sebaihia,M., Baker,S., Basham,D., Brooks,K., Chillingworth,T., Connerton,P., Cronin,A., Davis,P., Davies,R.M., Dowd,L., White,N., , "Direct Submission", Submitted (10-SEP-2013) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
ATGGAAACTCCTGTTTCACGCAGTGCGTTATATGGAAAACTGGCCGGCCCACTATTCCGGTCGCTGGAATCGGCAACGGC
ATTTTGCAAACTACGCTCTAATCCCTGGGGTGAGCTGACTCACTGGCTGCACCAGTTAACACAGCAGCCCGATAACGATA
TTCTCCACGTTCTTCGGCATTACCAGATCCCTCTTTCTGATGTGGAGAAAGCGTTACTCCGGCAACTGGATATGCTGCCC
GCCGGGGCCAGCGCCATTAGTGATTTTTCTCACCATATCGATCTCAGCGTTGAAAAGGCCTGGATGCTGGCGAGCGTCCG
TTACGGCGATAACAAAATTCGCAGCGGCTGGTTGCTGCTGGCCTTGTTGACCACGCCAGAACTGCGTCGGGTACTGAGCA
GTATCTGCGCGCCGCTGGCCACGCTTCCGGTTGATGAACTGACGGAAATACTGCCCTCGTTGATCGAAACATCGCCGGAA
GCGCAGGAGCGCCCTTACGACGGCTCCGGTCTGGCATCAGCCATTCCCGGTGAAAGCAGCCAGGCGATTCCCAACGGCGT
GCAGGACGGTAAATCCGCGCTGGCAAAATACTGTCAGGACATGACGGCACAGGCGCGCGACGGCAAAATCGACCCGGTGA
CGGGGCGTGAGCATGAAATCCGCACCATGACGGATATTCTGCTGCGCCGTCGCCAGAATAATCCCCTACTGACTGGTGAG
GCGGGCGTCGGAAAGACGGCGGTCGTCGAAGGTTTTGCCCTCGCGATTGCGCAGGGGGAAGTGCCGCCCGCGCTGCGGGA
AGTACGGCTGCTGGCGCTGGACGTTGGCGCTCTGTTGGCCGGAGCCAGCATGAAAGGCGAGTTTGAATCGCGTCTGAAAG
GGTTACTGGAAGAGGCCGGGCGCTCGCCGCAGCCGGTTATTCTGTTTGTCGATGAAGTTCACACTCTGGTGGGCGCGGGC
GGCGCATCCGGCACGGGCGATGCCGCTAACCTGCTGAAACCGGCGCTGGCGTGCGGCACCCTGCGGACTATCGGCGCCAC
CACCTGGAGCGAATACAAGCGCCATATTGAGAAAGATCCGGCGCTGACCCGTCGTTTTCAGGTGTTGCAGATTGCCGAGC
CGGAAGAGATCCCCGCAATGGAAATGGTGCGTGGTCTGGTGGATACGCTGGAAAAACACCATAACGTACTGATTCTGGAT
GAGGCGGTACGTGCGGCGGTACAGCTTTCTCACCGCTACATTCCCGCCCGGCAGTTGCCGGATAAGGCCATCAGCCTACT
GGATACCGCCGCGGTCCGCGTGGCGCTGACGCTGCACACGCCGCCTGCCAGCGTACAGTTCCTGCGTCAGCAGCTAAAAG
CGGCGGAAATGGAACGGTCGCTGTTGCAGAAGCAGGAAAAAATGGGGATTCAGTCAGATGAGCGGTGCGATGCGCTGACG
GCGCGAATTTTCTCGCTCAACGATGAACTGACTGCATCCGAATCCCGCTGGCAGCGGGAGCTGGAACTGGTACATACGTT
GCAGGAACTGCGTGTCGCAGAGTCTGATGCTGATGACAAAACCACGCTGCAACAGGCCGAAACAGCGCTAAGGGAGTGGC
AGGGCGACGCGCCGGTGGTGTTCCCGGAAGTCAGCGCGGCGATTGTCGCGGCGATTGTCGCCGACTGGACCGGTATTCCT
GCCGGGCGCATGGTGAAAGATGAGGCCAGCCAGGTGCTGGAACTGCCTGCCCGACTGGCGCAACGCGTTACCGGGCAAGA
CGGCGCGCTGGCGCAGATTGGTGAACGTATTCAGACCGCCAGGGCGGGACTGGGCGATCCACGCAAACCGGTGGGCGTGT
TTATGCTGGCCGGGCCGTCCGGTGTCGGTAAAACCGAAACCGCGCTGGCGCTGGCGGAGGCTATCTACGGCGGTGAGCAG
AACCTGGTAACCATCAATATGAGCGAGTTCCAGGAGGCTCACACCGTTTCCACACTGAAAGGCGCGCCGCCCGGCTATGT
GGGCTATGGCGAGGGTGGTGTGCTGACGGAAGCGGTGCGTCGTCACCCCTGGAGCGTAGTGCTGCTCGACGAGATCGAAA
AAGCGCACCATGACGTCCATGAACTCTTCTATCAGGTGTTTGACAAGGGCGGGATGGAGGACGGTGAGGGAACACATGTC
GATTTCAAAAACACCACGCTACTACTCACCACCAATGTAGGTTCCGACCTCATCAGCCAGATGTGTGAAGATCCGGCCTT
AATGCCCGATGCTACGGGGCTTAAAGAGGCGCTAATGCCGGAATTGCGCAAGCATTTCCCGGCGGCATTTCTGGGCCGCG
TGACGGTGATCCCTTACCTGCCGCTGGATGAAACGTCGCGTGGCGTGATTGCCCGTCTGCATCTTGACCGGCTGGTGGCG
CGGATGGGTGAACAGCACGGGGTGACGCTGACGTATAGTGAGGAACTGGTCGCACATATTGTGGCGTGCTGTCCAATGCA
TGAAACGGGCGCGCGGTTGCTGATTGGCTACATCGAACAGCACATCCTGCCACAACTGTCGCGCTACTGGTTGCAGGCCA
TGACGGAAAAAGCCGCTATCAGGCAGATTGATATCGGCGTTAATGGTGATGAGCAGATTGTTTTTGAGACAACCTCGCAG
GAGGGAATATGCCAAAAGAGTTAA

Protein sequence :
METPVSRSALYGKLAGPLFRSLESATAFCKLRSNPWGELTHWLHQLTQQPDNDILHVLRHYQIPLSDVEKALLRQLDMLP
AGASAISDFSHHIDLSVEKAWMLASVRYGDNKIRSGWLLLALLTTPELRRVLSSICAPLATLPVDELTEILPSLIETSPE
AQERPYDGSGLASAIPGESSQAIPNGVQDGKSALAKYCQDMTAQARDGKIDPVTGREHEIRTMTDILLRRRQNNPLLTGE
AGVGKTAVVEGFALAIAQGEVPPALREVRLLALDVGALLAGASMKGEFESRLKGLLEEAGRSPQPVILFVDEVHTLVGAG
GASGTGDAANLLKPALACGTLRTIGATTWSEYKRHIEKDPALTRRFQVLQIAEPEEIPAMEMVRGLVDTLEKHHNVLILD
EAVRAAVQLSHRYIPARQLPDKAISLLDTAAVRVALTLHTPPASVQFLRQQLKAAEMERSLLQKQEKMGIQSDERCDALT
ARIFSLNDELTASESRWQRELELVHTLQELRVAESDADDKTTLQQAETALREWQGDAPVVFPEVSAAIVAAIVADWTGIP
AGRMVKDEASQVLELPARLAQRVTGQDGALAQIGERIQTARAGLGDPRKPVGVFMLAGPSGVGKTETALALAEAIYGGEQ
NLVTINMSEFQEAHTVSTLKGAPPGYVGYGEGGVLTEAVRRHPWSVVLLDEIEKAHHDVHELFYQVFDKGGMEDGEGTHV
DFKNTTLLLTTNVGSDLISQMCEDPALMPDATGLKEALMPELRKHFPAAFLGRVTVIPYLPLDETSRGVIARLHLDRLVA
RMGEQHGVTLTYSEELVAHIVACCPMHETGARLLIGYIEQHILPQLSRYWLQAMTEKAAIRQIDIGVNGDEQIVFETTSQ
EGICQKS