PAI Gene Information


Name : sigD
Accession : NP_455588.1
PAI name : SPI-5
PAI accession : NC_003198_P2
Strain : Salmonella enterica RSK2980
Virulence or Resistance: Virulence
Product : cell invasion protein
Function : -
Note : Similar to Salmonella dublin SopB TR:O34105 (EMBL:U90203) fasta scores: E(): 0, 97.0% id in 561 aa and to Salmonella typhimurium invasion protein SigD TR:O30916 (EMBL:AF021817) fasta scores: E(): 0, 97.7% id in 563 aa.
Homologs in the searched genomes : 48 hits (48 protein-level)
Publication :
    -Parkhill,J., "Direct Submission", Submitted (25-OCT-2001) Submitted on behalf of the Salmonella sequencing team, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.

    -Parkhill,J., Dougan,G., James,K.D., Thomson,N.R., Pickard,D., Wain,J., Churcher,C., Mungall,K.L., Bentley,S.D., Holden,M.T., Sebaihia,M., Baker,S., Basham,D., Brooks,K., Chillingworth,T., Connerton,P., Cronin,A., Davis,P., Davies,R.M., Dowd,L., White,N., "Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18", Nature 413 (6858), 848-852 (2001) PUBMED 11677608.

    -Parkhill,J., Dougan,G., James,K.D., Thomson,N.R., Pickard,D., Wain,J., Churcher,C., Mungall,K.L., Bentley,S.D., Holden,M.T., Sebaihia,M., Baker,S., Basham,D., Brooks,K., Chillingworth,T., Connerton,P., Cronin,A., Davis,P., Davies,R.M., Dowd,L., White,N., "Direct Submission", Submitted (10-SEP-2013) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
ATGCAAATACAGAGCTTCTATCACTCAGCTTCACTAAAAACCCAGGAGGCTTTTAAAAGCCTACAAAAAACCTTATACAA
CGGAATGCAGATTCTCTCAGGCCAGGGCAAAGCGCCGGCTAAAGCGCCCGACGCTCGCCCGGAAATTATTGTCCTGCGAG
AACCTGGCGCGACATGGGGGAATTATCTACAGCATCAGAAGACGTCTAACCACTCGCTGCATAACCTCTATAACTTACAG
CGCGATCTTCTTACCGTCGCGGCAACCGTTCTGGGTAAACAAGACCCGGTTCTAACGTCAATGGCAAACCAAATGGAGTT
AGCCAAAGTTAAAGCGGACCGGCCAGCAACAAAACAAGAAGAAGCTGCGGCAAAAGCATTGAAGAAAAATCTTATCGAAC
TTATTGCAGCACGCACTCAGCAGCAAAATGGCTTACCTGCAAAAGAAGCTCATCGCTTTGCGGCAGTAGCGTTTAGAGAT
GCTCAGGTCAAGCAGCTCAATAACCAGCCCTGGCAAACCATAAAAAATACACTCACGCATAACGGGCATCACTATACCAA
CACGCAGCTCCCTGCCGCAGAGATGAAAATCGGCGCAAAAGATATCTTTCCCAGTGCTTATGAGGGAAAGGGCGTATGCA
GTTGGGATACCAAGAATATTCATCACGCCAATAATTTGTGGATGTCCACGGTGAGTGTGCATGAGGACGGTAAAGATAAA
ACGCTTTTTTGCGGGATACGTCATGGTGTGCTTTCCCCCTATCATGAAAAAGATCCGCTTCTGCGTCAGGCCGGCGCTGA
AAACAAAGCCAAAGAAGTATTAGCTGCGGCACTTTTTAGTAAACCTGAGTTGCTTAACAGAGCCTTAGAGGGCGAAGCGG
TAAGCCTGAAACTGGTATCCGTCGGGTTACTCACCGCGTCGAATATTTTCGGCAAAGAGGGAACTATGGTCGAGGATCAA
ATGCGCGCATGGCAATCGTTGACCCAGCCGGGAAAAATGATTCATTTAAAAATCCGCAATAAAGATGGCGATCTACAGAC
GGTAAAAATAAAACCGGACGTCGCCGCATTTAATGTGGGTGTTAATGAGCTGGCGCTCAAGCTCGGCTTTGGCCTTAAAG
CATCAGATAGCTATAATGCCGAAGCGCTACATCAGTTATTAGGCAATGATTTACGCCCTGAAGCCAGACCAGGTGGCTGG
GTTGGCGAATGGCTGGCGCAATACCCGGATAATTATGAGGTCGTCAATACATTAGCGCGCCAGATTAAGGATATCTGGAA
AAATAACCAACATCATAAAGATGGCGGCGAACCCTATAAACTCGCACAACGCCTTGCCATGTTAGCCCATGAAATTGACG
CGGTGCCCGCCTGGAATTGTAAAAGCGGCAAAGATCGTACAGGGATGATGGATTCAGAAATCAAGCGAGAGCTCATTTCT
TTCCATCAGACCCATATGTTAAGTGCGCCTGGTAGTCTTCCGGATAGCGGTGGACAGAAAATTTTCCAAAAAGTATTACT
GAATAGCGGTAACCTGGAGATTCAGAAACAAAATACGGGCGGGGCGGGAAACAAAGTAATGAAAAATTTATCGCCAGAGG
TGCTCAATCTTTCCTATCAAAAACGAGTTGGGGATGAAAATATTTGGCAGTCAGTAAAAGGTATTTCTTCATTAATCACA
TCTTGA

Protein sequence :
MQIQSFYHSASLKTQEAFKSLQKTLYNGMQILSGQGKAPAKAPDARPEIIVLREPGATWGNYLQHQKTSNHSLHNLYNLQ
RDLLTVAATVLGKQDPVLTSMANQMELAKVKADRPATKQEEAAAKALKKNLIELIAARTQQQNGLPAKEAHRFAAVAFRD
AQVKQLNNQPWQTIKNTLTHNGHHYTNTQLPAAEMKIGAKDIFPSAYEGKGVCSWDTKNIHHANNLWMSTVSVHEDGKDK
TLFCGIRHGVLSPYHEKDPLLRQAGAENKAKEVLAAALFSKPELLNRALEGEAVSLKLVSVGLLTASNIFGKEGTMVEDQ
MRAWQSLTQPGKMIHLKIRNKDGDLQTVKIKPDVAAFNVGVNELALKLGFGLKASDSYNAEALHQLLGNDLRPEARPGGW
VGEWLAQYPDNYEVVNTLARQIKDIWKNNQHHKDGGEPYKLAQRLAMLAHEIDAVPAWNCKSGKDRTGMMDSEIKRELIS
FHQTHMLSAPGSLPDSGGQKIFQKVLLNSGNLEIQKQNTGGAGNKVMKNLSPEVLNLSYQKRVGDENIWQSVKGISSLIT
S
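
The DNA and protein sequences above can be checked against each other by translating the coding sequence codon by codon. The following is a minimal sketch, assuming the standard genetic code, which assigns sense and stop codons the same way as the bacterial code normally used for Salmonella. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration and are not part of this record or of any database tool. Only two short excerpts of the record are used here: the first 18 nucleotides and the final 6.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    block can be pasted in as-is. Codons containing characters other than
    A, C, G, T raise KeyError in this sketch.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nucleotides of the DNA sequence in this record.
print(translate("ATGCAAATACAGAGCTTC"))  # MQIQSF
# Last 6 nucleotides: a serine codon followed by the TGA stop codon.
print(translate("TCTTGA"))  # S
```

Passing the complete DNA sequence block to `translate` and comparing the result with the protein sequence block (after joining its lines) would confirm that the two fields of the record agree.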