PAI Gene Information


Name : sinH
Accession : AAD25115.1
PAI name : CS54 island
PAI accession : AF140550
Strain : Salmonella enterica RSK2980
Virulence or Resistance: Virulence
Product : SinH
Function : -
Note : similar to Citrobacter freundii and Escherichia coli intimin proteins; similar to Yersinia pseudotuberculosis and Yersinia pestis invasin proteins
Homologs in the searched genomes :   62 hits    ( 54 protein-level,   8 DNA-level )  
Publication :
    -Kingsley,R.A. and Baumler,A.J., "Direct Submission", Submitted (05-APR-1999) Medical Microbiology and Immunology, Texas A&M University, Reynolds Medical Building, College Station, TX 77843-1114, USA.

    -Kingsley,R.A. and Baumler,A.J., "Direct Submission", Submitted (21-JUN-1999) Medical Microbiology and Immunology, Texas A&M University, Reynolds Medical Building, College Station, TX 77843-1114, USA REMARK Sequence update by submitter.

    -Kingsley,R.A., Humphries,A.D., Weening,E.H., De Zoete,M.R., Winter,S., Papaconstantinopoulou,A., Dougan,G. and Baumler,A.J., "Molecular and phenotypic analysis of the CS54 island of Salmonella enterica serotype typhimurium: identification of intestinal colonization and persistence determinants", Infect. Immun. 71 (2), 629-640 (2003) PUBMED 12540539.

    -Kingsley,R.A., van Amsterdam,K. and Baumler,A.J., "The presence of a pathogenicity island specific to Salmonella enterica subspecies I correlates with adaptation to warm blooded animals", Unpublished.

    -Kingsley,R.A., van Amsterdam,K. and Baumler,A.J., "Direct Submission", Submitted (09-SEP-1998) Medical Microbiology and Immunology, Texas A&M University, 407 Reynolds Medical Building, College Station, TX 77843-1114, USA.

    -Kingsley,R.A., van Amsterdam,K., Edwards,E.W., Hargis,B.M. and Baumler,A.J., "Complete sequence of the xseA-hisS intergenic region of the S. enterica serotype Typhimurium genome and its distribution within the genus Salmonella", Unpublished.


DNA sequence :
ATGGTGCGAAGGATTCGCATTCTGGTTTTGCTATTAGTGTCTGGAGTCTCAGTTGCGGCTTCTGGCGCAGATAACTCTGC
ATCTGCAAATGATGAAAGCCACTCCTCCCTGCCTGATTTAGCCAGCGAATCCGCGAAAAAAGAAGAACAGGAAAATAAAG
GAAAGTCCTTTAAAGAACAGGGCGCTGACTACTTTATTAATTCCGCCACCCAGGGCTTTGATAATCTCACCCCGGAAGCG
CTGGAATCCCAGGCGCGTAGCTATTTGCAAAACCAAATCACTTCCTCAGCACAGTCATATCTTGAAGGTGTAATGTCACC
TTATGGCAAGATCCGTACCAGCCTGTCCGTGGGCGAAGGCGGCGATCTGGACGGCAGTTCGCTGGATTATTTTATTCCCT
GGTATGACAACCAGAGCACGCTTTTCTTTAGCCAGATTTCGGCTCAGCGTAAGGAAGACCGCACCATTGGTAACTTTGGG
CTTGGCGTTCGCCAGAACGTCGGCAACTGGCTGCTTGGCGGCAACGCATTCTATGACTACGATTTTACGCGCGGTCACCG
TCGATTAGGTCTGGGAACTGAAGCCTGGACGGATTACCTGAAATTCTCCGGCAACTATTACCACCCTCTTTCTGACTGGA
AAGACTCCGAAGATTTCGACTTTTATGAAGAACGACCCGCGCGCGGCTGGGATATCCGCATGGAAAGCTGGCTGCCGTTT
TATCCGCAGCTTGGCGCTAAATTAGTTTACGAACAGTACTACGGCGATGAAGTAGCGCTGTTTGGCACCGATAATCTGCA
AAAAGATCCTCATGCCGTCACTTTGGGGCTGGAATATACGCCGGTACCGCTGGTCACGGTTGGTACGGATTACAAAGCCG
GTACTGGCGATAGTAACGACTTTAGCGTTAACGCGACCGTGAACTACCAGATAGGCACGCCGCTGGCGGCGCAGCTCGAT
CCGGAAAACGTTAAAATTCAGCACTCCTTAATGGGCAGCCGCACCGATTTTGTCGATCGTAACAATTTCATCATTCTCGA
ATATCGTGAGAAAGATCCGCTGGACGTCACGTTGTGGCTGAAGGCTGACGCCACCAATGAACACCCGGAATGCGTGATTG
AAGATACGCCTGAGGCCGCTGTTGGTCTGGAAAAGTGCAAATGGACTGTCAACGCGCTGATTAACCACCATTACAAAATT
ATTTCTGCCTCCTGGCAGGCGAAAAACAACGCCGCCCGTACGCTGGTGATGCCGGTGGTAAAAGCTAATGCGTTAACCGA
AGGCAACAATAACAGCTGGAATCTGGTGCTTCCTGCCTGGGTTAATGCCGATACCGAAGAGCAGCGCACCGCCCTGAACA
CCTGGAAAGTACGCATGACGCTGGAAGATGAAAAGGGCAATAAGCAGAACTCCGGCGTGGTGGAAATCACCGTCCAGCAG
GATCGCAAAATTGAGCTGATCGTCGACAACATTGCCGATACCGATCGCAGCGATCACAGCCATGAAGCCAGCGCGCTGGC
CGATGGCGAAGACGGCGTGGTGATGGATCTGCTGATCACCGATTCCTTTGGCGACTCCACGGATCGCAATGGTAACGAAC
TGGTGGACGATGCCATGACGCCCGTGTTGTACGACAGCAATGATAAAAAAGTCACCCTGGCCCAGACGCCCTGCACCACG
GAAACCCCCTGCGTGTTTATTGCCAGTCGTGACAAAGAAGCGGGGACGGTGACGCTTTCCAGCACCCTGCCGGGGACTTT
CCGCTGGAAGGCGAAAGCAGACGCCTATGGCGACAGTAACTACGTTGATGTCACTTTTATCGGCGACAACCTGAGCGCGC
TGAATGCGGTGATTTATCAGGTCAAAGCCGCCAACCCTGTCAACCTTATCGGCAAAGAAGATAAGCACCCTACGGTCAAT
AATGCTTACCGTTTCCTGCTGTGGCGCGACAAAAATAAAGACGGCGTATTCCAGATGTCCGAGCAGCTCACGGAAGAAGA
GATGGCGCTGTACGACTACCAGTGGGAGTTTACCGGGCAGAGTACAAACGGACATACCGGCGCGCTGGCCAACACCATGA
ATGAAGATCTTGTCCTGCCAGTAACCAATAAAGAGGCGGCGCAGAAGTTTGCCGCCAATGTGGAAGATGGCGTTCAGGGT
TACGGCATACGCGTTACGTATAGTCAGAAATAG

Protein sequence :
MVRRIRILVLLLVSGVSVAASGADNSASANDESHSSLPDLASESAKKEEQENKGKSFKEQGADYFINSATQGFDNLTPEA
LESQARSYLQNQITSSAQSYLEGVMSPYGKIRTSLSVGEGGDLDGSSLDYFIPWYDNQSTLFFSQISAQRKEDRTIGNFG
LGVRQNVGNWLLGGNAFYDYDFTRGHRRLGLGTEAWTDYLKFSGNYYHPLSDWKDSEDFDFYEERPARGWDIRMESWLPF
YPQLGAKLVYEQYYGDEVALFGTDNLQKDPHAVTLGLEYTPVPLVTVGTDYKAGTGDSNDFSVNATVNYQIGTPLAAQLD
PENVKIQHSLMGSRTDFVDRNNFIILEYREKDPLDVTLWLKADATNEHPECVIEDTPEAAVGLEKCKWTVNALINHHYKI
ISASWQAKNNAARTLVMPVVKANALTEGNNNSWNLVLPAWVNADTEEQRTALNTWKVRMTLEDEKGNKQNSGVVEITVQQ
DRKIELIVDNIADTDRSDHSHEASALADGEDGVVMDLLITDSFGDSTDRNGNELVDDAMTPVLYDSNDKKVTLAQTPCTT
ETPCVFIASRDKEAGTVTLSSTLPGTFRWKAKADAYGDSNYVDVTFIGDNLSALNAVIYQVKAANPVNLIGKEDKHPTVN
NAYRFLLWRDKNKDGVFQMSEQLTEEEMALYDYQWEFTGQSTNGHTGALANTMNEDLVLPVTNKEAAQKFAANVEDGVQG
YGIRVTYSQK