PAI Gene Information


Name : gspD
Accession : CAE85234.1
PAI name : PAI V 536
PAI accession : AJ617685
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : GspD, hypothetical type II secretion protein
Function : -
Note : ORF84
Homologs in the searched genomes :   209 hits    ( 207 protein-level,   2 DNA-level )  
Publication :
    -Dobrindt,U., "Direct Submission", Submitted (11-NOV-2003) Dobrindt U., Inst. f. Molekulare Infektionsbiologie, Universitaet Wuerzburg, Roentgenring 11, 97070 Wuerzburg, GERMANY.

    -Schneider,G., Dobrindt,U., Bruggemann,H., Nagy,G., Janke,B., Blum-Oehler,G., Buchrieser,C., Gottschalk,G., Emody,L. and Hacker,J., "The pathogenicity island-associated K15 capsule determinant exhibits a novel genetic structure and correlates with virulence in uropathogenic Escherichia coli strain 536", Infect. Immun. 72 (10), 5993-6001 (2004) PUBMED 15385503.


DNA sequence :
ATGGGGCCGGGCGTTCAGGGGAAAGTGAGTATTCGCACTATGACCCCGCTCAATGAACGCCAGTATTACCAGCTATTCCT
TAACCTGCTGGAAGCACAGGGGTATGCCGTCGTACCGATGGAAAACGACGTGCTGAAGGTGGTGAAATCCAGCGCTGCGA
AAGTCGAGCCGCTGCCGCTGGTCGGTGAAGGCAGCGACAACTACGCGGGCGATGAAATGGTCACCAAAGTCGTGCCGGTA
CGTAATGTGTCGGTACGCGAGCTGGCACCGATTCTGCGCCAGATGATTGACAGCGCAGGCTCAGGCAACGTTGTTAATTA
CGATCCCTCCAACGTGATTATGCTTACCGGACGCGCCTCAGTTGTGGAGCGCCTGACAGAAGTGATCCAGCGCGTGGATC
ACGCAGGTAATCGCACCGAAGAGGTGATCCCGCTGGATAACGCTTCAGCCTCGGAAATTGCCCGCGTGCTGGAAAGCCTG
ACGAAAAACAGCGGCGAGAACCAGCCTGCGACGCTGAAATCTCAAATTGTCGCCGACGAACGCACCAACAGTGTGATTGT
CAGTGGTGACCCGGCCACGCGGGACAAAATGCGCCGCCTGATCCGTCGGCTGGACTCAGAAATGGAGCGCAGCGGCAACA
GCCAGGTTTTCTATCTCAAATACAGCAAAGCCGAAGATCTGGTCGATGTACTGAAGCAGGTCAGCGGCACGCTCACGGCG
GCTAAAGAAGAGGCGGAAGGCACCGTCGGTAGCGGGCGTGAGGTTGTCTCCATCGCCGCCAGCAAACACAGTAATGCCCT
GATTGTTACCGCGCCGCAGGACATCATGCAGTCGCTGCAAAGCGTGATTGAACAACTGGATATTCGCCGTGCTCAGGTGC
ATGTCGAGGCGTTAATCGTGGAAGTTGCCGAAGGCAGCAATATCAATTTCGGCGTGCAGTGGGCGTCGAAAGATGCCGGA
TTAATGCAGTTTGCTAACGGTACGCAGATCCCTATTGGCACGCTGGGTGCAGCCATTTCTCAGGCAAAACCGCAGAAAGG
CTCGACGGTAATCAGTGAAAACGGCGCTACCACCATTAACCCGGATACCAACGGCGATCTCTCCACGCTCGCCCAGCTTC
TTTCCGGCTTTAGCGGTACGGCGGTTGGTGTGGTGAAAGGCGACTGGATGGCGCTGGTGCAGGCGGTTAAAAACGACTCC
AGCTCTAACGTGCTCTCCACGCCGAGCATCACCACGCTGGACAACCAGGAAGCCTTCTTCATGGTGGGTCAGGACGTTCC
GGTATTAACCGGCTCTACCGTTGGCTCCAATAACAGCAATCCTTTCAATACAGTAGAAAGGAAAAAAGTCGGCATCATGC
TGAAAGTCACGCCGCAGATTAACGAAGGAAACGCGGTACAGATGGTGATTGAGCAGGAAGTCTCGAAGGTGGAAGGACAG
ACCAGTCTCGACGTCGTGTTTGGCGAGCGCAAACTGAAAACCACCGTGCTGGCAAACGATGGTGAGCTGATCGTGCTTGG
CGGTCTGATGGACGATCAGGCGGGAGAAAGCGTGGCGAAAGTGCCATTGCTGGGCGATATCCCGTTGATTGGTAACCTGT
TTAAATCGACGGCGGATAAAAAAGAAAAACGTAACCTGATGGTATTTATCCGCCCGACCATTCTGCGTGACGGTATGGCG
GCAGACGGCGTGTCGCAGCGCAAATATAACTATATGCGCGCCGAACAGATCTATCGCGATGAGCAAGGCTTGAGCCTGAT
GCCGCACACCGCGCAGCCGGTACTGCCAGCGCAAAATCAGGCCTTACCGCCAGAAGTTCGTGCGTTCCTTAATGCCGGGA
GAACGCGTTAA

Protein sequence :
MGPGVQGKVSIRTMTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKSSAAKVEPLPLVGEGSDNYAGDEMVTKVVPV
RNVSVRELAPILRQMIDSAGSGNVVNYDPSNVIMLTGRASVVERLTEVIQRVDHAGNRTEEVIPLDNASASEIARVLESL
TKNSGENQPATLKSQIVADERTNSVIVSGDPATRDKMRRLIRRLDSEMERSGNSQVFYLKYSKAEDLVDVLKQVSGTLTA
AKEEAEGTVGSGREVVSIAASKHSNALIVTAPQDIMQSLQSVIEQLDIRRAQVHVEALIVEVAEGSNINFGVQWASKDAG
LMQFANGTQIPIGTLGAAISQAKPQKGSTVISENGATTINPDTNGDLSTLAQLLSGFSGTAVGVVKGDWMALVQAVKNDS
SSNVLSTPSITTLDNQEAFFMVGQDVPVLTGSTVGSNNSNPFNTVERKKVGIMLKVTPQINEGNAVQMVIEQEVSKVEGQ
TSLDVVFGERKLKTTVLANDGELIVLGGLMDDQAGESVAKVPLLGDIPLIGNLFKSTADKKEKRNLMVFIRPTILRDGMA
ADGVSQRKYNYMRAEQIYRDEQGLSLMPHTAQPVLPAQNQALPPEVRAFLNAGRTR