PAI Gene Information


Name : sfaF
Accession : CAC16953.2
PAI name : PAI III 536
PAI accession : X16664
Strain : Escherichia coli 042
Virulence or Resistance: Virulence
Product : SfaF protein
Function : -
Note : -
Homologs in the searched genomes :   404 hits    ( 404 protein-level )  
Publication :
    -Dobrindt,U., "Direct Submission", Submitted (09-NOV-2000) Dobrindt U., Institut fuer Molekulare Infektionsbiologie, Univ. Wuerzburg, Roentgenring 11, 97070 Wuerzburg, GERMANY REMARK revised by [4].

    -Dobrindt,U., "Direct Submission", Submitted (02-JUL-2001) Dobrindt U., Inst. f. Molekulare Infektionsbiologie, Univ. Wuerzburg, Roentgenring 11, 97070 Wuerzburg, GERMANY REMARK revised by [6].

    -Dobrindt,U., "Direct Submission", Submitted (11-FEB-2003) Dobrindt U., Inst. f. Molekulare Infektionsbiologie, Univ. Wuerzburg, Roentgenring 11, 97070 Wuerzburg, GERMANY.

    -Dobrindt,U., Blum-Oehler,G., Hartsch,T., Gottschalk,G., Ron,E.Z., Funfstuck,R. and Hacker,J., "S-Fimbria-encoding determinant sfa(I) is located on pathogenicity island III(536) of uropathogenic Escherichia coli strain 536", Infect. Immun. 69 (7), 4248-4256 (2001) PUBMED 11401961.

    -Dobrindt,U., Blum-Oehler,G., Nagy,G., Schneider,G., Johann,A., Gottschalk,G. and Hacker,J., "Genetic structure and distribution of four pathogenicity islands (PAI I(536) to PAI IV(536)) of uropathogenic Escherichia coli strain 536", Infect. Immun. 70 (11), 6365-6372 (2002) PUBMED 12379716.

    -Hacker,J., "Direct Submission", Submitted (02-OCT-1989) Hacker J., Inst. f. Genetik und Mikrobiologie, Univ. Wuerzburg, Roentgenring 11, 8700 Wuerzburg, GERMANY REMARK revised by [3].


DNA sequence :
ATGTTTTCCGGAGATGGCGGACAACTCCTGTCTGATAAATCACTGACCGGCTCTGCTGGCGGGGGTAATAACAGAATGAG
ATTTAATATACTGCCGCTGGCATTTTTTATCGGGATAATTGTTTCTCCGGCCCGGGCTGAGCTCTATTTTAATCCGCGTT
TTCTGTCTGATGACCCGGATGCGGTTGCGGACCTGTCTGCATTTACACAGGGGCAGGAGCTTCCGCCCGGCGTGTACCGG
GTGGATATTTATCTGAATGATACGTATATCTCAACCCGTGACGTGCAGTTTCAGATGAGTCAGGACGGGAAGCAACTTGC
CCCCTGCCTGAGTCCGGAGCATATGAGTGCGATGGGGGTAAACCGTTATGCCGTGCCGGGTATGGAGAGACTGCCGGCTG
ACACCTGCACATCACTGAATTCCATGATTCAGGGTGCCACATTCCGGTTTGATGTCGGGCAGCAGCGATTGTACCTGACC
GTTCCTCAGATATATATGAGTAATCAGGCCCGTGGTTATATTGCGCCGGAATACTGGGATAACGGAATTACAGCGGCATT
GCTGAATTATGACTTCAGTGGTAATCGGGTGCGCGACAGCTATGGTGGCACCAGCGATTATGCCTATCTGAATCTGAAAA
CAGGACTGAATATCGGGAGCTGGCGTCTTCGGGACAACACCAGCTGGAGTTACAGTGCCGGAAAGGGGTACAGCCAGAAT
AACTGGCAGCACATTAACACCTGGCTGGAGCGGGATATAGTTCCCCTGCGCTCCCGTCTGACGATGGGGGACAGTTATAC
CCGGGGGGATATTTTTGACGGTGTAAATTTCAGGGGTATTCAACTGGCCTCTGATGACAATATGGTTCCGGACAGCCAGA
GAGGGTATGCACCGACAATTCACGGTATTTCCCGGGGAACATCCCGGATTAGTATCCGCCAGAACGGGTATGAAATTTAC
CAGAGTACGCTACCTCCAGGCCCGTTCGAGATAAACGATATTTATCCGGCCGGGAGTGGAGGTGACTTACAGGTAACCCT
GCAGGAAGCTGATGGCAGTGTACAGCGATTTAATGTGCCATGGTCTTCGGTGCCTGTACTGCAACGTGAAGGGCATCTGA
AATATGCTCTCAGTGCCGGAGAGTTTCGCAGTGGCGGACATCAGCAGGATAATCCCCGTTTTGCTGAAGGAACGCTGAAG
TATGGTCTGCCTGCAGGCTGGACGGTGTATGGAGGAGCCTGGATAGCTGAACGCTACCGTGCCTTCAATCTGGGGGTGGG
TAAAAACATGGGCTGGCTGGGTGCTGTTTCACTGGATGCCACCCGTGCGAATGCCCGGTTACCGGATGAAAGCCGGTATG
ACGGTCAGTCATATCGTTTTCTGTATAACAAGTCACTGACGGAAACAGGGACAAATATCCAGTTAATCGGATACCGCTAC
TCCACACGGGGCTATTTCAGCTTTGCTGATACCGCCTGGAAGAAAATGAGTGGTTACAGTGTTCTTACCCAGGACGGAGT
GATACAGATACAGCCGAAGTATACGGATTACTACAATCTGGCTTACAACAAGCGGGGAAGGGTGCAGGTGAGTATCAGCC
AGCAGACGGGGGAATCGTCAACGTTGTATCTGAGTGGCAGCCACCAGAGTTACTGGGGAACGGACAGGACGGACCGTCAG
CTTAATGCTGGCTTTAACTCATCTGTGAATGACATCAGCTGGTCCCTGAACTACAGCCTGTCCCGGAATGCATGGCAACA
TGAAACTGACCGGATACTGTCATTTGATGTCAGCATTCCGTTCAGCCACTGGATGCGTTCGGACAGCACATCTGCATGGA
GAAACGCCAGTGCCCGTTACAGTCAGACCCTGGAAGCTCACGGGCAGGCTGCCAGTACAGCAGGGCTGTATGGCACACTG
CTGGAGGACAATAATCTGGGATACAGCATCCAGAGTGGTTATACCCGGGGAGGGTATGAGGGGAGCAGTAAAACAGGATA
TGCCTCTCTGAATTACAGAGGGGGATACGGTAATGCCAGTGCAGGATACAGTCACAGTGGGGGTTACCGGCAACTGTATT
ATGGGCTGAGTGGCGGTATTCTGGCACATGCCAATGGTCTGACACTGAGCCAGCCTCTGGGAGATACGCTGATTCTTGTC
CGTGCACCAGGTGCATCTGATACGCGCATTGAAAACCAGACGGGGGTGTCTACAGACTGGAGGGGGTATGCTGTACTGCC
TTATGCAACAGACTATCGTGAAAACCGGGTGGCACTGGATACCAACACCCTTGCAGATAATGTTGATATAGAGAATACCG
TGGTCAGCGTCGTGCCCACGCATGGTGCCGTTGTGCGGGCTGACTATAAAACCCGTGTGGGTGTGAAAGTGCTGATGACG
CTGATGAGAAACGGGAAAGCAGTTCCTTTTGGCTCTGTCGTTACGGCCAGAAACGGGGGGAGCAGTATCGCCGGAGAGAA
TGGTCAGGTTTATCTGAGCGGAATGCCTCTGTCAGGACAGGTTAGTGTGAAATGGGGGAGTCAGACCACGGACCAGTGTA
CTGCAGATTATAAATTACCGAAGGAAAGCGCCGGACAGATATTAAGTCATGTCACGGTTAGTTGCAGGTAA

Protein sequence :
MFSGDGGQLLSDKSLTGSAGGGNNRMRFNILPLAFFIGIIVSPARAELYFNPRFLSDDPDAVADLSAFTQGQELPPGVYR
VDIYLNDTYISTRDVQFQMSQDGKQLAPCLSPEHMSAMGVNRYAVPGMERLPADTCTSLNSMIQGATFRFDVGQQRLYLT
VPQIYMSNQARGYIAPEYWDNGITAALLNYDFSGNRVRDSYGGTSDYAYLNLKTGLNIGSWRLRDNTSWSYSAGKGYSQN
NWQHINTWLERDIVPLRSRLTMGDSYTRGDIFDGVNFRGIQLASDDNMVPDSQRGYAPTIHGISRGTSRISIRQNGYEIY
QSTLPPGPFEINDIYPAGSGGDLQVTLQEADGSVQRFNVPWSSVPVLQREGHLKYALSAGEFRSGGHQQDNPRFAEGTLK
YGLPAGWTVYGGAWIAERYRAFNLGVGKNMGWLGAVSLDATRANARLPDESRYDGQSYRFLYNKSLTETGTNIQLIGYRY
STRGYFSFADTAWKKMSGYSVLTQDGVIQIQPKYTDYYNLAYNKRGRVQVSISQQTGESSTLYLSGSHQSYWGTDRTDRQ
LNAGFNSSVNDISWSLNYSLSRNAWQHETDRILSFDVSIPFSHWMRSDSTSAWRNASARYSQTLEAHGQAASTAGLYGTL
LEDNNLGYSIQSGYTRGGYEGSSKTGYASLNYRGGYGNASAGYSHSGGYRQLYYGLSGGILAHANGLTLSQPLGDTLILV
RAPGASDTRIENQTGVSTDWRGYAVLPYATDYRENRVALDTNTLADNVDIENTVVSVVPTHGAVVRADYKTRVGVKVLMT
LMRNGKAVPFGSVVTARNGGSSIAGENGQVYLSGMPLSGQVSVKWGSQTTDQCTADYKLPKESAGQILSHVTVSCR