PAI Gene Information


Name : unnamed
Accession : CAE85238.1
PAI name : PAI V 536
PAI accession : AJ617685
Strain : Escherichia coli 536
Virulence or Resistance : Not determined
Product : hypothetical protein
Function : -
Note : ORF88
Homologs in the searched genomes : 63 hits (63 protein-level)
Publication :
    -Dobrindt,U., "Direct Submission", Submitted (11-NOV-2003) Dobrindt U., Inst. f. Molekulare Infektionsbiologie, Universitaet Wuerzburg, Roentgenring 11, 97070 Wuerzburg, GERMANY.

    -Schneider,G., Dobrindt,U., Bruggemann,H., Nagy,G., Janke,B., Blum-Oehler,G., Buchrieser,C., Gottschalk,G., Emody,L. and Hacker,J., "The pathogenicity island-associated K15 capsule determinant exhibits a novel genetic structure and correlates with virulence in uropathogenic Escherichia coli strain 536", Infect. Immun. 72 (10), 5993-6001 (2004) PUBMED 15385503.


DNA sequence :
ATGAATAAGAAATTTAAATATAAGAAATCGCTTTTAGCGGCTATTTTGAGTGCAACCCTGTTAGCCGGTTGTGATGGCGG
TGGCTCCGGATCTTCCTCCGATACGCCGCCTGTAGATTCTGGAACAGGGTCTTTGCCGGAAGTGAAACCTGATCCAACAC
CAAACCCGGAGCCGACGCCTGAGCCAACGCCGGACCCAGAACCTACGCCGGAACCGACACCTGATCCTGAGCCAACACCA
GAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACCCTGGGCGGAAGCCAGCGGATAACTGGTGCTACTTG
TAATGGTGAATCCAGCGATGGCTTTACCTTTACGCCAGGCGACAAAGTCACCTGTGTGGCAGGGAACAACACGACAATTG
CTACCTTCGACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGAAAAAGTGTCGTTTAGTCTTGAGGACGCGCAA
GAACTGGCAGCTTCCGATGACAAGAAAAGCAATGCGGTTTCGCTGGTAACGTCCAGTAACAGCTGTCCGGCGGATACAGA
ACAGGTTTGCCTGACGTTCTCCTCGGTGATCGAGAGTAAACGTTTCGACTCGCTGTATAAGCAAATCGATCTGGCACCGG
AAGAGTTCAAAAAGCTGGTCAATGAAGAGGTGGAAAACAATGCTGCGACCGATAAAGCGCCATCCACTCATACCTCACCG
GTCGTGCCTGTCACCACGCCGGGAACAAAACCGGATCTGAACGCGTCCTTCGTGTCGGCTAACGCGGAACAGTTTTATCA
GTATCAACCCACTGAAATCATTCTTTCCGAAGGCCGACTGGTGGATAGCATGGGCAATGGTGTGGTTGGCGTAAATTACT
ACACCAGCTCAGGCCGTGGCGTAACTGGCGAAAACGGCAAATTCAACTTCAGCTGGGGCGAAACCATCTCCTTTGGTATC
GACACCTTTGAACTGGGCTCAGTGCGCGGCAATAAGTCGACCATTGCGTTGACTGAACTGGGTGACGAAGTTCGCGGCGC
GAATATTGATCAGCTTATTCATCGTTACTCCCAGGCCGGAAAAAATGATGAGCGTGAAGTGCCGGACGTAGTGCGCAAGG
TCTTTGCCGAGTATCCCAACGTAATCAACGAGATTATCAATCTCTCGTTATCCAATGGCGAGGCGTTGAGCGAAGGCGAT
CAAACCTTTGAGCGGACAAACGAATTTCTTGAGCAGTTTGAATCCGGGCAGGCTAAAGAGATTGATACGGCGATTTGTGA
CTCCCTTGGGGGCTGCAACTCTCAGCGTTGGTTCTCGTTGACAGCACGCAATGTTAATGACGGCCAGATTCAGGGCGTTA
TTAACAAGCTGTGGGGGGTGGATACGAACTACAAATCTGTCAGCAAGTTCCATGTATTCCATGACTCTACCAACTTCTAT
GGCAGCACCGGTAATGCGCGCGGTCAGGCAGTGGTGAATATCTCCAACGCGGCATTCCCGATTCTGATGGCGCGTAATGA
TAAAAACTACTGGCTGGCCTTCGGCGAAAAACGCGCCTGGGATAAAAACGAGCTGGCGTACATTACGGAAGCGCCTTCTC
TCGTTGAGCCGGAAAACGTTACGCGCGATACCGCCACCTTTAACCTGCCGTTTATTTCGCTGGGGCAAGTCGGTGAGGGC
AAACTGATGGTTATCGGTAACCCGCACTACAACAGCATTTTGCGTTGTCCGAACGGTTACAGCTGGGAAGGCGGTGTTGA
TAAAAACGGTCAGTGTACGCGTAACAGTGATTCTAATGATATGAAGCACTTTATGCAGAACGTGTTGCGCTATCTGTCCG
ACGATAAATGGACGCCGGACGCGAAAGCCAGCATGACCGTAGGTACCAACCTGGATACTGTCTATTTCAAACGTCATGGT
CAGGTTACAGGAAACAGCGCTGAGTTCGGCTTTCATCCGGATTTTGCGGGTATCTCTGTTGAGCATTTAAGTAGCTATGG
CGATCTCGACCCGCAGGAAATGCCGCTGCTGATCCTTAACGGCTTTGAATATGTGACTCAGGTTGGTAACGATCCTTATG
CAATCCCGCTGCGAGCAGATACCAGCAAACCGAAGCTGACTCAGCAGGATGTGACCGATCTGATCGCCTATCTGAACAAA
GGTGGATCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCGCGTCTGGCTTTGTACGTCTGCTTGA
TGCCGCAGGTCTGTCGATGGCACTGAACAAGTCGGTAGTAAATAACGATCCGCAAGGGTATCCGAACCGCGTTCGTCAGC
AGCGCGCAACGGGCATTTGGGTCTATGAACGTTATCCTGCCGTAGATGGTGCGCTGCCGTACACCATCGATAGTAAGACA
GGGGAAGTTAAGTGGAAATATCAGGTAGAAAACAAACCTGATGACAAACCGAAGCTGGAAGTTGCCAGCTGGCTGGAAGA
TGTAGATGGCAAACAGGAAACGCGTTATGCCTTTATTGATGAGGCCGATCATAAAACAGAGGATTCTCTGAAGGCTGCGA
AGGCAAAAATCTTTGAGAAGTTTCCTGGATTAAAGGAGTGTAAGGACCCAACTTACCACTACGAGGTCAACTGTCTGGAA
TATCGTCCTGGCACGGGGGTTCCGGTTACTGGTGGCATGTATGTTCCACAGTATACGCAACTAAGCCTTAACGCCGACAC
GGCAAAAGCGATGGTGCAGGCTGCGGATTTAGGCACCAACATTCAGCGTCTGTATCAGCATGAGCTCTACTTCCGGACCA
ATGGTCGCAAAGGTGAGCGTCTGAGCAGCGTCGATCTGGAACGTCTGTACCAGAACATGTCGGTCTGGCTGTGGAATAAA
ATTGAATATCGTTATGAAAACGACAAGGATGACGAGCTGGGCTTTAAAACGTTCACCGAGTTCCTGAACTGCTACGCCAA
CGATGCCTATACTGGCGGCACGCAGTGTTCTGATGAGCTGAAAAAATCGCTGGTCGATAACAACATGATCTACGGCGAGA
AGAGCGTTAATAAAGCGGGCATGATGAACCCGAGCTATCCGCTCAACTATATGGAAAAACCGCTGACGCGCCTGATGCTG
GGTCGTTCCTGGTGGGATCTGAACATCAAAGTTGATGTCGAGAAGTATCCGGGAGCGGTATCGGCAGAAGGTGAGAAGGT
TACTGAAACCATCAGCCTGTACTCGAATCCGACCAAATGGTTTGCGGGTAACATGCAGTCTACTGGCCTGTGGGCTCCGG
CTCAGAAAGAGGTCACCATTGAGTCTACTGCATCAGTTGCTGTGACTGTCACCGTGGCGCTGGCCGACGACCTTACCGGA
CGTGAGAAGCATGAAGTCGCTCTGAACCGTCCGCCAAAAGTGACGAAAACCTATGAGCTGAAAGCCAATGGTGAGGTGAA
GTTTACGGTTCCTTACGGTGGTCTGATTTATATCAAGGGCAACAGCCCACAGAATGAGTCAGCCGAATTCACCTTTACTG
GTGTGGTGAAAGCGCCGTTCTATAAAGATGGCGCATGGAAAAACGCTCTGAACTCCCCTGCGCCGTTGGGCGAGCTGGAG
TCAGACGCTTTCGTCTACACCACGCCGAAGAAGAACCTTGAGGCCAGCAATTACAAGGGCGGTCAGGAACAATTCGCTGA
GGAACTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGTCGTAATGATGAAGACGGTAAGCACCGGATGTTTACCT
ATAAAAACTTGACGGGGCACAAGCATCGTTTCACCAACGATGTGCAGATCTCCATCGGTGATGCGCACTCTGGTTATCCG
GTAATGAACAGCAGCTTCTCGACGAACAGCACCACGCTGCCGACGACGCCGCTGAACGACTGGCTGATCTGGCACGAAGC
TGGTCACAACGCCGCCGAAACGCCGTTGACTGTACCGGGCGCGACCGAAGTAGCGAACAACGTGCTGGCGCTGTACATGC
AGGATCGCTATCTCGGCAAGATGAACCGTGTCGCTGACGATATTACCGTCGCACCGGAATATCTGGAGGAGAGCAACAAC
CAGGCATGGGCACGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCGCAGCTGAAGGAATGGGCAGAGAAAAACTTTGA
TATCACGAAGTGGTATCCAGAAGGTAACCTGCCTAAGTTCTACAGCGAGCGTGAAGGGATGAAAGGCTGGAACCTGTTCC
AGTTGATGCACCGTAAAGCGCGCGGCGATGAGGTTGGCAAAACCAAGTTTGGAGAAAGAAATTACTGTGCCGAATCCAAC
GGTAACGCTGCCGACACGCTGATGCTGTGTGCCTCCTGGGTCGCCCAGACGGATCTTTCGGCGTTCTTTAAGAAATGGAA
TCCGGGCGCGAATGCTTACCAGTTGCCGGGAGCGAGCGAGATGAACTTCGAGGGCGGTGTGAGCCAGTCGGCTTACGAGA
CGCTGGCGGCGCTTAATCTGCCGAAACCGCAGCAAGGGCCGGAAACCATTAATAAAGTTACCGAGTATTCGATGCCTGCT
GAATAA

Protein sequence :
MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDPEPTP
EPEPEPVPTKTGYLTLGGSQRITGATCNGESSDGFTFTPGDKVTCVAGNNTTIATFDTQSEAARSLRAVEKVSFSLEDAQ
ELAASDDKKSNAVSLVTSSNSCPADTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSP
VVPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSMGNGVVGVNYYTSSGRGVTGENGKFNFSWGETISFGI
DTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSQAGKNDEREVPDVVRKVFAEYPNVINEIINLSLSNGEALSEGD
QTFERTNEFLEQFESGQAKEIDTAICDSLGGCNSQRWFSLTARNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFY
GSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVTRDTATFNLPFISLGQVGEG
KLMVIGNPHYNSILRCPNGYSWEGGVDKNGQCTRNSDSNDMKHFMQNVLRYLSDDKWTPDAKASMTVGTNLDTVYFKRHG
QVTGNSAEFGFHPDFAGISVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNK
GGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWVYERYPAVDGALPYTIDSKT
GEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDEADHKTEDSLKAAKAKIFEKFPGLKECKDPTYHYEVNCLE
YRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNK
IEYRYENDKDDELGFKTFTEFLNCYANDAYTGGTQCSDELKKSLVDNNMIYGEKSVNKAGMMNPSYPLNYMEKPLTRLML
GRSWWDLNIKVDVEKYPGAVSAEGEKVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIESTASVAVTVTVALADDLTG
REKHEVALNRPPKVTKTYELKANGEVKFTVPYGGLIYIKGNSPQNESAEFTFTGVVKAPFYKDGAWKNALNSPAPLGELE
SDAFVYTTPKKNLEASNYKGGQEQFAEELDTFASSMNDFYGRNDEDGKHRMFTYKNLTGHKHRFTNDVQISIGDAHSGYP
VMNSSFSTNSTTLPTTPLNDWLIWHEAGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNN
QAWARGGAGDRLLMYAQLKEWAEKNFDITKWYPEGNLPKFYSEREGMKGWNLFQLMHRKARGDEVGKTKFGERNYCAESN
GNAADTLMLCASWVAQTDLSAFFKKWNPGANAYQLPGASEMNFEGGVSQSAYETLAALNLPKPQQGPETINKVTEYSMPA
E
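
The protein sequence above should be the conceptual translation of the DNA sequence, read in frame from the ATG start to the terminal TAA stop. The following is a minimal sketch of how that correspondence could be checked, assuming the standard genetic code applies to this ORF. The names `CODON_TABLE` and `translate` are illustrative and are not part of the database record.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# (first base varies slowest, bases in the order T, C, A, G).
# Assumption: the standard code applies to this ORF.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    # Remove line breaks and spaces so wrapped sequence blocks can be pasted in.
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the ORF
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the DNA sequence in this record.
print(translate("ATGAATAAGAAATTTAAATATAAGAAATCG"))  # MNKKFKYKKS
```

Passing the full DNA block (line breaks included) to `translate` and comparing the result with the protein block, with its line breaks removed, would confirm whether the two sequences in the record agree.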