PAI Gene Information


Name : unnamed
Accession : CAC39291.1
PAI name : LPA
PAI accession : AJ278144
Strain : Escherichia coli 042
Virulence or Resistance : Not determined
Product : hypothetical protein
Function : -
Note : ORF12
Homologs in the searched genomes : 585 hits (585 protein-level)
Publication :
    -Schmidt,H., Zhang,W.L., Hemmrich,U., Jelacic,S., Brunder,W., Tarr,P.I., Dobrindt,U., Hacker,J. and Karch,H., "Identification and characterization of a novel genomic island integrated at selC in locus of enterocyte effacement-negative, Shiga toxin-producing Escherichia coli", Infect. Immun. 69 (11), 6863-6873 (2001) PUBMED 11598060.

    -Zhang,W.L., "Direct Submission", Submitted (22-MAY-2000) Zhang W.L., Institut fuer Hygiene und Mikrobiologie, Universitaet Wuerzburg, Josef-Schneider-Str. 2, D-97080, GERMANY.


DNA sequence :
ATGGATTTCTTTCGTTTTTTGATGAGCGATGTACTTTCCGAACCCGCAGTGCTGGTCGGTTTAATCGCCCTGATTGGCCT
GATTGCACAGAAAAAACCGGTAACCGAATGCATTAAAGGTACCGTCAAGACCATTATGGGGTTTGTGATTTTAGGCGCAG
GGGCGGGTTTAGTTGTGTCGTCACTGGGTGACTTCGCAAACATTTTCCAGCATGCCTTTGGTATTCAGGGTGTCGTGCCG
AACAATGAAGCCATTGTTTCCGTAGCGCAGAAAAGCTTCGGGAAAGAAATGGCGATGATTATGTTCTTTGCGATGGTTAT
CAACATTATGATTGCCCGTTTCACACCGTGGAAATTTATCTTCCTGACCGGTCATCACACGTTGTTTATGTCGATGATGG
TAGCGGTAATTCTGGCAACAGCAGGCATGACCGGCATTACGCTTATAGCCGTTGGCTCTTTGGTTGTTGGCGTGGCAATG
GTCTTTTTCCCGGCTATTGCGCATCCGTACATGAAGAAAGTGACCGGTTCTGATGACGTTGCCATCGGCCACTTCTCAAC
ATTATCTTATGTATTGGCCGGTTTCATCGGCAGCAAGTTTGGTAATAAAGAGCACTCGACTGAAGACATGAATGTGCCGA
AAAGTCTGCTGTTCTTGCGCGACACGCCGGTTGCTATCTCTTTTACCATGAGCATAATTTTCCTGGTGACGTGCCTGTTT
GCGGGTGCAGATGCGGTGAAAGAGCTTAGCGGTGGTAAAAACTGGTTCATGTTCTCTATCATGCAATCCATTACCTTCGC
AGCTGGCGTGTACATCATTCTGCAGGGTGTGCGCATGGTGATTGCGGAGATTGTCCCGGCATTCAAAGGTATTTCAGACA
AGCTGGTTCCGAATGCCAGACCTGCTCTCGATTGCCCGGTCGTCTTCCCTTACGCACCTAACGCTGTACTGGTTGGTTTC
CTGAGTAGCTTTGCTGCAGGTTTAATCGGTATGTTCACGTTGTATCTGCTGAACATGATCGTGATTATTCCTGGTGTGGT
GCCGCACTTCTTCGTAGGTGCAGCTGCAGGGGTATTCGGTAATGCAACGGGTGGACGTCGTGGTGCGATTTTGGGTGCCT
TCGCTCAGGGCTTGCTTATTACCTTCCTGCCAGTATTCTTATTGCCTGTGCTGGGTGATATTGGTTTTGCTAACACCACC
TTCAGTGATGCTGACTTTGGCGCACTGGGTATTTTGTTAGGGATTATCGTTCGCTAA

Protein sequence :
MDFFRFLMSDVLSEPAVLVGLIALIGLIAQKKPVTECIKGTVKTIMGFVILGAGAGLVVSSLGDFANIFQHAFGIQGVVP
NNEAIVSVAQKSFGKEMAMIMFFAMVINIMIARFTPWKFIFLTGHHTLFMSMMVAVILATAGMTGITLIAVGSLVVGVAM
VFFPAIAHPYMKKVTGSDDVAIGHFSTLSYVLAGFIGSKFGNKEHSTEDMNVPKSLLFLRDTPVAISFTMSIIFLVTCLF
AGADAVKELSGGKNWFMFSIMQSITFAAGVYIILQGVRMVIAEIVPAFKGISDKLVPNARPALDCPVVFPYAPNAVLVGF
LSSFAAGLIGMFTLYLLNMIVIIPGVVPHFFVGAAAGVFGNATGGRRGAILGAFAQGLLITFLPVFLLPVLGDIGFANTT
FSDADFGALGILLGIIVR
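
The protein sequence above should correspond to a codon-by-codon translation of the DNA sequence. A minimal sketch of that check follows, assuming the standard genetic code and reading frame 1 starting at the first base. The names `CODON_TABLE` and `translate` are illustrative and not part of this record.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence block can be pasted in as-is.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the DNA sequence in this record.
print(translate("ATGGATTTCTTTCGTTTTTTGATGAGCGAT"))
```

Passing the first ten codons of the DNA sequence gives MDFFRFLMSD, which matches the start of the protein sequence. Passing the full DNA block allows the same comparison against the whole protein.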