PAI Gene Information


Name : c5144 (c5144)
Accession : NP_756992.1
PAI name : PAI II CFT073
PAI accession : NC_004431_P2
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : hypothetical protein
Function : -
Note : Residues 7 to 492 of 499 are 40.77 pct identical to residues 10 to 495 of 504 from SwissProt.40 : >sp|P55501|Y4JA_RHISN HYPOTHETICAL 57.2 KD PROTEIN Y4JA/Y4NE/Y4SE
Homologs in the searched genomes :   196 hits    ( 194 protein-level,   2 DNA-level )  
Publication :
    -Welch,R.A., Burland,V., Plunkett,G. III, Redford,P., Roesch,P., Rasko,D., Buckles,E.L., Liou,S.R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L., Donnenberg,M.S. and Blattner,F.R., "Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli", Proc. Natl. Acad. Sci. U.S.A. 99 (26), 17020-17024 (2002) PUBMED 12471157.

    -Welch,R.A., Burland,V., Plunkett,G. III, Redford,P., Roesch,P., Rasko,D., Buckles,E.L., Liou,S.R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L., Donnenberg,M.S. and Blattner,F.R., "Direct Submission", Submitted (10-SEP-2004) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Welch,R.A., Burland,V., Plunkett,G.D. III, Redford,P., Roesch,P., Rasko,D.A., Buckles,E.L., Liou,S.-R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L.T., Donnenberg,M.S. and Blattner,F.R., "Direct Submission", Submitted (20-JUN-2002) Genetics Laboratory, University of Wisconsin - Madison, 445 Henry Mall, Madison, WI 53706, USA.


DNA sequence :
GTGACGCTCAATACTTCTCAGGTCAGTTACTATATGACTCAGCGTAAGAAAGGTATAACTCAGCATATCTCGGCCATGAA
GGCTGGTATCTCAGTCCGTTCTGGTCGTCGGATCGAAAAAGGAGAGTGGGCAAAAAACAGTGTTCGGCACTGGCGCACAC
GCAAAGATCCTCTGGAAGCTGTGTGGGACAGCATGCTTGTTCCTCTGTTGAAAGAGAGGCCGGCTCTGACACCAACAACT
CTGCTGGAGATGCTACAGGATAAATATCCCGGCCAGTACCCCAACAGCCTTCGAAGAACAATGCAACGGCGGGTCCGCGA
ATGGAAGCTACAGTATGGTGCAGAGCAGGAGGTCATGTTCCGCCAGCGACATCAGCCCGGTCTGCGAGGTCTGTCGGACT
TTACTGAACTGAAAGGTGTAGTTGTCACCATCGCCGGTAAGTTGTTGGCGCATAAGTTGTATCACTTCCGTCTGGAATGG
AGCCACTGGAGCTGGATGCGGGTTGTGCTGGGTGGTGAGAGCTTCTCTGCTCTGGCTGAAGGTCTGCAGGAAGCCCTCGG
ACAACTGGGCGGAGTGCCGGTAGAACATAAAACGGACAGCCTGAGGGCAGCATGGAAACAACAGGGCGAAGATGGACGCC
GCGAGCTGACTGAGCGTTATGCTGCTCTCTGTCAGCACTACGGAATGCAGGGCGTACACAATAATGCCGGTCGGGGCCAC
GAAAATGGCTCGGTTGAAAGTGCCCACGGACATCTGAAAAGGCGTATCTGTCAGGCGCTGATACTGCGGGGCAGTAACGA
CTTCAGCACCATAGAAGAATATCAGGCCTTCATCACTCAGCAGGTTATGCGGCACAACCGTAACAATCAGGATCTGGTCA
AGGAAGAACGTCTTCATCTGAAACCGCTGCCGCTTCGTCGCAGTGCTGACTATGATGAGCTGACTGTGAGGGTTAGCCGC
AGCAGTACCATCAATGTGAAGCACGTCGTCTACAGCGTACCTTCCCGGCTTGTAGGTCAACTGTTACGGGTCCGGTTATG
GGACGATCGTCTGAGCTGTTACGTTGGCAGCAGCGAGGTCATGAGCTGCCCACGTGTCAGACCAGAAAAAGGGAAGACGC
GGGCCCGTCGTATCGACTTCCGACATGTGATCGACAGTCTGGCAAAAAAGCCCGGTGCGTTCTGCCATGCAACGCTGAGA
AATGACATCCTGCCAGACGATGAATGGCGGAGGCTGTGGCGTCGCTTATGTAATCATCTGGAACCCGACATGGCAGGCAG
GCTGATGGTACATGCTCTGAAACTGGCTGCAGGATACGACGATATCTCAGTCGTGGCAAAAGGTATGGAGCAGATGCTGA
ATACCCCGGGAAACGTGGATCTGCACCGGCTGATGCGCTTCCTGGGTATAAAGGAAAAGGCGTTGCCGGTAGTCAATGTG
AAACAGCATAACCTGAGCAGTTATGAGCAACTACTGCGTGGCAAGGGAGGTTCGCAGTGA

Protein sequence :
MTLNTSQVSYYMTQRKKGITQHISAMKAGISVRSGRRIEKGEWAKNSVRHWRTRKDPLEAVWDSMLVPLLKERPALTPTT
LLEMLQDKYPGQYPNSLRRTMQRRVREWKLQYGAEQEVMFRQRHQPGLRGLSDFTELKGVVVTIAGKLLAHKLYHFRLEW
SHWSWMRVVLGGESFSALAEGLQEALGQLGGVPVEHKTDSLRAAWKQQGEDGRRELTERYAALCQHYGMQGVHNNAGRGH
ENGSVESAHGHLKRRICQALILRGSNDFSTIEEYQAFITQQVMRHNRNNQDLVKEERLHLKPLPLRRSADYDELTVRVSR
SSTINVKHVVYSVPSRLVGQLLRVRLWDDRLSCYVGSSEVMSCPRVRPEKGKTRARRIDFRHVIDSLAKKPGAFCHATLR
NDILPDDEWRRLWRRLCNHLEPDMAGRLMVHALKLAAGYDDISVVAKGMEQMLNTPGNVDLHRLMRFLGIKEKALPVVNV
KQHNLSSYEQLLRGKGGSQ