PAI Gene Information


Name : APECO1_1067 (APECO1_1067)
Accession : YP_853084.1
PAI name : PAI IV APEC-O1
PAI accession : NC_008563_P3
Strain : Escherichia coli APEC O1
Virulence or Resistance : Not determined
Product : shikimate transporter
Function : -
Note : -
Homologs in the searched genomes : 385 hits (385 protein-level)
Publication :
    -Johnson,T.J. and Nolan,L.K., "Direct Submission", Submitted (14-SEP-2006) Veterinary Microbiology and Preventive Medicine, Iowa State University, 1802 Elwood Drive, VMRI 2, Ames, IA 50011, USA.

    -Johnson,T.J., Kariyawasam,S., Wannemuehler,Y., Mangiamele,P., Johnson,S.J., Doetkott,C., Skyberg,J.A., Lynne,A.M., Johnson,J.R. and Nolan,L.K., "The genome sequence of avian pathogenic Escherichia coli strain O1:K1:H7 shares strong similarities with human extraintestinal pathogenic E. coli genomes", J. Bacteriol. 189 (8), 3228-3236 (2007) PUBMED 17293413 REMARK Erratum:[J Bacteriol. 2007 Jun;189(12):4554].

    -Johnson,T.J., Kariyawasam,S., Wannemuehler,Y., Mangiamele,P., Johnson,S.J., Doetkott,C., Skyberg,J.A., Lynne,A.M., Johnson,J.R. and Nolan,L.K., "Direct Submission", Submitted (08-NOV-2006) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
ATGGACTCCACGCTCATCTCCACTCGTCCCGATGAAGGGACGCTTTCGTTAAGTCGCGCCCGACGAGCTGCGTTAGGCAG
TTTCGCTGGTGCCGTCGTCGACTGGTATGATTTTTTACTCTATGGCATTACCGCCGCACTGGTGTTTAATCGCGAGTTTT
TCCCGCAGGTAAGCCCGGCGATGGGAACGCTCGCCGCATTTGCCACCTTTGGTGTCGGCTTCCTTTTCCGACCGCTCGGC
GGTGTCATTTTCGGTCACTTTGGTGACCGGCTGGGACGTAAGCGCATGTTAATGCTGACCGTCTGGATGATGGGCATCGC
GACAGCCTTGATTGGTATTCTTCCTTCATTCTCGACCATTGGGTGGTGGGCACCTATTTTGCTGGTGACACTGCGTGCCA
TTCAGGGATTTGCCGTCGGCGGCGAATGGGGAGGCGCAGCGTTGCTTTCCGTTGAAAGTGCGCCGAAAAATAAAAAAGCC
TTTTACAGTAGCGGTGTACAAGTTGGCTACGGAGTAGGTTTACTGCTTTCAACCGGACTGGTTTCATTGATCAGTATGAT
GACGACTGACGAACAGTTTTTAAGCTGGGGCTGGCGCATTCCTTTCCTGTTTAGCATCGTACTGGTACTGGGAGCATTGT
GGGTGCGCAATGGCATGGAGGAGTCCGCGGAATTTGAACAACAGCAACATAATCAAGCGGCCGCGAAAAAACGCATCCCG
GTTATCGAAGCTCTGTTACGACATCCCGGTGCTTTCCTGAAGATTATTGCACTACGACTGTGCGAGTTGCTGACGATGTA
CATTGTTACTGCCTTTGCACTTAATTATTCAACCCAGAATATGGGGTTACCGCGCGAACTTTTCCTTAATATTGGTTTGC
TGGTAGGTGGATTAAGCTGCCTGACAATTCCCTGTTTTGCCTGGCTTGCCGATCGTTTTGGTCGGCGCAGGGTTTATATC
ACAGGCGCGTTGATCGGAACGTTGAGCGCATTTCCTTTCTTTATGGCGCTTGAAGCACAATCTATTTTCTGGATAGTTTT
CTTCTCCATAATGCTGGCAAACATTGCGCATGACATGGTGGTGTGTGTGCAACAACCGATGTTTACCGAAATGTTTGGTG
CCAGTTATCGCTACAGTGGTGCTGGAGTCGGTTATCAGGTTGCCAGTGTGGTTGGCGGTGGATTTACACCTTTTATTGCC
GCTGCACTCATCACTTACTTTGCCGGGAACTGGCATAGCGTCGCCATTTATTTGCTGGCTGGATGCCTGATTTCTGCAAT
GACCGCTTTATTGATGAAAGACAATCAACGCTCGTGA

Protein sequence :
MDSTLISTRPDEGTLSLSRARRAALGSFAGAVVDWYDFLLYGITAALVFNREFFPQVSPAMGTLAAFATFGVGFLFRPLG
GVIFGHFGDRLGRKRMLMLTVWMMGIATALIGILPSFSTIGWWAPILLVTLRAIQGFAVGGEWGGAALLSVESAPKNKKA
FYSSGVQVGYGVGLLLSTGLVSLISMMTTDEQFLSWGWRIPFLFSIVLVLGALWVRNGMEESAEFEQQQHNQAAAKKRIP
VIEALLRHPGAFLKIIALRLCELLTMYIVTAFALNYSTQNMGLPRELFLNIGLLVGGLSCLTIPCFAWLADRFGRRRVYI
TGALIGTLSAFPFFMALEAQSIFWIVFFSIMLANIAHDMVVCVQQPMFTEMFGASYRYSGAGVGYQVASVVGGGFTPFIA
AALITYFAGNWHSVAIYLLAGCLISAMTALLMKDNQRS
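
The protein sequence above corresponds to the DNA sequence read in frame from its first ATG to the terminal TGA stop codon (1317 nt, 438 residues plus stop). As a minimal sketch of that relationship, the snippet below translates short fragments copied from the start and end of the DNA sequence. It assumes the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of this record or of any database tool.

```python
from itertools import product

# Standard genetic code, amino acids listed in TCAG codon order
# (assumption: the record's protein was derived with this code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 42 nt of the DNA sequence in this record.
cds_start = "ATGGACTCCACGCTCATCTCCACTCGTCCC"
cds_end = "GCAATGACCGCTTTATTGATGAAAGACAATCAACGCTCGTGA"

print(translate(cds_start))  # MDSTLISTRP
print(translate(cds_end))    # AMTALLMKDNQRS
```

Passing the full DNA sequence (line breaks included) to `translate` in the same way should give a string that can be compared directly with the protein sequence listed above.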