PAI Gene Information


Name : APECO1_1715 (APECO1_1715)
Accession : YP_851469.1
PAI name : PAI III APEC-O1
PAI accession : NC_008563_P2
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : integrase
Function : -
Note : -
Homologs in the searched genomes :   9 hits    ( 9 protein-level )  
Publication :
    -Johnson,T.J. and Nolan,L.K., "Direct Submission", Submitted (14-SEP-2006) Veterinary Microbiology and Preventive Medicine, Iowa State University, 1802 Elwood Drive, VMRI 2, Ames, IA 50011, USA.

    -Johnson,T.J., Kariyawasam,S., Wannemuehler,Y., Mangiamele,P., Johnson,S.J., Doetkott,C., Skyberg,J.A., Lynne,A.M., Johnson,J.R. and Nolan,L.K., "The genome sequence of avian pathogenic Escherichia coli strain O1:K1:H7 shares strong similarities with human extraintestinal pathogenic E. coli genomes", J. Bacteriol. 189 (8), 3228-3236 (2007) PUBMED 17293413 REMARK Erratum:[J Bacteriol. 2007 Jun;189(12):4554].

    -Johnson,T.J., Kariyawasam,S., Wannemuehler,Y., Mangiamele,P., Johnson,S.J., Doetkott,C., Skyberg,J.A., Lynne,A.M., Johnson,J.R. and Nolan,L.K., "Direct Submission", Submitted (08-NOV-2006) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
ATGCATTACAATCAATTTTCACATTACAAACGCTCCGACACTTTAGCCGGGGAGTTTATGAGATACAAGCCCAATCAATA
CATAATCTGTGATTCTTACGGTAACTACTATTTAAGGATCACGCTGCCTGTGTATATGCAGCCCTTTTTTGAAGGAAAAA
GGACGTTTGTCAGGAGTCTGCATACAAGTAACCTTCGTGTTGCACGTAGAAAGCGTGATCAGATTGCAGATGAATACCAT
TGCTTACGGGAGAGTGTTGCCCCTGTAAACAGCACAATAGAAAACACGCTGGAACTGTTACGCAGTAAGGCTAAATACGC
CAAAACAGCTACCAGAATGCAAGATACAGCGTCTTCGTGTCCGTCATTGCTTAAAATTCTTGAAATCTACCTGACAATTA
ACAGCACGAAGAAGAAGCCAGCCACTTTAGCTAAGGCAAGAAAAGCGGTAGAGATGTTTCTCTCCTACCGTAAAAAGCCT
GATATTGCATTGCAAGATGTGAGCCGCACCACTGTTACAGGCTGGATTGAACACATGCAAAAAACCCTTTCACAACAATC
AATTGCAAATTATATCAGCCCAATGGCCCAGCTATGGGAGTTAGCTTCATCACGTTACCACGATGCGCCAGAAAGGGCGC
TCTCCCCCTGGCGAGGGCATAGGCTTGATGTGGCACAAAGTAGAGAGAGCTACGAGGCATTTTCTAACAAAGAGCTATTG
CAGGTGTTGCAAGTATTTTCCGGTAATTCAGCAGAAAACAAAGAAATGATAGCTTTGTGTCTTATCGGTGCTTATACAGG
TATGCGGATCAATGAGATAGCAAGTCTCACAATAGACGATGTGAAAGAGATCGAAGGTGTGCTGTGTTTTGAAATCACAC
AGGGAAAAACGAAAGCTGCGGCACGTGTTGTGCCTGTGCATAGCCTTATCACTCCGTTGGTGTTGTCGCTGCGTGAAAAG
CCTCATAATGGCTTTTTGTTCTATCACGCCAGCATCACAGAACGTGCAGACGGCAAGCGTTCCACGTGGCACACGCAGAG
ATTTACAAGAGCTAAACGAAAGGCTTTAGGGGAAAAGGGAACAGAAAGGAAGGTGTTTCATTCTCTGAGACACGGAGTAG
CACAGCTTCTTGATCGAAATCAAATTCCAGAAGACAGGATCGCCCTTCTCCTGGGCCATACACGCGGCAATACAGAGACA
TTCCGCACATATAGCAAGAATGCAGCTTCTCCAGTAGAGCTTAAAAACTATATTGAGCTTCTACGCTACCCTGAAATAGA
GAAAGGCTTATCAATCAATAAAAAATCAAATTTAAGGCGTAAAACAACGCCATAG

Protein sequence :
MHYNQFSHYKRSDTLAGEFMRYKPNQYIICDSYGNYYLRITLPVYMQPFFEGKRTFVRSLHTSNLRVARRKRDQIADEYH
CLRESVAPVNSTIENTLELLRSKAKYAKTATRMQDTASSCPSLLKILEIYLTINSTKKKPATLAKARKAVEMFLSYRKKP
DIALQDVSRTTVTGWIEHMQKTLSQQSIANYISPMAQLWELASSRYHDAPERALSPWRGHRLDVAQSRESYEAFSNKELL
QVLQVFSGNSAENKEMIALCLIGAYTGMRINEIASLTIDDVKEIEGVLCFEITQGKTKAAARVVPVHSLITPLVLSLREK
PHNGFLFYHASITERADGKRSTWHTQRFTRAKRKALGEKGTERKVFHSLRHGVAQLLDRNQIPEDRIALLLGHTRGNTET
FRTYSKNAASPVELKNYIELLRYPEIEKGLSINKKSNLRRKTTP