Name : tnp
Accession : AEA34687.1
REI name : Not named
REI accession : HQ018801
Strain : Escherichia coli 042
Resistance or Virulence : Not determined
Product : IS66 family transposase
Function : -
Note : similar to ZP_02814205.1 IS66 family element, transposase in Escherichia coli O157:H7 str. EC869 and YP_001458801.1 IS66 family transposase in Escherichia coli HS
Homologs in the searched genomes : 360 hits (359 protein-level, 1 DNA-level)
Publication :
- Ziebell,K., Johnson,R.P., Kropinski,A.M., Reid-Smith,R., Ahmed,R., Gannon,V.P., Gilmour,M. and Boerlin,P., "Gene Cluster Conferring Streptomycin, Sulfonamide, and Tetracycline Resistance in Escherichia coli O157:H7 Phage Types 23, 45, and 67", Appl. Environ. Microbiol. 77 (5), 1900-1903 (2011) PUBMED 21239555.
DNA sequence :
ATGAGTCAGAAATACCTCATTCGCATCGCAGAGCTGGAAAGGTTGCTCTCTGAGCAGGCTGAAGCCCTCCGTCAGAAAGA
CCAGCAACTGAGTCTGGTTGAAGAGACGGAAGCCTTCCTGCGCTCTGCACTGACACGTGCCGAAGAAAAGATCGAAGAAG
ATGAACGGGAAATAGAACATCTGCGGGCTCAGATAGAAAAACTGCGCCGGATGCTGTTCGGTACCCGTTCTGAAAAACTG
CGTCGTGAAGTTGAACTGGCTGAGGCCCTGCTGAAACAACGTGAACAGGACAGCGATCGTTACAGTGGGCGGGAAGACGA
TCCTCAGGTTCCCCGCCAGTTGCGACAGTCGCGCCATCGTCGTCCGTTACCGGCACACCTTCCCCGTGAAATACACCGCC
TGGAGCCAGAAGAAAGCTGTTGCCCGGAGTGTGGCGGTGAGCTGGATTATCTGGGGGAAGTCAGCGCTGAACAGCTGGAA
CTGGTGAGCAGTGCCCTGAAAGTGATCCGCACAGAACGGGTAAAAAAAGCCTGTACAAAATGTGACTGTATTGTTGAAGC
ACCGGCGCCGTCCCGCCCGATAGAGCGTGGTATCGCGGGCCCCGGATTACTTGCCCGCGTGTTAACGGGAAAATACTGCG
AACATCTGCCACTGTATCGTCAGAGTGAAATCTTTGCCCGCCAGGGCATCGAACTGAGCCGGGCATTACTCTCCAACTGG
GTTGACGCATGTTGTCAGTTAATGACTCCGCTGAATGATACCCTGTACCGTTACGTGATGAACACCCGCAAGGTTCACAC
TGACGACACACCAGTAAAAGTGCTGGCACCGGGCAGAAAAAAGGCAAAAACAGGACGCATCTGGACGTATGTCCGGGATG
ACCGGAATGCGGGCTCATCAGAGCCACCGGCGGTCTGGTTCGCCTACTCACCAGACAGGCAGGGAAAACATCCGGTACAA
CACCTTCGTCCCTTCCGGGGTATCCTGCAGGCGGATGCGTTCAGCGGTTACGATCGGCTGTTCAGTGCCGAACGTGAAGG
TGGTGCACTGACAGAAGTTGCGTGCTGGGCCCATGCCCGCAGGGGCTTCGCCGACCTGTATAAAATCAGTAAAGATCCAC
GGGCTGCCATAGCCGTGAAGAAAATCGCGGGGTTGTACCGTCTTGAGAAGAAGATCAGTAGCCGCCCCGTGGAAAAAATC
CGCCAGTGGCGACAGCGTTATGCCCGTCCGATACTGGAAGATCTGTGGTCATGGCTTGAAGAGCAGGAACCGCAATGTTC
TCCGGGAAGTAAGCGTACAGCCTGA
Protein sequence :
MSQKYLIRIAELERLLSEQAEALRQKDQQLSLVEETEAFLRSALTRAEEKIEEDEREIEHLRAQIEKLRRMLFGTRSEKL
RREVELAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLEPEESCCPECGGELDYLGEVSAEQLE
LVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARVLTGKYCEHLPLYRQSEIFARQGIELSRALLSNW
VDACCQLMTPLNDTLYRYVMNTRKVHTDDTPVKVLAPGRKKAKTGRIWTYVRDDRNAGSSEPPAVWFAYSPDRQGKHPVQ
HLRPFRGILQADAFSGYDRLFSAEREGGALTEVACWAHARRGFADLYKISKDPRAAIAVKKIAGLYRLEKKISSRPVEKI
RQWRQRYARPILEDLWSWLEEQEPQCSPGSKRTA
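
The DNA and protein sequences in this record can be checked against each other by translating the coding sequence codon by codon. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code, since the record does not state which translation table was used, and it assumes translation starts at the first base and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
# Assumption: the record does not name its translation table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence, stopping at the first stop codon.

    Whitespace (such as the line breaks in the record) is removed first.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the product
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the DNA sequence in this record.
print(translate("ATGAGTCAGAAATACCTCATTCGCATCGCA"))  # MSQKYLIRIA
```

Passing the full DNA sequence to `translate` and comparing the result with the protein sequence, after removing line breaks from both, gives a quick consistency check for the record.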