Name : DIP0764 (DIP0764)
Accession : NP_939135.1
REI name : Not named
REI accession : NC_002935_R2
Strain : Corynebacterium diphtheriae 241
Resistance or Virulence : Not determined
Product : transposase
Function : -
Note : Similar to Escherichia coli transposase InsI for insertion sequence element IS30B/C/D (InsI1 or B0256) and (InsI2 or B1404) and (InsI3 or B4284) SW:INSI_ECOLI (P37246) (383 aa) fasta scores: E(): 7.2e-19, 37.94% id in 390 aa
Homologs in the searched genomes : 339 hits (339 protein-level)
Publication :
-Cerdeno-Tarraga,A.M., "Direct Submission", Submitted (08-APR-2002) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.
-Cerdeno-Tarraga,A.M., Efstratiou,A., Dover,L.G., Holden,M.T., Pallen,M., Bentley,S.D., Besra,G.S., Churcher,C., James,K.D., De Zoysa,A., Chillingworth,T., Cronin,A., Dowd,L., Feltwell,T., Hamlin,N., Holroyd,S., Jagels,K., Moule,S., Quail,M.A., Rabbinowits, "The complete genome sequence and analysis of Corynebacterium diphtheriae NCTC13129", Nucleic Acids Res. 31 (22), 6516-6523 (2003) PUBMED 14602910.
DNA sequence :
GTGCAACGCCAGTTCTGGGGTCTGATCGCGACGGGAATCACCACTGCGCAGGCCGCCCTTCAGGTGGGGGTGTCCGTACC
GGTGGGAACGAGGTGGTTCCGTCATGCTGGTGGAATGAGACCGTTAGCTTTGGACGAGCCCTCGGGGCGGTATCTATCGT
TTGCTGAACGAGAGGAAATCGCGATCCTGTGGGAGAAACAGACGGGTGTGCGTGAGATCGCTCGACGGATTGGCCGCAAT
CCGGCGACGATTTCTCGTGAGCTACGGCGTAATGCTGCCACCCGTGGTGGCAAGCAGATTTATCGGGTAGGGGTGGCCCA
GTGGAAAGCTCAGGAAGCCGCTAAGCGTCCCAAACAAGCAAAACTCGTGGATAATCCGCGCCTGCGGGACTATGTTCAGG
AGCGTCTTGCTGGGACGGTCCGAGATGAAAACGGGGTCGTCATGGCGGGGCCTGATACGCCTGCGTGGAAGGGCCGGAGC
AAACCTCACAGGGCAGACCGACGGTGGTTGACCGCGTGGAGCCCGGAGCAGATATCGCAGCGGTTAAGGATCGACTTCCC
TGATGATGAGAGTATGAGAATCTCCCATGAAGCGATCTACCAGGCGCTCTATATTGAGGGGCGCGGGGCGCTGAAGCGTG
AACTCGTGGCCGCACTGCGGACTGGGCGGGCCCTGCGTAAACCCCGCGCCCGGTCCAAGAGCAAGCCTAAAGGTCATGTC
ACCGCCGATGTTGTCATCTCTAACCGGCCCCCGGAAGCCTCGGATCGGGCGGTGCCCGGGCATTGGGAAGGAGACTTGAT
CATCGGTACCGGCAGGTCAGCGATCGGGACCGTGGTCGAACGCTACAGCCGGTCGATCCTGCTGGTGCACCTGCCCCGGT
TGGACGGCTGGGGTGAACAACCAAGGACGAAGAACGGACCTGCCTTGGGTGGGTACGGTGCTGAAGCGATGAACACCGCC
CTACAAACCGTGATGAAAGATCTACCGTTACAACTGCGTCAAACGTTGACCTGGGACCGCGGAACCGAGCTCGCTGACCA
TGCCGCTTTCACTCTAGCCACGGGCACGAAAGTGTTCTTCGCGGACCCACACTCGCCATGGCAACGTCCCACGAATGAGA
ACAGCAACGGACTCCTACGCCAGTACTTTCCCAAGGGTACCGACCTGTCTCGATGGACAGCCCAAGACCTCGCCGCCATC
GCCTACACGCTCAACAACAGACCAAGAAAAGTCCTTGGATTTCGAACCCCAGCCGAAGTCTTCAATGAACAACTACAATC
AATTCAAAACAACAGTGTTGCAACCACCGATTGA
Protein sequence :
MQRQFWGLIATGITTAQAALQVGVSVPVGTRWFRHAGGMRPLALDEPSGRYLSFAEREEIAILWEKQTGVREIARRIGRN
PATISRELRRNAATRGGKQIYRVGVAQWKAQEAAKRPKQAKLVDNPRLRDYVQERLAGTVRDENGVVMAGPDTPAWKGRS
KPHRADRRWLTAWSPEQISQRLRIDFPDDESMRISHEAIYQALYIEGRGALKRELVAALRTGRALRKPRARSKSKPKGHV
TADVVISNRPPEASDRAVPGHWEGDLIIGTGRSAIGTVVERYSRSILLVHLPRLDGWGEQPRTKNGPALGGYGAEAMNTA
LQTVMKDLPLQLRQTLTWDRGTELADHAAFTLATGTKVFFADPHSPWQRPTNENSNGLLRQYFPKGTDLSRWTAQDLAAI
AYTLNNRPRKVLGFRTPAEVFNEQLQSIQNNSVATTD