Name : c3658 (c3658) Accession : NP_755533.1 Strain : Escherichia coli CFT073 Genome accession: NC_004431 Putative virulence/resistance : Unknown Product : hypothetical protein Function : - COG functional category : L : Replication, recombination and repair COG ID : COG3436 EC number : - Position : 3495830 - 3497431 bp Length : 1602 bp Strand : + Note : Residues 13 to 298 of 533 are 71.32 pct identical to residues 1 to 277 of 285 from GenPept.129 : >gb|AAG55716.1|AE005309_6 (AE005309) unknown in ISEc8 [Escherichia coli O157:H7 EDL933] DNA sequence : ATGAATATCCGTATCTGGAGTGGTATACTCCCCTGTATGGATATCTCCGCTCTCAACACCACGAATGACATCGAAAAACT GCGTGCTATGGCACTTGCCATGGTACAAGAAGTCATGTCGGAGAATGCCGAAAAAGAGCGGGAATTACTGGAGAAAAGCC GGCGCATCCAGCTTCTGGAAGAAATGCTGAAACTGGTTCGTCAACAGCGCTTCGGAAAAAAATGTGGAACGCTGGCTGGT ATGCAACGCTCCCTGTTCGAAGAGGATGTTGATGCCGATATCGCCGCGCTTACCGCACATCTGGATAAACTGCTCCCGCA ATCCCCTGAAGAAGACGAAAAAGCGTCCCGTTCACGCCCGATACGCAAACCCTTACCGGTTCATCTTCCACGGGTGGAAA AAATTATCCAGCCGGACACTGACCATTGCCCTGAATGTGACGAGCCGCTGCACTATATCCGCGATGCGGTGAGTGAAAAG CTGGAGTATATTCCCGCTCACTTTGTGGTGAACCGTTATGTCCGTCCACAATACAGTTGTCCCTGTTGCCAGAAGGTGTT CAGCGGTGAAATGCCGGCACATATCCTCCCGAAAAGTGCCGTTGAGCCATCAGTCATCGCACAGGTGATCATCAATAAAT ACGGTGACCACCTGCCTCTGTATCGCCAGCAACAGGTCTTTGCCCGTTCAGATGTCGGGCTGCCCGTCAGTTCGATGGCT GACATGGTTGGCGCGGCGGGTGCCGCATTATCTCCCCTGGCGGCGTTACTCCATCGCGAGTTGATAAACCGTCCGGTGGT GCATGCAGATGAGACTACCCTGAAGATCCTGAACACGAAGAAAGGCGGTAAATCCTGCTCCGGTTATCTGTGGGCATACG TCAGTGGAGAAAGGACGGGACCGTCAGTTGTGTGCTTCGACTGCCGGACCGGACGTAGCCATGAGTATCCTGAAAACTGG CTTCAGGGCTGGGGCGGGACGCTGGTTGTCGACGGACATAAAGCTTACCGGACTCTGGCAAACAAAGTGCCGGAGATCAC GCTGGCCGGATGCTGGGCCCATGCCCGCAGGGGCTTCGCCGACCTGTATAAAATCAGTAAAGATCCACGGGCTGCCATAG CCGTGAAGAAAATCGCGGGGTTGTACCGTCTTGAGAAGAAGATCAGTAGCCGCCCCGTGGAAAAAATCCGCCAGTGGCGA CAGCGTTATGCCCGTCCGATACTGGAAGAACTGTGGTCATGGCTTGAAGAGCAGGAACCGCAATGTTCTCCGGGAAGGGC ATTACACAAAGCCATTGCCTATGCGCTGTCTCATCGCGTGGAACTGAGCCGCTTCCTGGAAGATGGTGCGGTGCCGCTGG ATAATAATGTGTGTGAACGGGCCATCAAAAACGTGGTTCTGGGCAGAAAATCGTGGCTGTTCGCCGGTTCGCAGATGGCG GGAGAACGCGCCGCGCAAATAATGAGCTTGCTGGAAACCGCGAAACGCAACGGTCTGGAGCCGCATGCCTGGTTGACAGA CGTCCTGATGCGTCTGCCGGAGTGGCCGGAGGAGCGACTGGCAGAGTTGCTGCCTCTTGAGGGATTTACCTTCTCCGGGT GA Protein sequence : MNIRIWSGILPCMDISALNTTNDIEKLRAMALAMVQEVMSENAEKERELLEKSRRIQLLEEMLKLVRQQRFGKKCGTLAG MQRSLFEEDVDADIAALTAHLDKLLPQSPEEDEKASRSRPIRKPLPVHLPRVEKIIQPDTDHCPECDEPLHYIRDAVSEK LEYIPAHFVVNRYVRPQYSCPCCQKVFSGEMPAHILPKSAVEPSVIAQVIINKYGDHLPLYRQQQVFARSDVGLPVSSMA DMVGAAGAALSPLAALLHRELINRPVVHADETTLKILNTKKGGKSCSGYLWAYVSGERTGPSVVCFDCRTGRSHEYPENW LQGWGGTLVVDGHKAYRTLANKVPEITLAGCWAHARRGFADLYKISKDPRAAIAVKKIAGLYRLEKKISSRPVEKIRQWR QRYARPILEELWSWLEEQEPQCSPGRALHKAIAYALSHRVELSRFLEDGAVPLDNNVCERAIKNVVLGRKSWLFAGSQMA GERAAQIMSLLETAKRNGLEPHAWLTDVLMRLPEWPEERLAELLPLEGFTFSG |