Gene Information

Name : ECOK1_2424 (ECOK1_2424)
Accession : YP_006101577.1
Strain : Escherichia coli IHE3034
Genome accession: NC_017628
Putative virulence/resistance : Unknown
Product : sulfatase family protein
Function : -
COG functional category : -
COG ID : -
EC number : 3.1.6.-
Position : 2502597 - 2504357 bp
Length : 1761 bp
Strand : +
Note : identified by match to protein family HMM PF00884

DNA sequence :
ATGGTAACTCATCGTCAGCGCTACCGTGAAAAAGTCTCCCAGATGGTCAGTTGGGGGCACTGGTTTGCACTGTTCAATAT
TCTGCTTTCGCTCGTCATTGGCAGCCGTTACCTGTTTATCGCCGACTGGCCGACAACGCTTGCTGGTCGCATTTATTCCT
ACGTAAGCATTATCGGTCATTTCAGCTTCCTGGTGTTCGCCACCTACTTGCTGATCCTCTTCCCGCTAACCTTTATCGTC
GGCTCCCAGAGGCTGATGAGGTTTTTGTCCGTCATTCTGGCAACGGCGGGAATGACGCTATTACTGATCGACAGCGAAGT
CTTTACTCGTTTCCATCTCCATCTTAATCCCATCGTCTGGCAGCTGGTTATCAACCCAGACGAAAATGAGATGGCGCGCG
ACTGGCAGCTAATGTTCATCAGTGTGCCGGTTATTTTATTGCTTGAACTGGTGTTTGCGACGTGGAGCTGGCAAAAGCTG
CGCAGCCTGACGCGTCGTCGGCGCTTCGCGCGCCCGCTGGCCGCATTCTTATTTATCGCCTTTATCGCCTCGCATGTGGT
TTATATCTGGGCCGATGCCAACTTCTATCGCCCGATCACCATGCAGCGCGCTAACCTGCCGCTTTCGTACCCGATGACGG
CACGACGTTTTCTTGAGAAGCATGGCCTGCTTGATGCGCAGGAGTATCAACGCCGTCTTATTGAGCAAGGCAATCCAGAC
GCCGTTTCCGTTCAGTATCCGTTAAGCGAACTGCGCTATCGCGATATGGGCACCGGGCAGAATGTGCTGTTGATTACTGT
CGATGGCCTGAACTACTCACGCTTCGAGAAGCAGATGCCTGCGCTGGCAGGTTTTGCTGAGCAAAATATTTCGTTCACGC
GCCATATGAGCTCCGGCAACACTACAGACAACGGCATCTTTGGTCTGTTCTATGGCATCTCGCCGAGCTATATGGACGGC
ATTCTGTCGACTCGTACGCCTGCGGCGTTAATTACTGCGCTTAATCAGCAAGGCTATCAGCTGGGGTTATTCTCATCAGA
TGGCTTTACCAGCCCGCTGTATCGTCAGGCATTGTTGTCAGATTTCTCGATGCCGAGTGTACGCACCCAATCCGACGAAC
AGACCGCCACGCAGTGGATCAACTGGCTGGGCCGCTACGCACAAGAAGATAACCGCTGGTTCTCGTGGGTCTCTTTCAAT
GGCACAAACATTGACGACAGCAATCAGCAGGCATTTGCACGGAAATATAGCCGGGCGGCAGGCAATGTCGATGACCAGAT
CAACCGCGTGCTCAATGCACTGCGTGATTCTGGCAAACTGGACAATACGGTGGTGATTATCACTGCCGGGCGGGGTATTC
CGCTGAGCGAAGAGGAAGAAACCTTTGACTGGTCCCACGGTCATCTGCAGGTGCCATTAGTGATTCACTGGCCAGGCACG
CCGGCGCAGCGTATTAATGCGCTGACTGATCATACCGATCTGATGACGACGCTGATGCAACGCCTGCTACATGTCAGCAC
ACCTGCCAGCGAATATTCGCAAGGTCAGGATTTGTTCAACCCTCAACGCCGCCATTACTGGGTTACCGCCGCGGATAACG
ATACGCTGGCAATTACCACCCCGAAAAAGACACTGGTGCTGAACAATAACGGTAAATACCGCACTTACAACTTACGTGGT
GAAAGAGTGAAAGATGAAAAACCACAGTTAAGTTTGTTATTGCAAGTACTGACAGACGAGAAGCGTTTTATCGCTAACTG
A

Protein sequence :
MVTHRQRYREKVSQMVSWGHWFALFNILLSLVIGSRYLFIADWPTTLAGRIYSYVSIIGHFSFLVFATYLLILFPLTFIV
GSQRLMRFLSVILATAGMTLLLIDSEVFTRFHLHLNPIVWQLVINPDENEMARDWQLMFISVPVILLLELVFATWSWQKL
RSLTRRRRFARPLAAFLFIAFIASHVVYIWADANFYRPITMQRANLPLSYPMTARRFLEKHGLLDAQEYQRRLIEQGNPD
AVSVQYPLSELRYRDMGTGQNVLLITVDGLNYSRFEKQMPALAGFAEQNISFTRHMSSGNTTDNGIFGLFYGISPSYMDG
ILSTRTPAALITALNQQGYQLGLFSSDGFTSPLYRQALLSDFSMPSVRTQSDEQTATQWINWLGRYAQEDNRWFSWVSFN
GTNIDDSNQQAFARKYSRAAGNVDDQINRVLNALRDSGKLDNTVVIITAGRGIPLSEEEETFDWSHGHLQVPLVIHWPGT
PAQRINALTDHTDLMTTLMQRLLHVSTPASEYSQGQDLFNPQRRHYWVTAADNDTLAITTPKKTLVLNNNGKYRTYNLRG
ERVKDEKPQLSLLLQVLTDEKRFIAN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
ESA_01048 YP_001437152.1 hypothetical protein Not tested Not named Protein 0.0 81