Gene Information

Name : ECH74115_3327 (ECH74115_3327)
Accession : YP_002271603.1
Strain : Escherichia coli EC4115
Genome accession: NC_011353
Putative virulence/resistance : Unknown
Product : sulfatase family protein
Function : -
COG functional category : R : General function prediction only
COG ID : COG3083
EC number : 3.1.6.-
Position : 3060668 - 3062428 bp
Length : 1761 bp
Strand : +
Note : identified by match to protein family HMM PF00884

DNA sequence :
ATGGTAACTCATCGTCAGCGCTACCGTGAAAAAGTCTCCCAGATGGTCAGTTGGGGGCACTGGTTTGCACTGTTCAATAT
TCTGCTTTCGCTCGTCATTGGCAGCCGTTACCTGTTTATCGCCGACTGGCCGACCACGCTTGCTGGTCGCATTTATTCCT
ACGTAAGCATTATCGGCCATTTCAGCTTCCTGGTGTTCGCCACCTACTTGCTGATTCTCTTCCCGCTGACCTTTATCGTC
GGCTCCCAGAGGCTGATGAGGTTTTTGTCCGTCATTCTGGCAACGGCGGGAATGACGCTATTACTGATCGATAGCGAAGT
CTTTACTCGTTTCCATCTCCATCTTAATCCCATCGTCTGGCAACTGGTTATTAACCCAGACGAAAATGAGATGGCGCGCG
ACTGGCAGCTGATGTTCATCAGCGTGCCGGTTATTTTATTGCTTGAACTGGTGTTTGCGACGTGGAGCTGGCAAAAGCTG
CGCAGCCTGACGCGTCGTCGGCGCTTCGCGCGTCCGCTGGCCGCATTCTTATTTATCGCCTTTATCGCCTCGCATGTGGT
GTATATCTGGGCCGATGCCAACTTCTATCGCCCTATCACCATGCAGCGCGCTAACCTGCCGCTTTCGTACCCGATGACGG
CGCGACGTTTTCTTGAGAAGCATGGCCTGCTTGATGCGCAGGAGTATCAACGCCGTCTTATTGAGCAAGGTAATCCAGAC
GCCGTTTCCGTTCAGTATCCGTTAAGCGAACTGCGCTATCGCGATATGGGCACCGGGCAGAATGTCTTGTTGATTACTGT
CGATGGCCTGAACTACTCACGCTTCGAGAAGCAGATGCCTGCGCTGGCAGGTTTTGCTGAGCAAAATATTTCGTTCACGC
GCCATATGAGCTCCGGCAACACTACAGACAACGGCATCTTTGGCCTGTTCTATGGCATCTCGCCGAGCTATATGGACGGC
ATTCTGTCGACCCGTACGCCTGCGGCATTAATTACTGCGCTTAATCAGCAAGGCTATCAGCTGGGGTTATTCTCATCAGA
TGGCTTTACCAGCCCGCTGTATCGCCAGGCATTGTTGTCAGATTTCTCGATGCCGAGCGTACGCACCCAATCCGACGAGC
AGACCGCCACGCAGTGGATCAACTGGCTGGGACGCTACGCACAAGAAGATAACCGCTGGTTCTCGTGGGTCTCTTTCAAT
GGTACTAACATTGACGACAGCAATCAGCAGGCATTTGCACGGAAATATAGCCGGGCGGCAGGCAATGTCGATGACCAGAT
CAACCGCGTGCTCAATGCACTGCGTGATTCTGGCAAACTGGACAATACGGTGGTGATTATCACTGCCGGTCGGGGTATTC
CACTGAGCGAAGAGGAAGAAACCTTTGACTGGTCCCACGGTCATCTGCAGGTGCCATTAGTGATTCACTGGCCAGGCACG
CCGGCGCAGCGTATTAATGCGCTGACTGATCATACCGATCTGATGACGACGCTGATGCAACGCCTGCTACATGTCAGCAC
ACCTGCCAGCGAATATTCGCAAGGTCAGGATTTGTTCAACCCTCAACGCCGTCATTACTGGGTTACCGCAGCGGATAACG
ATACGCTGGCAATTACCACCCCGAAAAAGACGCTGGTGCTGAACAATAACGGTAAATACCGCACTTACAACTTACGTGGT
GAAAGAGTGAAAGATGAAAAACCACAGTTAAGTTTGTTATTGCAAGTACTGACAGACGAGAAGCGTTTTATCGCTAACTG
A

Protein sequence :
MVTHRQRYREKVSQMVSWGHWFALFNILLSLVIGSRYLFIADWPTTLAGRIYSYVSIIGHFSFLVFATYLLILFPLTFIV
GSQRLMRFLSVILATAGMTLLLIDSEVFTRFHLHLNPIVWQLVINPDENEMARDWQLMFISVPVILLLELVFATWSWQKL
RSLTRRRRFARPLAAFLFIAFIASHVVYIWADANFYRPITMQRANLPLSYPMTARRFLEKHGLLDAQEYQRRLIEQGNPD
AVSVQYPLSELRYRDMGTGQNVLLITVDGLNYSRFEKQMPALAGFAEQNISFTRHMSSGNTTDNGIFGLFYGISPSYMDG
ILSTRTPAALITALNQQGYQLGLFSSDGFTSPLYRQALLSDFSMPSVRTQSDEQTATQWINWLGRYAQEDNRWFSWVSFN
GTNIDDSNQQAFARKYSRAAGNVDDQINRVLNALRDSGKLDNTVVIITAGRGIPLSEEEETFDWSHGHLQVPLVIHWPGT
PAQRINALTDHTDLMTTLMQRLLHVSTPASEYSQGQDLFNPQRRHYWVTAADNDTLAITTPKKTLVLNNNGKYRTYNLRG
ERVKDEKPQLSLLLQVLTDEKRFIAN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
ESA_01048 YP_001437152.1 hypothetical protein Not tested Not named Protein 0.0 81