Gene Information

Name : SbBS512_E0769 (SbBS512_E0769)
Accession : YP_001879478.1
Strain : Shigella boydii CDC 3083-94
Genome accession: NC_010658
Putative virulence/resistance : Unknown
Product : sulfatase family protein
Function : -
COG functional category : R : General function prediction only
COG ID : COG3083
EC number : 3.1.6.-
Position : 721055 - 722815 bp
Length : 1761 bp
Strand : -
Note : identified by match to protein family HMM PF00884

DNA sequence :
ATGGTAACTCATCGTCAGCGCTACCGTGAAAAAGTCTCCCAGATGGTCAGTTGGGGGCACTGGTTTGCACTGTTCAATAT
TCTGCTTTCGCTCGTCATTGGCAGCCGTTACCTGTTTATCGCCGACTGGCCGACAACGCTTGCTGGTCGCATTTATTCCT
ACGTAAGCATTATCGGCCATTTCAGCTTCCTGGTGTTCGCCACCTACTTGCTGATCCTCTTCCCGCTGACCTTTATCGTC
GGCTCCCAGAGGCTGATGAGGTTTTTGTCCGTCATTCTGGCAACGGCGGGAATGACGCTATTACTGATCGATAGCGAAGT
CTTTACTCGTTTCCATCTCCATCTTAATCCCATCGTCTGGCAGTTGGTTATCAACCCAGACGAAAATGAGATGGCGCGCG
ACTGGCAGCTGATGTTCATCAGCGTGCCGGTTATTTTATTGCTTGAACTGGTGTTTGCGACGTGGAGCTGGCAAAAGCTG
CGCAGCCTGACGCGTCGTCGACGCTTCGCGCGCCCGCTGGCCGCATTCTTATTTATCGCCTTTATCGCCTCGCATGTGGT
GTATATCTGGGCCGATGCCAACTTCTATCGCCCGATCACCATGCAGCGCGCTAACCTGCCGCTTTCGTACCCGATGACGG
CGCGACGTTTCCTTGAGAAGCATGGCCTGCTTGATGCGCAGGAGTATCAACGCCGTCTTATTGAGCAAGGTAATCCAGAC
GCCGTTTCCGTACAGTATCCGTTAAGCGAACTGCGCTATCGCGATATGGGCACCGGGCAGAATGTGCTGTTGATTACTGT
CGATGGCCTGAACTACTCACGCTTCGAGAAGCAGATGCCTGCGCTGGCAGGTTTTGCTGAGCAAAATATTTCGTTCACGC
GCCATATGAGCTCCGGCAACACTACAGACAACGGCATCTTTGGCCTGTTCTATGGCATCTCGCCGAGCTATATGGACGGC
ATTCTGTCGACCCGTACCCCTGCGGCGTTAATTACTGCGCTTAATCAGCAAGGCTATCAGCTGGGATTATTCTCGTCAGA
TGGCTTTACCAGTCCGCTGTATCGCCAGGCATTGTTGTCAGATTTCTCGATGCCGAGCGTACGCACCCAATCCGACGAGC
AGACCGCCACGCAGTGGATCAACTGGCTGGGCCGCTACGCACAAGAAGATAACCGCTGGTTCTCGTGGGTCTCTTTCAAT
GGCACTAACATTGACGACAGCAATCAGCAGGCATTTGCACGGAAATATAGCCGGGCGGCAGGCAATGTCGATGACCAGAT
CAACCGCGTGCTCAATGCACTGCTTGATTCTGGCAAACTGGACAATACGGTTGTGATTATCACTGCCGGTCGGGGTATTC
CGCTGAGCGAAGACGAAGAAACCTTTGACTGGTCCCACGGTCATCTGCAGGTGCCATTAGTGATTCACTGGCCAGGCACG
CCGGCGCAGCGTATTAATGCGCTGACTGATCATACCGATCTGATGACGACGCTGATGCAACGCCTGCTACATGTCAGCAC
ACCTGCCAGCGAATATTCGCAAGGTCAGGATTTGTTCAACCCTCAACGCCGTCATTACTGGGTCACTGCAGCGGATAACG
ATACGCTGGCAATTACCACCCCGAAAAAGACGCTGGTGCTGAACAATAACGGTAAATACCGCACTTACAACTTACGTGGT
GAAAGAGTGAAAGATGAAAAACCACAGTTAAGTTTGTTATTGCAAGTGCTGACAGACGAGAAGCGTTTTATCGCTAACTG
A

Protein sequence :
MVTHRQRYREKVSQMVSWGHWFALFNILLSLVIGSRYLFIADWPTTLAGRIYSYVSIIGHFSFLVFATYLLILFPLTFIV
GSQRLMRFLSVILATAGMTLLLIDSEVFTRFHLHLNPIVWQLVINPDENEMARDWQLMFISVPVILLLELVFATWSWQKL
RSLTRRRRFARPLAAFLFIAFIASHVVYIWADANFYRPITMQRANLPLSYPMTARRFLEKHGLLDAQEYQRRLIEQGNPD
AVSVQYPLSELRYRDMGTGQNVLLITVDGLNYSRFEKQMPALAGFAEQNISFTRHMSSGNTTDNGIFGLFYGISPSYMDG
ILSTRTPAALITALNQQGYQLGLFSSDGFTSPLYRQALLSDFSMPSVRTQSDEQTATQWINWLGRYAQEDNRWFSWVSFN
GTNIDDSNQQAFARKYSRAAGNVDDQINRVLNALLDSGKLDNTVVIITAGRGIPLSEDEETFDWSHGHLQVPLVIHWPGT
PAQRINALTDHTDLMTTLMQRLLHVSTPASEYSQGQDLFNPQRRHYWVTAADNDTLAITTPKKTLVLNNNGKYRTYNLRG
ERVKDEKPQLSLLLQVLTDEKRFIAN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
ESA_01048 YP_001437152.1 hypothetical protein Not tested Not named Protein 0.0 81