Gene Information

Name : clpB (P9303_18321)
Accession : YP_001017839.1
Strain : Prochlorococcus marinus MIT 9303
Genome accession: NC_008820
Putative virulence/resistance : Virulence
Product : ATP-dependent Clp protease Hsp 100, ATP-binding subunit ClpB
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : 3.4.21.92
Position : 1589658 - 1592249 bp
Length : 2592 bp
Strand : -
Note : COG542 ATPases with chaperone activity, ATP-binding subunit [Posttranslational modification, protein turnover, chaperones]

DNA sequence :
ATGCAACCTACGGCGGATCAATTTACCGAAAAAGGCTGGGCAGCAATCGTTTTGGCCCAACAACTTGCCCAGCAACGCAA
GCACCAGCAACTGGAAACAGAGCATTTGTTGTTGTCACTGCTAGAGCAAAATGCTTTGGCAGGAAGAATCCTAGAAAAAG
CAGGTGTGTCGATCGGCAACTTGCAAACAGCCGTGGAGGCACACCTGCATGAACAGCCCACTCTGCAGGCAGCGCCAGAC
TCTGTCTATCTGGGCAAAGGAGTCAATGATCTGCTCGATCAAGCTGATAAACATAAGCAAGCTTTTGGCGATGGTTTCAT
CTCAATTGAACATCTGTTACTTGCATTGGCCGGGGACAATCGTTGCGGCCGCAAGCTGCTCAATCAAGCGGGCGTGGATG
CGGGCAAACTAAAGGTTGCAATTGACGCGGTACGCGGAAACCAGAAGGTGACCGATCAGAACCCAGAAGGAACTTACGAA
TCTCTCGAAAAATATGGCAGAGACCTGACTGCTGCTGCTCGCGAAGGCAAACTCGACCCTGTCATCGGAAGGGACGATGA
GATTCGACGCACGATTCAGATTCTTAGTAGGCGCACCAAAAACAACCCTGTCCTCATTGGGGAGCCAGGGGTGGGAAAAA
CCGCAATTGTTGAAGGCCTCGCACAACGGATTGTTAATGGCGATGTACCTGCAGCCCTTCAGAACAGACAGTTAATCGCC
CTTGATATGGGTGCTTTGATTGCAGGGGCAAAATATCGAGGTGAATTCGAAGAGCGACTCAAGGCAGTGCTCAAAGAAGT
TACCGCCTCTGAAGGACAAATCGTACTTTTTATTGATGAAATCCATACCGTTGTAGGAGCTGGAGCTACCGGTGGGGCAA
TGGATGCCAGCAATCTGCTTAAGCCAATGCTGGCTCGAGGGGAACTGCGCTGCATTGGTGCAACGACCCTCGATGAGCAC
CGACAGCACATCGAAAAAGACCCCGCTTTAGAAAGAAGATTCCAGCAAGTGCTTGTGGATCAACCCACCGTGCAAGACAC
GATCTCCATCCTGCGTGGTCTCAAAGAACGCTACGAAGTGCATCATGGTGTGCGCATTGCTGATAACGCCCTGGTTGCTG
CAGCTGTGCTCAGTAGCCGATATATTGCCGATCGTTTTTTGCCAGATAAAGCCATCGATTTGATGGATGAATCTGCTGCA
CGACTAAAAATGGAAATCACCTCCAAGCCGGAGGAAATCGATGAAATCGATCGCAAGATTGTGCAGCTCGAGATGGAGAA
GCTTTCCTTAGGACGTGAGTCCGACTCCGTTAGTAAGGAAAGATTGGAAAAACTAGAACGCGAGCTTGCTGAATTAGCAG
AGCAACAAAGTGCGCTCAATGCACAGTGGCAACAAGAAAAAGGTGCGATTGACGACCTTTCTTCGCTCAAAGAAGAAATC
GAAAGGGTGCAATTGCAAGTTGAGCAAGCCAAGCGCAGCTACGACCTCAACAAAGCAGCTGAACTGGAGTACGGAACTCT
GGCGGGATTGCAGAAACAACTCAGCGAGAAAGAAACTGCCCTGGCCCAAGATGGAGAGGCAGGCGATAAATCACTGCTGC
GAGAGGAGGTTACGGAAGACGATATTGCTGATGTCATCGCTAAATGGACCGGGATTCCTGTCGCCAAGCTTGTGCAGTCG
GAAATGGAAAAGCTGCTGGGACTTGAAGCAGAACTCCACCAACGTGTGATTGGCCAAGAACAGGCTGTGCAAGCTGTTGC
GGATGCGATTCAGCGTTCTAGGGCAGGTTTAAGTGATCCCAACCGCCCGATCGCAAGTTTTTTATTTCTAGGCCCCACTG
GTGTAGGTAAAACGGAGCTATCCAAGGCATTGGCCTCTCAACTGTTTGACAGCGAGGCGGCTTTGGTGCGAATCGATATG
TCTGAGTATATGGAGAAGCACAGTGTAAGCAGACTGATCGGGGCACCCCCGGGTTATGTCGGCTATGAGGCTGGTGGACA
GCTCACCGAAGCCGTACGCAGACGCCCTTATGCGGTAATCCTGTTCGATGAAGTGGAGAAGGCACACCCAGATGTGTTCA
ATGTGATGTTACAGATCCTCGATGATGGACGTGTTACCGATGGTCAGGGTCGCACAGTTGATTTTACAAATACGGTGCTA
ATTCTTACCAGTAATATTGGCAGCCAGTCGATTCTCGACTTGGGAGGCGATGATAGTCAATACGGGGAAATGGAGCGTCG
AGTTCATGATGCTTTGCACGCTCATTTTAGGCCTGAGTTTCTCAACCGCCTGGATGAAACAATCATCTTTCATAGCCTCA
GGCGCGAGGAACTGCGTCAGATCGTTGCCCTTCAGGTCAATCGGCTGCGTGAGCGTCTTGGCGATCGCAAGCTTGGCCTA
GAGATCAGTGATACAGCAGCTGATTGGCTCGCTAATGCTGGCTATGACCCTGTTTATGGGGCCAGACCCCTCAAGCGAGC
GATCCAACGCGAGCTTGAAACTCCAATAGCCAAAAGTATCCTGGCTGGCTTTTATGGAGATAGTCAGATTGTGCATGTGG
ATGTGGACGAGGAGCGTCTGAGCTTTCGATAA

Protein sequence :
MQPTADQFTEKGWAAIVLAQQLAQQRKHQQLETEHLLLSLLEQNALAGRILEKAGVSIGNLQTAVEAHLHEQPTLQAAPD
SVYLGKGVNDLLDQADKHKQAFGDGFISIEHLLLALAGDNRCGRKLLNQAGVDAGKLKVAIDAVRGNQKVTDQNPEGTYE
SLEKYGRDLTAAAREGKLDPVIGRDDEIRRTIQILSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVNGDVPAALQNRQLIA
LDMGALIAGAKYRGEFEERLKAVLKEVTASEGQIVLFIDEIHTVVGAGATGGAMDASNLLKPMLARGELRCIGATTLDEH
RQHIEKDPALERRFQQVLVDQPTVQDTISILRGLKERYEVHHGVRIADNALVAAAVLSSRYIADRFLPDKAIDLMDESAA
RLKMEITSKPEEIDEIDRKIVQLEMEKLSLGRESDSVSKERLEKLERELAELAEQQSALNAQWQQEKGAIDDLSSLKEEI
ERVQLQVEQAKRSYDLNKAAELEYGTLAGLQKQLSEKETALAQDGEAGDKSLLREEVTEDDIADVIAKWTGIPVAKLVQS
EMEKLLGLEAELHQRVIGQEQAVQAVADAIQRSRAGLSDPNRPIASFLFLGPTGVGKTELSKALASQLFDSEAALVRIDM
SEYMEKHSVSRLIGAPPGYVGYEAGGQLTEAVRRRPYAVILFDEVEKAHPDVFNVMLQILDDGRVTDGQGRTVDFTNTVL
ILTSNIGSQSILDLGGDDSQYGEMERRVHDALHAHFRPEFLNRLDETIIFHSLRREELRQIVALQVNRLRERLGDRKLGL
EISDTAADWLANAGYDPVYGARPLKRAIQRELETPIAKSILAGFYGDSQIVHVDVDEERLSFR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 2e-99 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB YP_001017839.1 ATP-dependent Clp protease Hsp 100, ATP-binding subunit ClpB VFG2084 Protein 5e-108 44
clpB YP_001017839.1 ATP-dependent Clp protease Hsp 100, ATP-binding subunit ClpB VFG2076 Protein 2e-121 43