Gene Information

Name : clpB (BN44_10426)
Accession : YP_007259109.1
Strain : Mycobacterium canettii CIPT 140060008
Genome accession: NC_019950
Putative virulence/resistance : Virulence
Product : Putative endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (Heat Shock Protein f84.1)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 467867 - 470413 bp
Length : 2547 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; PubMedId : 10493122, 10510226, 11271494, 11385512, 11567012, 12368446, 12657046, 15525680, 17611072; Product type pf : putative factor
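
The Position and Length fields above agree with each other if the coordinates are read as 1-based and inclusive. The short Python sketch below checks that arithmetic and derives the expected protein length. The function names are hypothetical, and the inclusive-coordinate convention is an assumption rather than something the record states.

```python
# Assumption: Position is given in 1-based, inclusive genome coordinates.
# The function names below are illustrative and not part of any database API.

def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end, both ends included."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Residues encoded by a complete CDS whose final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length_bp = feature_length(467867, 470413)
print(length_bp)                           # length in bp
print(expected_protein_length(length_bp))  # residues, stop codon excluded
```

With the coordinates from this record the sketch gives 2547 bp and 848 residues, which matches the Length field and the protein sequence listed in this record.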

DNA sequence :
GTGGACTCGTTTAACCCGACGACCAAGACGCAGGCGGCGCTAACCGCGGCGTTACAGGCGGCTTCGACCGCCGGCAATCC
CGAGATCCGGCCCGCTCACCTGCTGATGGCGCTGCTGACCCAAAACGACGGTATCGCCGCACCGCTACTGGAGGCTGTCG
GTGTCGAGCCCGCCACCGTCCGCGCCGAAACCCAGCGCCTGCTCGACCGTTTGCCGCAGGCGACTGGAGCCAGCACGCAG
CCGCAGCTGTCCCGCGAGTCGTTAGCGGCGATCACCACCGCGCAGCAGCTGGCCACCGAGCTGGACGACGAGTACGTCTC
CACCGAGCACGTGATGGTCGGGCTGGCCACCGGTGACTCCGACGTCGCCAAGCTGTTGACCGGCCACGGCGCCTCGCCGC
AGGCGCTGCGGGAGGCGTTCGTCAAGGTGCGCGGCAGCGCCCGGGTCACCAGCCCCGAACCGGAGGCGACCTATCAGGCG
CTGCAGAAGTACTCCACCGACCTGACCGCCCGCGCCCGCGAAGGCAAACTCGACCCGGTCATCGGCCGCGACAACGAGAT
CCGCCGCGTGGTGCAGGTGCTGTCCCGTCGCACCAAGAACAACCCGGTGCTGATCGGTGAGCCCGGCGTCGGCAAGACCG
CGATCGTGGAGGGCCTGGCGCAGCGCATCGTGGCCGGCGACGTGCCGGAGAGCTTGCGCGACAAGACCATCGTCGCGCTC
GATCTCGGCTCGATGGTCGCCGGCTCCAAATACCGCGGCGAATTCGAGGAACGGCTCAAGGCCGTCCTCGACGACATCAA
GAACTCGGCCGGCCAAATCATCACGTTCATCGACGAGCTGCACACCATCGTCGGCGCCGGCGCCACCGGCGAGGGGGCGA
TGGACGCCGGCAACATGATCAAGCCGATGCTGGCCCGCGGCGAGTTACGGCTGGTCGGGGCGACCACGCTGGACGAGTAC
CGCAAGCACATCGAGAAGGACGCCGCGCTCGAGCGCCGTTTCCAACAGGTGTACGTCGGCGAGCCGTCGGTGGAGGACAC
CATCGGCATCCTGCGCGGGCTCAAAGACCGCTACGAGGTGCACCACGGGGTGCGCATCACCGACTCGGCGCTGGTGGCAG
CTGCCACTTTGAGCGACCGGTATATCACCGCCCGCTTCCTGCCCGACAAGGCCATCGACCTGGTCGACGAGGCGGCCAGC
CGGCTGCGGATGGAGATCGACTCGCGGCCCGTCGAGATCGACGAGGTCGAGCGGCTGGTGCGCCGGCTGGAGATCGAAGA
GATGGCGCTGTCCAAAGAAGAAGACGAAGCGTCGGCGGAGCGGTTGGCCAAGCTGCGCTCCGAGCTGGCCGATCAGAAAG
AGAAGTTGGCCGAGCTCACCACCCGCTGGCAGAACGAGAAGAACGCGATCGAAATCGTCCGCGACCTCAAGGAGCAGCTG
GAAGCCCTGCGCGGGGAATCCGAGCGGGCCGAACGCGACGGCGACCTGGCCAAGGCCGCCGAGCTGCGCTACGGACGCAT
CCCCGAGGTGGAGAAGAAGCTCGACGCGGCGTTGCCGCAGGCGCAGGCCCGGGAGCAGGTGATGCTCAAGGAGGAGGTCG
GTCCCGACGACATCGCCGACGTGGTGTCGGCGTGGACCGGCATCCCGGCCGGTCGGCTGCTGGAAGGCGAGACCGCCAAG
CTGCTGCGCATGGAAGACGAGCTGGGCAAGCGGGTCATCGGGCAGAAGGCCGCGGTTACCGCAGTCTCTGATGCGGTGCG
GCGCAGCCGGGCCGGGGTGTCCGACCCCAACCGGCCCACCGGGGCGTTCATGTTCCTCGGCCCGACCGGTGTCGGCAAGA
CCGAGCTGGCCAAGGCGCTGGCCGACTTCCTGTTCGACGACGAGCGGGCGATGGTCCGCATCGACATGAGCGAGTACGGC
GAGAAGCACACCGTGGCTCGGTTGATCGGCGCCCCGCCCGGCTATGTGGGATACGAGGCGGGCGGTCAGCTGACCGAGGC
GGTGCGCCGGCGTCCCTACACGGTGGTGCTGTTCGACGAGATCGAGAAGGCGCACCCGGACGTGTTCGACGTGCTGCTGC
AGGTCCTCGACGAGGGCCGGCTCACCGACGGGCACGGCCGCACGGTCGACTTCCGCAACACCATCTTGATCCTGACGTCC
AACCTGGGGTCGGGTGGCAGCGCCGAGCAGGTGCTGGCCGCGGTGCGCGCTACGTTCAAGCCGGAGTTCATCAACCGGCT
CGACGACGTGCTCATCTTTGAGGGTCTCAACCCCGAAGAGCTGGTGCGCATCGTCGACATCCAGCTGGCGCAGCTGGGCA
AGCGGCTGGCGCAGCGGCGGCTGCAGCTGCAGGTCTCGCTGCCGGCCAAGCGCTGGTTGGCGCAGCGCGGATTCGACCCG
GTGTACGGGGCGCGGCCGTTGCGCCGGCTGGTGCAGCAGGCCATCGGTGACCAGCTGGCCAAGATGCTGTTGGCCGGCCA
GGTGCACGACGGCGATACCGTGCCGGTCAACGTCAGCCCCGACGCCGACTCGCTGATCCTGGGCTGA

Protein sequence :
MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDGIAAPLLEAVGVEPATVRAETQRLLDRLPQATGASTQ
PQLSRESLAAITTAQQLATELDDEYVSTEHVMVGLATGDSDVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQA
LQKYSTDLTARAREGKLDPVIGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLRDKTIVAL
DLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITFIDELHTIVGAGATGEGAMDAGNMIKPMLARGELRLVGATTLDEY
RKHIEKDAALERRFQQVYVGEPSVEDTIGILRGLKDRYEVHHGVRITDSALVAAATLSDRYITARFLPDKAIDLVDEAAS
RLRMEIDSRPVEIDEVERLVRRLEIEEMALSKEEDEASAERLAKLRSELADQKEKLAELTTRWQNEKNAIEIVRDLKEQL
EALRGESERAERDGDLAKAAELRYGRIPEVEKKLDAALPQAQAREQVMLKEEVGPDDIADVVSAWTGIPAGRLLEGETAK
LLRMEDELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPTGAFMFLGPTGVGKTELAKALADFLFDDERAMVRIDMSEYG
EKHTVARLIGAPPGYVGYEAGGQLTEAVRRRPYTVVLFDEIEKAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTS
NLGSGGSAEQVLAAVRATFKPEFINRLDDVLIFEGLNPEELVRIVDIQLAQLGKRLAQRRLQLQVSLPAKRWLAQRGFDP
VYGARPLRRLVQQAIGDQLAKMLLAGQVHDGDTVPVNVSPDADSLILG
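
The DNA sequence begins with GTG rather than ATG, yet the protein begins with M. In the bacterial genetic code, alternative initiator codons such as GTG are read as methionine when they start a coding sequence. The following is a minimal Python sketch of that translation step, assuming the DNA shown is the coding strand in the 5' to 3' direction. The `translate` helper is hypothetical and is not how the database itself produced the protein sequence.

```python
from itertools import product

# Standard genetic code laid out in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the input starts at the initiator codon, so the first
    codon is always reported as M (covers ATG, GTG and TTG starts).
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First 30 bases of the DNA sequence in this record.
print(translate("GTGGACTCGTTTAACCCGACGACCAAGACG"))  # MDSFNPTTKT
```

The output matches the first ten residues of the protein sequence above. Passing the full 2547 bp sequence (line breaks are ignored) should reproduce the complete protein under the same assumptions.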

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
STY0294 | NP_454876.1 | ClpB-like protein | Not tested | SPI-6 | Protein | 1e-104 | 43
aec27 | YP_851418.1 | ATPase | Not tested | PAI II APEC-O1 | Protein | 2e-105 | 42
aec27 | AAQ96721.1 | Aec27 | Not tested | AGI-1 | Protein | 2e-105 | 42

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
clpB | YP_007259109.1 | Putative endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (Heat Shock Protein f84.1) | VFG2076 | Protein | 4e-112 | 44
clpB | YP_007259109.1 | Putative endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (Heat Shock Protein f84.1) | VFG2084 | Protein | 1e-111 | 41