Gene Information

Name : clpB (BR1864)
Accession : NP_698844.1
Strain :
Genome accession: NC_004310
Putative virulence/resistance : Virulence
Product : ATP-dependent Clp protease, ATP-binding subunit ClpB
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 1800024 - 1802648 bp
Length : 2625 bp
Strand : -
Note : identified by similarity to EGAD:20010; match to protein family HMM PF00004; match to protein family HMM PF02861; match to protein family HMM TIGR01369

DNA sequence :
ATGAATATCGAAAAATATACAGAACGGGTCCGCGGCTTCATCCAGTCTGCCCAGACCTTCGCGCTTTCCTCCGGCAATCA
GCAATTCACTCCAGAACATATCCTGAAGGTTCTCATCGACGACGATGAGGGGTTGGCGGCGTCGCTGGTCGAGCGGGCTG
GCGGACGGGTTGGCGATGTGCGCATGGGCCTGCAAAGCGCGCTTGAAAAACTGCCGAAAGTTTCGGGTGGCAATGACCAG
CTCTATCTTTCGCAGCCTCTTGCCAAGGTGTTCTCGCTTGCCGAGGAACTGGCCAGCAAGGCGGGTGACAGCTTCGTCAC
CGTCGAGCGCCTCCTGACGGCGCTGGCGATGGAAAAATCCGCCAAAACCTCCGAGATACTTTCTGCTGCGGGCGTCACCC
CGACGGCATTGAACAGGGTTATCAACGATATGCGCAAAGGCCGCACCGCCGATTCCGCCTCAGCCGAAAGCAATTATGAT
GCTTTGAAGAAATATGCGCGCGATCTGACGGAAGACGCACGCGCGGGCAAGCTTGATCCGGTCATCGGCCGCGATGAGGA
AATCCGCCGCACGATCCAGGTTCTGTCACGCCGCACCAAGAACAATCCGGTGCTGATCGGTGAGCCGGGCGTGGGCAAGA
CCGCAATCGCGGAAGGTCTGGCGCTGCGCATCGTCAATGGCGACGTGCCCGAATCGCTGAAGGACAAGCAATTGATGGCG
CTCGACATGGGCGCGCTGATTGCGGGCGCCAAATATCGCGGCGAATTCGAGGAACGCCTGAAGGCCGTTCTCTCGGAAGT
GCAGACGGCTGCCGGGCAGATCATCCTCTTCATCGACGAAATGCACACATTGGTCGGCGCGGGCAAGACGGATGGCGCAA
TGGATGCGTCGAACCTTTTGAAGCCTGCTCTGGCGCGCGGTGAATTGCATTGCGTCGGCGCCACCACGCTTGAGGAATAT
CGCAAATATGTGGAGAAGGACGCCGCCCTTGCGCGCCGTTTCCAGCCCGTTTTTGTCGATGAGCCAACAGTGGAGGACAC
AATCTCCATTCTGCGTGGCCTGAAGGAGAAATACGAGCAGCACCATAAGGTGCGCGTGTCGGACTCCGCGCTGGTGGCTG
CGGCAACGCTCTCAAACCGCTATATCACCGACCGTTTCCTGCCGGACAAGGCCATCGACCTTGTGGATGAGGCTGCATCG
CGCCTGCGTATGCACGTCGATTCCAAGCCGGAAGAACTGGACGAGATCGATCGCCGCATCATGCAGCTCAAGATCGAGCG
CGAAGCACTGAAGGTGGAAACGGACGCCGCTTCCAAGGACCGCTTGCAGCGCATTGAAAAGGAACTGAGCGATCTTGAGG
AAGAATCAGCCGAACTGACCGCCAAGTGGCAGGCGGAAAAGCAGAAGCTTGGGCTTGCTGCCGATCTCAAGCGTCAGCTT
GAAGAGGCACGCAATGCACTTGCCATTGCGCAGCGCAATGGTGAGTTTCAGAAAGCGGGCGAGCTTGCCTATGGCACGAT
CCCGCAACTGGAAAAGCAACTTGCTGATGCGGAAAGTCAGGAAAACAAGGGTTCCCTCCTGGAAGAAACCGTGACGCCGG
ACCATGTGGCGCAGGTCATTTCGCGCTGGACCGGCATTCCGGTTGACCGGATGCTGGAAGGTGAGCGCGAAAAGCTGTTG
CGCATGGAAGACGAGATCGGCAAGCGCGTCGTCGGCCAGGGCGAGGCGGTGCAGGCCATATCCAAGGCGGTGCGCCGTGC
CCGTGCCGGTCTGCAGGACCCGAACCGGCCCATCGGCTCGTTCATCTTCCTCGGCCCCACTGGCGTCGGCAAGACGGAAC
TCACCAAGGCGCTCGCCTCGTTCCTGTTCCAGGATGATACCGCAATGGTGCGTATCGATATGTCGGAATTCATGGAAAAG
CATTCCGTGAGCCGCCTTATCGGTGCTCCTCCCGGCTATGTCGGCTATGAAGAGGGTGGTGTGCTGACCGAAGCTGTCCG
GCGCAGGCCCTATCAGGTGATCCTGTTCGACGAGATTGAAAAGGCGCACCCGGATGTCTTCAATGTTCTGTTGCAGGTGC
TGGACGACGGACGCCTGACGGACGGCCAGGGCCGCACGGTTGATTTCCGCAATACGGTCATCATCATGACGTCAAACCTC
GGCGCGGAATATCTCGTCAATCTGGGCGAAAACGATGATGTCGAGACTGTTCGCGATGATGTGATGGGCGTGGTCCGTGC
TTCGTTCCGGCCGGAATTCCTCAACCGTGTTGATGAAATCATCCTGTTCCACCGGCTGCGCCGCGAAGACATGGGTGCAA
TTGTCGATATCCAGATGCAGCGTCTCCAATATCTGCTTTCCGATCGCAAGATCACCTTGCAGCTTGAGGACGACGCCCGC
GAATGGCTCGCCAACAAGGGGTATGACCCGGCCTATGGCGCGCGCCCGCTGAAGCGCGTAATCCAGAAGGAAGTGCAGGA
TCCGCTGGCCGAACGTATCCTGCTTGGTGATATTCTCGACGGTTCCCTCGTCAAGATCACTGCCGGTTCGGATCGGCTCA
ATTTCCGTCCGATCAGCGGTGCGTTCAGCGCGGCGGAGCCGGAAAGAGAAGACGAGAAAGCCTGA

Protein sequence :
MNIEKYTERVRGFIQSAQTFALSSGNQQFTPEHILKVLIDDDEGLAASLVERAGGRVGDVRMGLQSALEKLPKVSGGNDQ
LYLSQPLAKVFSLAEELASKAGDSFVTVERLLTALAMEKSAKTSEILSAAGVTPTALNRVINDMRKGRTADSASAESNYD
ALKKYARDLTEDARAGKLDPVIGRDEEIRRTIQVLSRRTKNNPVLIGEPGVGKTAIAEGLALRIVNGDVPESLKDKQLMA
LDMGALIAGAKYRGEFEERLKAVLSEVQTAAGQIILFIDEMHTLVGAGKTDGAMDASNLLKPALARGELHCVGATTLEEY
RKYVEKDAALARRFQPVFVDEPTVEDTISILRGLKEKYEQHHKVRVSDSALVAAATLSNRYITDRFLPDKAIDLVDEAAS
RLRMHVDSKPEELDEIDRRIMQLKIEREALKVETDAASKDRLQRIEKELSDLEEESAELTAKWQAEKQKLGLAADLKRQL
EEARNALAIAQRNGEFQKAGELAYGTIPQLEKQLADAESQENKGSLLEETVTPDHVAQVISRWTGIPVDRMLEGEREKLL
RMEDEIGKRVVGQGEAVQAISKAVRRARAGLQDPNRPIGSFIFLGPTGVGKTELTKALASFLFQDDTAMVRIDMSEFMEK
HSVSRLIGAPPGYVGYEEGGVLTEAVRRRPYQVILFDEIEKAHPDVFNVLLQVLDDGRLTDGQGRTVDFRNTVIIMTSNL
GAEYLVNLGENDDVETVRDDVMGVVRASFRPEFLNRVDEIILFHRLRREDMGAIVDIQMQRLQYLLSDRKITLQLEDDAR
EWLANKGYDPAYGARPLKRVIQKEVQDPLAERILLGDILDGSLVKITAGSDRLNFRPISGAFSAAEPEREDEKA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 4e-112 43
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 3e-112 43
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 3e-105 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB NP_698844.1 ATP-dependent Clp protease, ATP-binding subunit ClpB VFG2076 Protein 4e-123 42
clpB NP_698844.1 ATP-dependent Clp protease, ATP-binding subunit ClpB VFG2084 Protein 3e-113 41