Gene Information

Name : clpB (BS1330_I1858)
Accession : YP_005616668.1
Strain :
Genome accession: NC_017251
Putative virulence/resistance : Virulence
Product : ATP-dependent Clp protease, ATP-binding subunit ClpB
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1800011 - 1802635 bp
Length : 2625 bp
Strand : -
Note : identified by similarity to EGAD:20010; match to protein family HMM PF00004; match to protein family HMM PF02861; match to protein family HMM TIGR01369; corresponds to BR1864

DNA sequence :
ATGAATATCGAAAAATATACAGAACGGGTCCGCGGCTTCATCCAGTCTGCCCAGACCTTCGCGCTTTCCTCCGGCAATCA
GCAATTCACTCCAGAACATATCCTGAAGGTTCTCATCGACGACGATGAGGGGTTGGCGGCGTCGCTGGTCGAGCGGGCTG
GCGGACGGGTTGGCGATGTGCGCATGGGCCTGCAAAGCGCGCTTGAAAAACTGCCGAAAGTTTCGGGTGGCAATGACCAG
CTCTATCTTTCGCAGCCTCTTGCCAAGGTGTTCTCGCTTGCCGAGGAACTGGCCAGCAAGGCGGGTGACAGCTTCGTCAC
CGTCGAGCGCCTCCTGACGGCGCTGGCGATGGAAAAATCCGCCAAAACCTCCGAGATACTTTCTGCTGCGGGCGTCACCC
CGACGGCATTGAACAGGGTTATCAACGATATGCGCAAAGGCCGCACCGCCGATTCCGCCTCAGCCGAAAGCAATTATGAT
GCTTTGAAGAAATATGCGCGCGATCTGACGGAAGACGCACGCGCGGGCAAGCTTGATCCGGTCATCGGCCGCGATGAGGA
AATCCGCCGCACGATCCAGGTTCTGTCACGCCGCACCAAGAACAATCCGGTGCTGATCGGTGAGCCGGGCGTGGGCAAGA
CCGCAATCGCGGAAGGTCTGGCGCTGCGCATCGTCAATGGCGACGTGCCCGAATCGCTGAAGGACAAGCAATTGATGGCG
CTCGACATGGGCGCGCTGATTGCGGGCGCCAAATATCGCGGCGAATTCGAGGAACGCCTGAAGGCCGTTCTCTCGGAAGT
GCAGACGGCTGCCGGGCAGATCATCCTCTTCATCGACGAAATGCACACATTGGTCGGCGCGGGCAAGACGGATGGCGCAA
TGGATGCGTCGAACCTTTTGAAGCCTGCTCTGGCGCGCGGTGAATTGCATTGCGTCGGCGCCACCACGCTTGAGGAATAT
CGCAAATATGTGGAGAAGGACGCCGCCCTTGCGCGCCGTTTCCAGCCCGTTTTTGTCGATGAGCCAACAGTGGAGGACAC
AATCTCCATTCTGCGTGGCCTGAAGGAGAAATACGAGCAGCACCATAAGGTGCGCGTGTCGGACTCCGCGCTGGTGGCTG
CGGCAACGCTCTCAAACCGCTATATCACCGACCGTTTCCTGCCGGACAAGGCCATCGACCTTGTGGATGAGGCTGCATCG
CGCCTGCGTATGCACGTCGATTCCAAGCCGGAAGAACTGGACGAGATCGATCGCCGCATCATGCAGCTCAAGATCGAGCG
CGAAGCACTGAAGGTGGAAACGGACGCCGCTTCCAAGGACCGCTTGCAGCGCATTGAAAAGGAACTGAGCGATCTTGAGG
AAGAATCAGCCGAACTGACCGCCAAGTGGCAGGCGGAAAAGCAGAAGCTTGGGCTTGCTGCCGATCTCAAGCGTCAGCTT
GAAGAGGCACGCAATGCACTTGCCATTGCGCAGCGCAATGGTGAGTTTCAGAAAGCGGGCGAGCTTGCCTATGGCACGAT
CCCGCAACTGGAAAAGCAACTTGCTGATGCGGAAAGTCAGGAAAACAAGGGTTCCCTCCTGGAAGAAACCGTGACGCCGG
ACCATGTGGCGCAGGTCATTTCGCGCTGGACCGGCATTCCGGTTGACCGGATGCTGGAAGGTGAGCGCGAAAAGCTGTTG
CGCATGGAAGACGAGATCGGCAAGCGCGTCGTCGGCCAGGGCGAGGCGGTGCAGGCCATATCCAAGGCGGTGCGCCGTGC
CCGTGCCGGTCTGCAGGACCCGAACCGGCCCATCGGCTCGTTCATCTTCCTCGGCCCCACTGGCGTCGGCAAGACGGAAC
TCACCAAGGCGCTCGCCTCGTTCCTGTTCCAGGATGATACCGCAATGGTGCGTATCGATATGTCGGAATTCATGGAAAAG
CATTCCGTGAGCCGCCTTATCGGTGCTCCTCCCGGCTATGTCGGCTATGAAGAGGGTGGTGTGCTGACCGAAGCTGTCCG
GCGCAGGCCCTATCAGGTGATCCTGTTCGACGAGATTGAAAAGGCGCACCCGGATGTCTTCAATGTTCTGTTGCAGGTGC
TGGACGACGGACGCCTGACGGACGGCCAGGGCCGCACGGTTGATTTCCGCAATACGGTCATCATCATGACGTCAAACCTC
GGCGCGGAATATCTCGTCAATCTGGGCGAAAACGATGATGTCGAGACTGTTCGCGATGATGTGATGGGCGTGGTCCGTGC
TTCGTTCCGGCCGGAATTCCTCAACCGTGTTGATGAAATCATCCTGTTCCACCGGCTGCGCCGCGAAGACATGGGTGCAA
TTGTCGATATCCAGATGCAGCGTCTCCAATATCTGCTTTCCGATCGCAAGATCACCTTGCAGCTTGAGGACGACGCCCGC
GAATGGCTCGCCAACAAGGGGTATGACCCGGCCTATGGCGCGCGCCCGCTGAAGCGCGTAATCCAGAAGGAAGTGCAGGA
TCCGCTGGCCGAACGTATCCTGCTTGGTGATATTCTCGACGGTTCCCTCGTCAAGATCACTGCCGGTTCGGATCGGCTCA
ATTTCCGTCCGATCAGCGGTGCGTTCAGCGCGGCGGAGCCGGAAAGAGAAGACGAGAAAGCCTGA

Protein sequence :
MNIEKYTERVRGFIQSAQTFALSSGNQQFTPEHILKVLIDDDEGLAASLVERAGGRVGDVRMGLQSALEKLPKVSGGNDQ
LYLSQPLAKVFSLAEELASKAGDSFVTVERLLTALAMEKSAKTSEILSAAGVTPTALNRVINDMRKGRTADSASAESNYD
ALKKYARDLTEDARAGKLDPVIGRDEEIRRTIQVLSRRTKNNPVLIGEPGVGKTAIAEGLALRIVNGDVPESLKDKQLMA
LDMGALIAGAKYRGEFEERLKAVLSEVQTAAGQIILFIDEMHTLVGAGKTDGAMDASNLLKPALARGELHCVGATTLEEY
RKYVEKDAALARRFQPVFVDEPTVEDTISILRGLKEKYEQHHKVRVSDSALVAAATLSNRYITDRFLPDKAIDLVDEAAS
RLRMHVDSKPEELDEIDRRIMQLKIEREALKVETDAASKDRLQRIEKELSDLEEESAELTAKWQAEKQKLGLAADLKRQL
EEARNALAIAQRNGEFQKAGELAYGTIPQLEKQLADAESQENKGSLLEETVTPDHVAQVISRWTGIPVDRMLEGEREKLL
RMEDEIGKRVVGQGEAVQAISKAVRRARAGLQDPNRPIGSFIFLGPTGVGKTELTKALASFLFQDDTAMVRIDMSEFMEK
HSVSRLIGAPPGYVGYEEGGVLTEAVRRRPYQVILFDEIEKAHPDVFNVLLQVLDDGRLTDGQGRTVDFRNTVIIMTSNL
GAEYLVNLGENDDVETVRDDVMGVVRASFRPEFLNRVDEIILFHRLRREDMGAIVDIQMQRLQYLLSDRKITLQLEDDAR
EWLANKGYDPAYGARPLKRVIQKEVQDPLAERILLGDILDGSLVKITAGSDRLNFRPISGAFSAAEPEREDEKA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 4e-112 43
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 3e-112 43
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 3e-105 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB YP_005616668.1 ATP-dependent Clp protease, ATP-binding subunit ClpB VFG2076 Protein 4e-123 42
clpB YP_005616668.1 ATP-dependent Clp protease, ATP-binding subunit ClpB VFG2084 Protein 3e-113 41