Gene Information

Name : clpB2 (H16_B2428)
Accession : YP_841940.1
Strain :
Genome accession: NC_008314
Putative virulence/resistance : Virulence
Product : ATP-dependent protease Clp, ATPase subunit
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 2740820 - 2743540 bp
Length : 2721 bp
Strand : -
Note : -

DNA sequence :
ATGGACATCGATATCCGCACGCTGCTCAGCCGCCTCAATCCCGAGTGCCGGCACGCGATGGAGCAGGCCGCCCAGCTTTG
CGTCCGGCAGACGCACTACAGCGTCGACGTCGAGCACCTGCTGCTGCAATTGCTGGAAGGCGGCGCCACGGACCTGCAGG
CGATCCTCGCGCACTTCGACCTGGCGCCCGCGCAGGTGGTGTCGCAGCTGCAGAAGGCGATCGACGGCTTCAAGCGCGGC
AACGGCCGCACGCCGGCGCTGTCGCCTAACTTCTCGCCGCTGTTCCAGGAGGCCTGGCTGCTCAGTTCGATGCTGCTCGG
CGAGCAGCAAGTGCGCGCCGGCACGCTGCTGCTGGCACTGCTGGAGGTGGAAAGCCTGCGCGGCATGCTGCTGGAATCGG
CGCCGGCGCTGCTGCGGATTCCGCGTGCGGCGCTGCGCGAGGCCCTGCCCGCGCTGCTGGGGGCATCCGCCGTTGGCGGC
GGGCCCGGAACCGATGCAATCCCGGCAGCGGCGGCCACGCCAGCGTTCGGCATGCCGCAGCGCGGACCCGGCAGCGGGCA
GGCATCGGCGCTGGAGCAGTTCACTGTCGACCTGACGGCGCTGGCGCGAGCCGGAGCGATCGATCCGGTGCGCGGGCGCG
ACAGTGAAATCCGGCAACTGATCGATGTGCTGCTGCGCCGCCGCCAGAACAACCCGATCCTGACCGGCGAGGCCGGCGTG
GGCAAGACCGCGGTGGTCGAAGGCTTTGCCCAGCGCATCGTCTCCGGCGACGTGCCGCCGGCGCTGCGCCAAGTGTCGGT
GCGCTCGCTGGACCTGGCGTTGCTGCAGGCCGGCGCCGGCGTCAAGGGCGAGTTCGAGAACCGGCTCAAGTCCGTGATCG
CCGAGGTGGCGGCCTCGCCGGTGCCGGTGATCCTGTTTATCGACGAAGCGCACCAGCTGATTGGCGCCGGCGGCAGCGAA
GGGCAGGGCGATGCCGCCAACCTGCTCAAGCCCGCGCTGGCGCGCGGCGAGTTGCGCACCATTGCCGCGACCACCTGGGC
CGAGTACAAGAAGTACATCGAGCGCGACCCAGCGCTCGCGCGCCGCTTCCAGATCGTCAAGGTGGACGAGCCGGCCGAGG
CCGCGGCCGTGGACATGCTGCGCGGCATGGTGCGCAAGCTTGAAGCGCACCACGGCGTGGAGATCCTCGACGACGCCGTG
CGCGACGCGGTCAAGCTTTCGCACCGCTATGTCTCGGGCCGGCAGTTGCCCGACAAGGCCATCAGCGTGCTCGACACCGC
CTGCGCCCGCGTGGCGGCCGCGCAGAGCGGCGTGCCCGAGGCGATCGAAGCGCTTGGGCGCTCGATCGAAAGCGCCGAGA
ACCAGCTGCGCATCCTGCGCCATGAGGCGGCCACCGGCACCGGGCGCGCCGACGAAATCGCTGCCGTCACGCGCCAGCGC
GACGAAGCCCGGGCGCAGCACGCGCGGCTCAGCGACAAGCTCGCCACCGAGAAGCTGGCCGTCGAGGAAATCCTGGCCTG
GCGCCGCCGGATCGCGGCGTTCCTCGAGAAGGAGAGCCAGGCCGAGGCCGGTGCCGACGGTGTGGCGAGGCATGAGGACG
ACGACAACGCCGACAGCGCGGAGTCGCTTGGCGCCAGTCTCGCGCGCCTGGAAAAAGGCCTGGAGGCGGTGCAGAACGAT
GAGCCGATGGTGCCGGTATGCGTGGACTCGGCCGCAGTGGCCGAGGTCATTTCGGGATGGACCGGTGTCCCGGTCGGCCG
CATGCTGGCCGACGAACTGCACACGGTGCTGTACCTCCAGGACAAGCTCGGCGAGCGCGTGGTGGGCCAGGACGAGGCCC
TCGACGCGATCGCGCGCCGCATCCGCACGTTCCGCGCCGATCTCGATGACCCGGGCAAGCCGGTCGGCGTGTTCCTGCTG
GTCGGCCCGAGCGGCGTCGGCAAGACCGAGACCGCGCATGCGCTGGCCGACCTGCTCTACGGCGGCGAACGCAACATGAT
CACGGTCAACATGTCGGAGTTCCAGGAGGCCCACAGCGTGTCGGGCCTGAAGGGCGCGCCGCCGGGCTATGTCGGTTACG
GCCGGGGTGGCGTGCTGACCGAGGCGGTGCGCCGCCGCCCGTACAGCGTGGTGCTGCTCGACGAGATGGAGAAGGCGCAT
CCCGACGTGCTGGAGCTGTTCTTCCAGGTCTTCGACAAGGGCGTGATGGAAGACGGCGAAGGCGTGCCGATCGATTTCCG
CAACACGGTGATCCTGCTGACCTCGAACGCGGCGCAGGACGTGATCACCGAGGCCTCGCGCGGCGGCCGGCGCCCGCCGC
CGCAGGAACTGGTCGAGAAGCTGCGCCCGGCGCTGCTCAGGCAGTTCAGCCCGGCGTTCCTGGCGCGCCTGGTGCTGGTG
CCGTACTACCACCTGGGCGATGCGCAGATCCGCAATATCGTCGACCTCAAGCTGGCGCGGCTGGCGCAGCGCTTTGCGCG
CAACCACAACGCGCGCCTGAGCTGGGACGAGGCGCTGGCGCAGGCGATCACGCAGCGCTGCACCGAGGTCGACAGCGGCG
CGCGCAATGTCGACCATATCCTGACCCAGTCGGTGCTGCCGGAACTGGCGCGGCGCGTGCTGGAGCAGCTGTCGATCTCG
GAGCCGTTCGGCGGCGTGCATCTCTCGCTCGACACCGGCGGCGGGGTGGCGTTCCGCTTCCTGCCGCAAGCGGGGTGCTG
A

Protein sequence :
MDIDIRTLLSRLNPECRHAMEQAAQLCVRQTHYSVDVEHLLLQLLEGGATDLQAILAHFDLAPAQVVSQLQKAIDGFKRG
NGRTPALSPNFSPLFQEAWLLSSMLLGEQQVRAGTLLLALLEVESLRGMLLESAPALLRIPRAALREALPALLGASAVGG
GPGTDAIPAAAATPAFGMPQRGPGSGQASALEQFTVDLTALARAGAIDPVRGRDSEIRQLIDVLLRRRQNNPILTGEAGV
GKTAVVEGFAQRIVSGDVPPALRQVSVRSLDLALLQAGAGVKGEFENRLKSVIAEVAASPVPVILFIDEAHQLIGAGGSE
GQGDAANLLKPALARGELRTIAATTWAEYKKYIERDPALARRFQIVKVDEPAEAAAVDMLRGMVRKLEAHHGVEILDDAV
RDAVKLSHRYVSGRQLPDKAISVLDTACARVAAAQSGVPEAIEALGRSIESAENQLRILRHEAATGTGRADEIAAVTRQR
DEARAQHARLSDKLATEKLAVEEILAWRRRIAAFLEKESQAEAGADGVARHEDDDNADSAESLGASLARLEKGLEAVQND
EPMVPVCVDSAAVAEVISGWTGVPVGRMLADELHTVLYLQDKLGERVVGQDEALDAIARRIRTFRADLDDPGKPVGVFLL
VGPSGVGKTETAHALADLLYGGERNMITVNMSEFQEAHSVSGLKGAPPGYVGYGRGGVLTEAVRRRPYSVVLLDEMEKAH
PDVLELFFQVFDKGVMEDGEGVPIDFRNTVILLTSNAAQDVITEASRGGRRPPPQELVEKLRPALLRQFSPAFLARLVLV
PYYHLGDAQIRNIVDLKLARLAQRFARNHNARLSWDEALAQAITQRCTEVDSGARNVDHILTQSVLPELARRVLEQLSIS
EPFGGVHLSLDTGGGVAFRFLPQAGC

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 5e-147 50
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 1e-129 48
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 1e-129 48

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB2 YP_841940.1 ATP-dependent protease Clp, ATPase subunit VFG2076 Protein 1e-176 56
clpB2 YP_841940.1 ATP-dependent protease Clp, ATPase subunit VFG2084 Protein 3e-134 47