Gene Information

Name : clpB (VFMJ11_0578)
Accession : YP_002155324.1
Strain :
Genome accession: NC_011184
Putative virulence/resistance : Virulence
Product : ATP-dependent chaperone ClpB
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 602287 - 604872 bp
Length : 2586 bp
Strand : +
Note : identified by match to protein family HMM PF00004; match to protein family HMM PF02861; match to protein family HMM PF07724; match to protein family HMM PF07728; match to protein family HMM TIGR03346

DNA sequence :
ATGCGTTTAGATCGATTCACTAGTAAATTTCAAATGGCCATCTCTGATGCTCAATCATTAGCATTAGGTCAAGATCATCA
ATATATCGAACCAACCCATTTAATGGTTGCACTGTTAAATCAGGATGGAAGTACGATAAGGCCTTTACTTACTCTTCTAA
ATGTAGACGTAACACAATTACGTTCAAAACTAACTGAAATATTAGATAAGACACCAAAGGTAACGGGTATCGGTGGTGAT
GTTCAACTTTCATCTAATATGGGGGTTATTTTTAATCTTTGCGATAAAGTCGCTCAGAAACGCAAAGACGCTTATATCTC
TTCTGAAATATTCATGCTTGCCGCAATCGAAGATAAAGGGCCTTTGGGTAATTTACTTCGAGAATTAGGATTAACTGAAC
CTAAAATATCTAAGTCAATCGATGAGATCCGTGGTGGCGAAAAAGTTAATGATCAAAATGCTGAAGAAAAACGTCAAGCA
TTAGAAAAGTTTACTGTTGATTTAACTGAGCGAGCAGAGCAAGGAAAACTAGATCCTGTTATTGGTCGTGATGATGAAAT
ACGAAGAACAATTCAGGTATTGCAACGTCGAACTAAAAATAATCCAGTTATAATAGGTCAACCTGGTGTCGGCAAGACAG
CTATTGTTGAAGGGTTAGCGCAACGTATTATAAATGGTGAAGTACCTGAAGGTTTAAAAAATAAACGAGTACTTTCTTTA
GATATTGGTGCATTGGTAGCTGGTGCTAAATTCCGAGGTGAATTTGAAGAGCGTTTAAAAGCGGTTTTAAATGAGCTAGC
GAAAGAAGAAGGTAGTGTCATCCTCTTTATTGATGAAGTACATACAATGGTAGGTGCTGGTAAAGGTGAAGGCTCTATGG
ATGCCGGCAACATGCTTAAACCTGCACTCGCACGTGGTGAACTACATTGTGTCGGAGCAACAACTCTTGATGAATATCGT
CAGTATATTGAAAAAGATCCAGCATTAGAGCGTCGTTTCCAAAAAGTACTTGTTGACGAACCAACAGTTGAAGATACCGT
TGCAATTTTGCGTGGTTTAAAAGAGCGCTATGAAATTCATCATCATGTTGAAATTACTGATCCTGCGATCGTAGCAGCAG
CAAGCCTGTCACATCGTTATATTTCAGATCGTCAATTACCGGATAAAGCGATAGATTTAATTGACGAAGCAGCATCAAGC
ATTCGTATGGAAATTGATTCAAAACCAGAATCATTAGATAAGCTTGATCGTAAGATAATTCAATTAAAGATAGAGCAGCA
AGCACTAGTAAAAGAAAGCGATGATGCGAGCTTAAAGCGTCTTGACTCTTTAAATCTTGAATTAATGCAAAAAGAGCGTG
AATACGCAGAGCTTGAAGAAGTGTGGAAAGCTGAAAAAGCGGCTTTATCTGGCACTCAGCATATAAAAACAGAATTAGAA
ACAGCTCGAAGTAATATGGAAATTGCACGTCGTGCAGGTGATCTAAATAGAATGTCAGAGCTGCAATATGGACGTATTCC
AGAATTAGAGAAACAACTAGATCTTGCCGCTCAAGCTGAAATGCAAGAAATGAGCTTATTAAAAAACAAAGTGACGGATG
CAGAGATAGCAGAAGTGCTTTCACGTCAGACGGGCATTCCTGTAAATAAAATGCTTGAAGGTGAAAGAGATAAGCTATTA
AAGATGGAAGAGGTATTACATCACCGAGTAATAGGCCAGGCTGAAGCTGTCGAAGCTGTCTCAAATGCAATTCGTCGTAG
TCGTGCCGGTCTATCAGATCCAAATCGACCAATCGGTTCATTCTTGTTTTTAGGCCCAACAGGGGTTGGTAAAACGGAAT
TATGTAAATCACTCGCTAACTTTATGTTTGATAGTGAAGATGCGATGGTACGTATCGACATGTCAGAGTTTATGGAGAAA
CATTCTGTTGCTCGTTTAGTAGGTGCGCCTCCTGGTTATGTTGGTTATGAAGAGGGTGGATACTTAACAGAAGCAGTAAG
ACGCAAACCTTATTCAGTTTTACTATTAGATGAAGTAGAAAAAGCACATCCAGATGTATTTAATATTCTACTTCAAGTTC
TTGATGATGGTCGTTTAACTGATGGTCAGGGAAGAACGGTTGATTTTAAAAATACCGTTATCATTATGACATCGAACTTA
GGTTCAGAAAAAATTCAGCAGCACTTTGGTGAGTTGAATTATGGTGGCATCAAAGAAATAGTAATGGATGTTGTGAGTCA
ACATTTTAGACCAGAGTTTTTAAACCGTGTTGATGAAACCGTAGTGTTCCATCCATTGGCACAAGAACACATCAAGAATA
TCGCCTCTATTCAATTACAACGCTTAGAAAAACGTTTGAATGAAAAAGATTACCAATTGCAAGTAACAGACGAAGCGTTA
AACTTAATTGCTGAGGCAGGTTTTGATCCTGTTTATGGAGCAAGACCATTAAAACGAGCAATTCAGACATATATAGAGAA
CCCACTTGCTCAGGATATATTGAGTGGAAAGCTAAGGGTTGGAGAAGTAATTAAGTTGAAAGTTAAAAATGAACAGCTAA
TTGCGACTCAAAATGACGACTTTTGA

Protein sequence :
MRLDRFTSKFQMAISDAQSLALGQDHQYIEPTHLMVALLNQDGSTIRPLLTLLNVDVTQLRSKLTEILDKTPKVTGIGGD
VQLSSNMGVIFNLCDKVAQKRKDAYISSEIFMLAAIEDKGPLGNLLRELGLTEPKISKSIDEIRGGEKVNDQNAEEKRQA
LEKFTVDLTERAEQGKLDPVIGRDDEIRRTIQVLQRRTKNNPVIIGQPGVGKTAIVEGLAQRIINGEVPEGLKNKRVLSL
DIGALVAGAKFRGEFEERLKAVLNELAKEEGSVILFIDEVHTMVGAGKGEGSMDAGNMLKPALARGELHCVGATTLDEYR
QYIEKDPALERRFQKVLVDEPTVEDTVAILRGLKERYEIHHHVEITDPAIVAAASLSHRYISDRQLPDKAIDLIDEAASS
IRMEIDSKPESLDKLDRKIIQLKIEQQALVKESDDASLKRLDSLNLELMQKEREYAELEEVWKAEKAALSGTQHIKTELE
TARSNMEIARRAGDLNRMSELQYGRIPELEKQLDLAAQAEMQEMSLLKNKVTDAEIAEVLSRQTGIPVNKMLEGERDKLL
KMEEVLHHRVIGQAEAVEAVSNAIRRSRAGLSDPNRPIGSFLFLGPTGVGKTELCKSLANFMFDSEDAMVRIDMSEFMEK
HSVARLVGAPPGYVGYEEGGYLTEAVRRKPYSVLLLDEVEKAHPDVFNILLQVLDDGRLTDGQGRTVDFKNTVIIMTSNL
GSEKIQQHFGELNYGGIKEIVMDVVSQHFRPEFLNRVDETVVFHPLAQEHIKNIASIQLQRLEKRLNEKDYQLQVTDEAL
NLIAEAGFDPVYGARPLKRAIQTYIENPLAQDILSGKLRVGEVIKLKVKNEQLIATQNDDF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 1e-164 45
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 2e-101 41
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 1e-101 41
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 5e-102 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB YP_002155324.1 ATP-dependent chaperone ClpB VFG2084 Protein 7e-109 43