Gene Information

Name : AGROH133_09374 (AGROH133_09374)
Accession : YP_004442882.1
Strain :
Genome accession: NC_015508
Putative virulence/resistance : Virulence
Product : ATP-dependent Clp protease, ATP-binding subunit
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 39434 - 42103 bp
Length : 2670 bp
Strand : -
Note : ATPase family associated with various cellular activities (AAA); ATPases with chaperone activity, ATP-binding subunit

DNA sequence :
ATGTCGCATATCGATCTCAACCGCCTTGTTGGAGCGCTTGAGCCAGTCCTGCGTGTTACCCTTGAAGCTGCGGCTTCTGT
TGCGGTGCGCATGGGGCACAGATACGTGGATATCCCGCATTGGCTGCTTGCGGTTGTGGACTCCGGCATCTATGCGAAAA
CATTCGAGGAATTAAAAATTCCACTTCCGGTATTGCAGGCTGAAATCAGCCGCAGTCTGGAGGAGTCCATCATCGGCGAC
GGAGAGGCATTGTCGCTCTCCCAGAACATCCTCACGGCAGCCCGGGAGGCATGGATTCTCGCATCGCTGGAGGCTGGCCG
CGACCGTGTTACGCTTTGCGATCTTCTGTTGGCGATGGATGAGGAAACCTCACTTCGCTCTTTCGTCCGCTCGGCATTTC
CTTCTCTCAAGGCGATGGATCGCACCGCGCTTGAGCGTCTTCGCAACTCGACTGAAAATGGCGCAGGCGTGGATGTTCCT
TCCGCGCTGGCAGGCTCAGGGCAGGCAGGGACAGGACAGTCTGCAGGCCAGAATGATTTCTTGCGCCTTTACACCCAGGA
CATGACCGCAGATGCCCGCAATGGCAAGGTTGATCCTGTTATCGGCCGCGATGATGAGTTACGCCAGCTCGTTGATATTC
TAACGCGCCGCCGACAGAACAATCCCATCCTGGTAGGTGAGGCGGGGGTTGGCAAAACGGCGGTTGCCGAGGCGCTGGCG
CTGGAAATCGCGTCTGGCAACGTTCCGGAAAAGTTGCGCAATGTCCGCCTGCTGAACCTCGATATTTCGCTACTTCAGGC
CGGTGCCGGCGTTAAGGGTGAGTTTGAGCGCCGCCTTCATGGCGTAATCGATGCGGTCAAACGTTCTACCGAACCGGTCA
TCCTGTTTATCGATGAGGCCCACGGTCTCGTTGGTGCGGGTGGTGCGGCCGGGCAGGGCGATGCGGCCAATATTCTCAAG
CCCGCTCTGGCGCGGGGTGAAGTGCGCACCGTCGCGGCGACGACCTGGAGCGAATACAAGAAGTATTTCGAAAAGGATGC
GGCGCTGACCCGTCGTTTCCAGCCTGTACATGTGCGCGAGCCGGATGAGACGACCGCCATACGTATGTTGCGCGGTGTCG
CTGACAGCTTCGTCAGCCATCACAACGTTGTCGTTCGTGACGAGGCGGTCGTGGCCGCGGTGCAATTATCCGCACGCTAC
ATGCCCGCGCGGCAGTTGCCGGACAAGGCCGTCAGCCTTCTCGATACGGCCGCGGCTGCCGTCTCACTGGCACGGCAGAC
CGAGCCGGAGCGGCTGCGCGCTATGGAAAGCGAGCGGCGCCTTCTCACCGATGAGTTGAACTGGCTTCTGCGTGAACCGC
AGGATGGCGAGATTGACAGCCGTATCCAGTCGATACGGGGCGAAGTCGAAAAGCTTGAGAGTGGGATCGATGATCTGCGC
AGCCGTTACGACGCGGAGATGGCTGAGCTAAGCGAAGAGCAGCCGGTTGAAGGCGATGTTTCCAATGTCTCTCCCCTGCG
GCCTGCGGTTGAGGCAAAACCGGCAAATGCCGAGAGACTTGTCCCGACGGTCGTTGATCGTGAGGCGATCGCCGCCGTCG
TTTCGCGATGGACCGGAATTCCACTTGGCAAACTTCTGGCGGACCAGATCGAAAGTGCCCGGACACTCGATGCGCGTATG
CGGCAGCGGGTGGTGGGGCAGGATGCTGCAATTGCACGGATTGCCGATGCCATGCGCACTGCGCGGGCCGGATTATCCGA
TCCGCGCCGCCCGCCGGCCGTTTTTTTCCTTGTTGGCATGTCCGGAACAGGAAAGACCGAAACCGCGCTGTCGCTCGCCG
ATATGCTCTACGGCGGCAACAGCCATTTGACGACGATCAACATGTCGGAGTTCAAGGAAGAGCATAAGGTTTCGCTGCTT
CTCGGTTCGCCGCCCGGCTATGTCGGTTTCGGCGAAGGCGGCGTGTTGACGGAAGCGGTGCGCCGTCGGCCGTTCGGCGT
TCTGCTCTTGGATGAGATCGACAAGGCTCACCCCGGCGTCCAGGACATTTTCTATCAGGTGTTTGACAAGGGCGTGTTGC
GTGATGGCGAAGGCCGCGACGTGGATTTCAAGAACACCACTATCTTCATGACGGCCAATACAGGCTCGGAGTTGCTCTCG
GCGCTGTCGGCCGATCCCGACACGATGCCGGAAGGCGAAGCGCTGGAAGCGCTTCTGATGCCGGAGCTTACCAAACAGTT
CAAGCCCGCGTTCCTCGGGCGCACGATCATTCTTCCCTTCATGCCGCTTGGTACGGAGGCGCTTGCAAGGATCGTGGATA
TGCAGATTGGCAAGATCAGGGAACGTGTGCTCGCAACATATGGCACCGGCCTGACCCTGTCCAATGTGGCGCGAGATGCC
TTGGTGGCGCGCGCCGGTGCAAGTGAGATTGGCGCACGCGCCATTGAAATCATGATCGGCAAGGACCTCCTGCCGCCTCT
TTCCAGCTTTTTCCTCGAAAAGGTAATCGCGGGAGAGCGTGTCGGGAACATTGTTGTCGATTTTGGTGAAAACGGGTTCG
GCATTCGTGCCGAAGAGGCTGGGGAAGCAGATGAATTCGCCGTAACCGAAGAGGTTGGGGTGGATAAAGTTGCCGCGTCG
GATGGGGCTACCCGACGCATGCGGCATTAA

Protein sequence :
MSHIDLNRLVGALEPVLRVTLEAAASVAVRMGHRYVDIPHWLLAVVDSGIYAKTFEELKIPLPVLQAEISRSLEESIIGD
GEALSLSQNILTAAREAWILASLEAGRDRVTLCDLLLAMDEETSLRSFVRSAFPSLKAMDRTALERLRNSTENGAGVDVP
SALAGSGQAGTGQSAGQNDFLRLYTQDMTADARNGKVDPVIGRDDELRQLVDILTRRRQNNPILVGEAGVGKTAVAEALA
LEIASGNVPEKLRNVRLLNLDISLLQAGAGVKGEFERRLHGVIDAVKRSTEPVILFIDEAHGLVGAGGAAGQGDAANILK
PALARGEVRTVAATTWSEYKKYFEKDAALTRRFQPVHVREPDETTAIRMLRGVADSFVSHHNVVVRDEAVVAAVQLSARY
MPARQLPDKAVSLLDTAAAAVSLARQTEPERLRAMESERRLLTDELNWLLREPQDGEIDSRIQSIRGEVEKLESGIDDLR
SRYDAEMAELSEEQPVEGDVSNVSPLRPAVEAKPANAERLVPTVVDREAIAAVVSRWTGIPLGKLLADQIESARTLDARM
RQRVVGQDAAIARIADAMRTARAGLSDPRRPPAVFFLVGMSGTGKTETALSLADMLYGGNSHLTTINMSEFKEEHKVSLL
LGSPPGYVGFGEGGVLTEAVRRRPFGVLLLDEIDKAHPGVQDIFYQVFDKGVLRDGEGRDVDFKNTTIFMTANTGSELLS
ALSADPDTMPEGEALEALLMPELTKQFKPAFLGRTIILPFMPLGTEALARIVDMQIGKIRERVLATYGTGLTLSNVARDA
LVARAGASEIGARAIEIMIGKDLLPPLSSFFLEKVIAGERVGNIVVDFGENGFGIRAEEAGEADEFAVTEEVGVDKVAAS
DGATRRMRH

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 5e-134 45
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 3e-111 44
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 2e-111 44
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 1e-96 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
AGROH133_09374 YP_004442882.1 ATP-dependent Clp protease, ATP-binding subunit VFG2084 Protein 3e-119 47