Gene Information

Name : AGROH133_13063 (AGROH133_13063)
Accession : YP_004444361.1
Strain :
Genome accession: NC_015508
Putative virulence/resistance : Virulence
Product : ATP-dependent Clp protease, ATP-binding subunit
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : 3.4.21.92
Position : 1682777 - 1685401 bp
Length : 2625 bp
Strand : +
Note : ATPase family associated with various cellular activities (AAA); ATPases with chaperone activity, ATP-binding subunit

DNA sequence :
ATGAATATTGAAAAATACTCCGAGCGCGTTCGCGGTTTTCTGCAATCGGCGCAGACGTTTGCGCTTGCGGAAAATCACCA
GCAGTTTTCTGCCGAACATGTGCTGAAAGTTCTGCTTGATGACGAGCAGGGCATGGCAGCATCGCTGATCGAGCGGGCTG
GCGGCGACGCCAAGGAAGTGCGCTTGGCCAACGATGCGGCACTGGCGAAATTGCCCAAGGTTTCCGGCGGCAATGGCGGC
CTTTCCCTTACGGCTCCACTTGCGAAAGTGTTCTCGACTGCGGAAGAACTCGCGAAGAAGGCTGGTGACAGTTTCGTCAC
CGTCGAGCGTCTTTTGCAGGCGCTGGCGATCGAAAACTCTGCTTCGACCTCGGCGTCCCTGAAAAAGGGCGGCGTGACGG
CGCAGGCGCTCAATCAGGTCATCAATGAAATCCGCAAGGGCCGCACGGCTGACAGCGCCAATGCCGAACAGGGCTTCGAC
GCGCTGAAGAAGTTCGCGCGCGACCTGACGGAGGAAGCCCGCGAAGGCAGGCTCGACCCGGTGATTGGCCGTGACGACGA
AATCCGCCGGACCATTCAGGTGCTTTCACGCCGCACCAAGAACAATCCCGTGCTGATCGGTGAACCCGGCGTCGGTAAAA
CGGCGATTGCCGAAGGTCTTGCGCTGCGCATCGTCAATGGCGATGTGCCCGAAAGCCTTAAAGACAAGAAGCTTATGGCG
CTGGATATGGGTGCGCTGATCGCTGGCGCGAAGTATCGCGGCGAGTTCGAGGAGCGTCTGAAGGCGGTCCTGAACGAAGT
GCAGGCGGAAAATGGCGGCATTATCCTGTTCATCGACGAGATGCACACGCTGGTCGGTGCCGGCAAGGCCGATGGCGCCA
TGGATGCGTCCAATCTGCTGAAGCCCGCCCTTGCCCGAGGTGAACTGCACTGCGTTGGCGCCACCACGCTGGATGAATAT
CGCAAGCACGTGGAAAAGGATCCGGCCCTTGCCCGTCGTTTCCAGCCGGTGCTCGTAGATGAGCCGACCGTTGAGGACAC
GATCTCGATCCTGCGCGGCCTGAAGGAAAAATACGAACAGCACCACAAGGTCCGTATTTCGGATTCGGCCCTAGTTGCAG
CTGCAACGCTTTCCAACCGCTACATTACCGACCGGTTCCTGCCCGACAAGGCAATCGACCTGATGGACGAAGCCGCTTCG
CGCCTTCGCATGCAGGTGGATTCCAAGCCGGAAGAACTGGACGAACTGGATCGCCGGATCATTCAGCTCAAGATCGAACG
CGAAGCCTTGAAGCAGGAGACGGATCAGTCATCCGTCGACCGCCTGAAGAAGCTCGAGGACGAACTGGCCGATACCGAAG
AAAAGGCCGATGCGCTGACGGCCCGCTGGCAGGCGGAAAAGCAGAAGCTGGGCCATGCCGCAGACCTCAAGAAGCGGCTG
GACGATGCCCGCAACGAACTGGCGAGTGCCCAGCGTAACGGCCAGTTCCAGCGCGCCGGTGAATTAACCTATGGCATCAT
TCCGGGCCTCGAAAAGCAACTGGCTGCAGCGGAAGCGCGTGATAGCAGCGGTGCTGGCTCGATGGTTCAGGAAGTGGTGA
CGGCGGACAATATCGCCCACATCGTTTCCCGCTGGACCGGCATTCCTGTCGACAAGATGCTGGAAGGCCAGCGCGAGAAG
CTGCTGCGCATGGAAGACGATCTTGCCAAGTCCGTGGTCGGGCAGGGCGAGGCCGTGCAGGCCGTTTCCAAGGCGGTTCG
CCGTTCGCGCGCCGGTCTTCAGGATCCGAACCGGCCGATCGGCTCGTTCATTTTCCTCGGCCCGACGGGCGTGGGCAAGA
CCGAGCTGACCAAGTCGCTCGCCCGGTTCCTGTTCGACGACGAGACCGCGATGGTTCGCCTCGACATGTCGGAATTCATG
GAGAAACACTCCGTTGCCCGGCTTATCGGTGCGCCTCCCGGCTATGTCGGTTACGAAGAGGGCGGTGCGTTGACGGAAGC
GGTTCGCCGTCGGCCCTATCAGGTCGTGCTGTTTGACGAGATCGAGAAAGCGCATCCGGACGTGTTCAACGTCCTGTTGC
AGGTGCTGGACGATGGCCGTCTGACCGATGGCCAGGGCCGCACCGTCGATTTCAAGAACACCATCATCATCATGACTTCG
AACCTCGGTTCGGAATTCATGACGCAGATGGGCGACAATGACGATGTGGATTCGGTTCGCGACCTGGTGATGGAACGGGT
CCGGTCGCATTTCCGGCCGGAATTCCTCAACCGTATCGACGATATCATCCTCTTCCATCGCCTGCGGCGCGACGAGATGG
GTGCGATCGTGGAGATCCAGCTCAAGCGCCTCGTCTCGCTTCTGGGTGATCGCAAGATCACGCTCGAACTGGATGAGGAT
GCCCGCAACTGGCTTGCCAATAAGGGTTACGATCCGGCCTATGGCGCACGTCCGCTGAAGCGTGTGATCCAGAAGACCGT
TCAGGACAGGCTCGCCGAAATGATTCTCGGCGGCGAGATCCCGGACGGATCGCGGGTCAAGGTGACCTCTAGCACCGACC
GGCTGCTTTTCAAGGTCAAGCCCCCAAAGGGTGAGGCCGAGACTGAAACCGCCGATGCGGCATAA

Protein sequence :
MNIEKYSERVRGFLQSAQTFALAENHQQFSAEHVLKVLLDDEQGMAASLIERAGGDAKEVRLANDAALAKLPKVSGGNGG
LSLTAPLAKVFSTAEELAKKAGDSFVTVERLLQALAIENSASTSASLKKGGVTAQALNQVINEIRKGRTADSANAEQGFD
ALKKFARDLTEEAREGRLDPVIGRDDEIRRTIQVLSRRTKNNPVLIGEPGVGKTAIAEGLALRIVNGDVPESLKDKKLMA
LDMGALIAGAKYRGEFEERLKAVLNEVQAENGGIILFIDEMHTLVGAGKADGAMDASNLLKPALARGELHCVGATTLDEY
RKHVEKDPALARRFQPVLVDEPTVEDTISILRGLKEKYEQHHKVRISDSALVAAATLSNRYITDRFLPDKAIDLMDEAAS
RLRMQVDSKPEELDELDRRIIQLKIEREALKQETDQSSVDRLKKLEDELADTEEKADALTARWQAEKQKLGHAADLKKRL
DDARNELASAQRNGQFQRAGELTYGIIPGLEKQLAAAEARDSSGAGSMVQEVVTADNIAHIVSRWTGIPVDKMLEGQREK
LLRMEDDLAKSVVGQGEAVQAVSKAVRRSRAGLQDPNRPIGSFIFLGPTGVGKTELTKSLARFLFDDETAMVRLDMSEFM
EKHSVARLIGAPPGYVGYEEGGALTEAVRRRPYQVVLFDEIEKAHPDVFNVLLQVLDDGRLTDGQGRTVDFKNTIIIMTS
NLGSEFMTQMGDNDDVDSVRDLVMERVRSHFRPEFLNRIDDIILFHRLRRDEMGAIVEIQLKRLVSLLGDRKITLELDED
ARNWLANKGYDPAYGARPLKRVIQKTVQDRLAEMILGGEIPDGSRVKVTSSTDRLLFKVKPPKGEAETETADAA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 8e-177 45
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 5e-113 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
AGROH133_13063 YP_004444361.1 ATP-dependent Clp protease, ATP-binding subunit VFG2084 Protein 6e-116 44
AGROH133_13063 YP_004444361.1 ATP-dependent Clp protease, ATP-binding subunit VFG2076 Protein 9e-128 43