Gene Information

Name : Hoch_4016 (Hoch_4016)
Accession : YP_003268408.1
Strain : Haliangium ochraceum DSM 14365
Genome accession: NC_013440
Putative virulence/resistance : Unknown
Product : DNA-directed RNA polymerase subunit beta
Function : -
COG functional category : K : Transcription
COG ID : COG0085
EC number : 2.7.7.6
Position : 5514080 - 5518291 bp
Length : 4212 bp
Strand : -
Note : KEGG: afw:Anae109_2221 DNA-directed RNA polymerase subunit beta; TIGRFAM: DNA-directed RNA polymerase subunit beta; PFAM: RNA polymerase Rpb2 domain 6; RNA polymerase Rpb2 domain 7; RNA polymerase Rpb2 domain 2; RNA polymerase subunit beta; RNA polymerase

DNA sequence :
ATGGCGTCGGTAATCCAGAACAACTTCCGGGTCCGGAAGAGCTTTGCCAAGCTCAAGAAGGTCATCGATATCCCGAACCT
GATCGATATCCAGAAGCGGTCCTACGATAAGTTCCTCCAGATCGATATCCCTGCCGAGGAGCGGGAGGATGTCGGCCTCC
AGGGGGTGTTCAAGAGCGTATTCCCCATCAAGGACTTCTCCGAGACGTCTTCGCTGGAGTTCGTCTCGTACAACCTCGAG
CGTCCCAAGTACGACGTCGACGAGTGTCGGGCCCGCGGGATGACCTTCGCCGCGCCGGTGAAGGTCGTGATCCGCTTGGT
GGTGTGGGACGTCAACGAAGAGACCGGCGTGCAGTCGATCCGCGACGTCAAAGAGCAGGAGGTCTACTTCGGCGAGATCC
CGCTCATGACCGACAGCGGTACCTTCATCATCAACGGTACCGAGCGCGTCATCGTCTCGCAGCTGCACCGCTCGCCCGGT
GTGTTCTTCGACCACGATAAGGGCAAGACCCACTCGTCGGGCAAGCTGCTGTACAGCGCCCGGGTCATCCCGTATCGCGG
CTCGTGGCTCGACTTTGAGTTCGACCACAAGGACATCCTCTACGTCCGCATCGATCGCCGGCGCAAGCTGTACGCCACCG
TGCTGCTGCGCGCGCTCGGCTACTCGACCGAGGATCTGCTCAACTACTTCTACGACACCGAGGTGATCCACATCGAGGGG
CCGCAGAAGTTCTCGCGGACCATCAACTACGACCTGCTGCTCGGCCAGCGCGCGACCCGCGACATCCGTCACCCGGACAG
CCGCGAGATCCTGGTGCGCAAGAACCGCAAGTTCACGCGCGCCGCGATCCGCAAGCTGCGCGACTCGGACATCGAGAAGC
TCACGATCGACCTCGAGGAGCTCGTCGGCAAGGTGTCGGCGCGCGACATCATCGATGAGAGCACCGGCGAGGTGCTGCTG
CAGTGCAACGAGGAGCTGAGCGAGGAGAAGCTCGAGGAGCTGCGCACGCGCGGCGTCGAGCGCTTCGATGTGCTGTTCAT
CGACAACCTCAACGTCGGCCCGTACCTGCGCACGACCCTGCTCGCCGACAAGCTGCAGGGCCCGGAAGAGGCGATCATGG
AGATCTACCGGCGCCTCCGCCCGGGTGATCCGCCGACCATCGACACCGCGCAGAACCTGTTCCAGAACCTGTTCTTCAAC
CCCGAGCGCTACGACCTGTCGCAGGTCGGCCGGCTCAAGCTCAACTACAAGTTCCGCCTCGACGAGTCGCTCGACAACCC
GGTGCTCACCCGGCGCGACATCCTGGAGACGGTGCGCTACCTCATCGAGCTGCGCAACGGCCGCGGCATCATCGACGATA
TCGACCATCTCGGTAACCGTCGCGTGCGCGCCGTCGGCGAGCTGATGGAGAACCAGTACCGCATCGGTCTGGTGCGCATG
GAGCGCGCGATCAAGGAGCGCATGAGCATGTCTCAGGAGATCGAGACGCTCATGCCGCACGACCTGATCAACGCCAAGCC
GGTGTCGGCCGTGGTCAAGGAGTACTTCGGCAGCTCGCAGCTGTCGCAGTTCATGGACCAGACCAACCCGCTGTCCGAGG
TGACGCACAAGCGCCGCCTGTCGGCGCTCGGCCCCGGCGGTCTCACGCGCGAGCGCGCCGGCTTCGAGGTGCGCGACGTG
CACCCGACGCACTACGGCCGCATCTGCCCGATCGAGACGCCGGAAGGTCCCAACATCGGCCTCATCGCCTCGCTCTCGAC
CTATGCGCGGGTCAACCAGTACGGCTTCATCGAGACGCCGTATCGCCGCGTGAACGACGGCAAGGTGACCGAGGAGGTGC
AGTTCTACTCGGCCCTCCAGGAGGAGGGGCAGGTCATCGCCCAGGCCAACGCGGCCCACGACCCGAGCGGCGCCTTCAGC
GAGGACTTCGTGTCCTGCCGCCGCGCCGGCGACGTGTCGATGGTGCGTCCCGAGGACGTCACCCTGATGGACGTCTCGCC
CAACCAGCTCGTGTCCGTGGCCGCGTCGCTGATTCCCTTCCTCGAGCACGACGACGCCAACCGCGCGCTCATGGGATCGA
ACATGCAGCGGCAGGCGGTGCCCCTGGTGCGCACGGCCGCGCCGCTGGTCGGCACCGGCATCGAGAACATCGTCGCCCGC
GACTCGGGCGTCACCGTGGTCGCCAAGCGCGACGGCGTGGTCGAGTCGGTGGACGGCGCGCGCATCGTGATCAAGCCCTT
CGAGACCGACGGGGAAGATTCGCTCGGCGCCAAGCCCGACATCTACAACCTGGTCAAGTTCCAGCGCAGCAACCAGAACA
CCTGCAGCAACCAGAAGCCCATCGTGCGCCGCGGCGATACGGTGCGCGTCGGCGACGTCATCGCCGACGGTCCGGCGACC
GAGTGCGGTGAGCTGGCGCTGGGCCAGAACACGGTGGTCGCGTTCATGCCGTGGGGTGGCTACAACTTCGAGGACTCGAT
CCTCGTCAACGAGCGCCTGGTCAAGAACGACACCTTCACCTCGGTGCACATCGAGGAGTTCGAGTGCGTCGCGCGCGACA
CCAAGCTCGGCAAGGAAGAGATCACGCGCGACATCCCCAACGTCGGCGAGGAGGCGCTCAAGGACCTCGACGACTCGGGC
ATCGTGCGCATCGGCGCCGAGGTCAAGGCCGGCGACATCCTGGTCGGCAAGATCACGCCCAAGGGCGAGACCCAGCTCTC
GCCCGAGGAGAAGCTGCTCCGCGCGATCTTCGGCGAGAAGGCCGGCGACGTCCGCGACACCTCGCTGCGCGTGCCCCCGG
GCGTGAGCGGTGTGGTCATCAACGCCCGCGTGTTCGCGCGCAAGGGCACCGAGAAGGACGACCGCGCCAAGGACATCGAG
GACGCCGAGAAGGAGAAGCTGCTGCTCAACAAGCAGACCGAGATCAAGATCATCTCGGACTCCTACTACGGCAAGATGCG
CAAGCTGCTGGTCGGCAAGACCACGGCCGCGCGTCTGGTCGACGACAAGGGCAAGGTGCTGCTGCCCAAGGGCCAGAAGA
TCGACGCCGCCGCGCTCGACCAGGTGCCCGCGCGCTACTGGCACGAGGTCCAGGCCGAGGGTGACACCAAGGTCGAGGAG
TCGCTCGAGAAGCTGGCCGCGCAGCGCGAAGAGGACGTGCGCCTCATCGAGGAGCAGTACGACGAGAAGATCGGCAAGCT
GACCAAGGGCGACGAGCTGCCTCCGGGCGTGATCAAGCTGGTCAAGGTCTACCTGGCCATCAAGCGCAAGCTCTCGGTGG
GTGACAAGATGGCCGGTCGCCACGGCAACAAGGGTGTGGTCTCGCGCCTGTTGCCCGAGGAGGACATGCCGTACCTGTCC
GACGGGACGCCCGTGGACATCGTGCTCAACCCGCTCGGTGTGCCCTCGCGTATGAACGTCGGCCAGATCCTCGAGACCCA
CCTGGGCTGGGCGGCGCGCGAGATCGGCCGCCAGATCGACATGTACATGGAGACCTCCTGGTCGGCCGATGTGCTGCGCG
AGAAGCTCAAGAAGGTCTTCAACACCGCCCAGGCGCACGAGTTTCTCGACCGCCTGGACAACGAAGATATCGGACGCTTC
GCCACCAAGCTGCGCAAGGGCATCCACTTCGCGACGCCGGTCTTCGACGGCGCCGCCGAGGACGAGATCAAGGCGGCCCT
CAACATGGCCGGCATGCGCCCGAGCGGCCAGTCGCAGCTGTGCGACGGCAAATCCGGCGAGCCCTTCGACAACCCGGTGA
CCGTGGGCGTGATGTACATGCTCAAGCTGCACCACCTGGTGGACGACAAGATCCACGCGCGCAGCATCGGTCCGTACTCG
CTGGTTACGCAGCAGCCGCTGGGCGGCAAGGCCCAGTTCGGCGGTCAGCGTCTCGGCGAGATGGAAGTCTGGGCCATGGA
GGCCTACGGCGCGGCCTACGCGCTGCAGGAGTTCCTCACCGTCAAGAGCGACGACGTGCTCGGCCGTACCCGCATGTACG
AGTCGATCGTCAAGGGCGAGCACGTGCTCGAGGCCGGCTTGCCGGAGTCGTTCAACGTGCTGCTCAAAGAGCTTCAGTCG
CTGTGTCTCGACGTCGAGCTCATCGAGGATCCCTCGGCTCCGCGCAAGCAGGAGCACGCGGGCCCCGGTGTGCCGGCCGG
TCTCGCCGCGCTGGCGCGCGAGGTCGCTGAGAAGGTGGGCGGCGCCCAGTAG

Protein sequence :
MASVIQNNFRVRKSFAKLKKVIDIPNLIDIQKRSYDKFLQIDIPAEEREDVGLQGVFKSVFPIKDFSETSSLEFVSYNLE
RPKYDVDECRARGMTFAAPVKVVIRLVVWDVNEETGVQSIRDVKEQEVYFGEIPLMTDSGTFIINGTERVIVSQLHRSPG
VFFDHDKGKTHSSGKLLYSARVIPYRGSWLDFEFDHKDILYVRIDRRRKLYATVLLRALGYSTEDLLNYFYDTEVIHIEG
PQKFSRTINYDLLLGQRATRDIRHPDSREILVRKNRKFTRAAIRKLRDSDIEKLTIDLEELVGKVSARDIIDESTGEVLL
QCNEELSEEKLEELRTRGVERFDVLFIDNLNVGPYLRTTLLADKLQGPEEAIMEIYRRLRPGDPPTIDTAQNLFQNLFFN
PERYDLSQVGRLKLNYKFRLDESLDNPVLTRRDILETVRYLIELRNGRGIIDDIDHLGNRRVRAVGELMENQYRIGLVRM
ERAIKERMSMSQEIETLMPHDLINAKPVSAVVKEYFGSSQLSQFMDQTNPLSEVTHKRRLSALGPGGLTRERAGFEVRDV
HPTHYGRICPIETPEGPNIGLIASLSTYARVNQYGFIETPYRRVNDGKVTEEVQFYSALQEEGQVIAQANAAHDPSGAFS
EDFVSCRRAGDVSMVRPEDVTLMDVSPNQLVSVAASLIPFLEHDDANRALMGSNMQRQAVPLVRTAAPLVGTGIENIVAR
DSGVTVVAKRDGVVESVDGARIVIKPFETDGEDSLGAKPDIYNLVKFQRSNQNTCSNQKPIVRRGDTVRVGDVIADGPAT
ECGELALGQNTVVAFMPWGGYNFEDSILVNERLVKNDTFTSVHIEEFECVARDTKLGKEEITRDIPNVGEEALKDLDDSG
IVRIGAEVKAGDILVGKITPKGETQLSPEEKLLRAIFGEKAGDVRDTSLRVPPGVSGVVINARVFARKGTEKDDRAKDIE
DAEKEKLLLNKQTEIKIISDSYYGKMRKLLVGKTTAARLVDDKGKVLLPKGQKIDAAALDQVPARYWHEVQAEGDTKVEE
SLEKLAAQREEDVRLIEEQYDEKIGKLTKGDELPPGVIKLVKVYLAIKRKLSVGDKMAGRHGNKGVVSRLLPEEDMPYLS
DGTPVDIVLNPLGVPSRMNVGQILETHLGWAAREIGRQIDMYMETSWSADVLREKLKKVFNTAQAHEFLDRLDNEDIGRF
ATKLRKGIHFATPVFDGAAEDEIKAALNMAGMRPSGQSQLCDGKSGEPFDNPVTVGVMYMLKLHHLVDDKIHARSIGPYS
LVTQQPLGGKAQFGGQRLGEMEVWAMEAYGAAYALQEFLTVKSDDVLGRTRMYESIVKGEHVLEAGLPESFNVLLKELQS
LCLDVELIEDPSAPRKQEHAGPGVPAGLAALAREVAEKVGGAQ

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
BJAB07104_00346 YP_008207811.1 DNA-directed RNA polymerase, beta subunit/140 kD subunit Not tested AbaR25 Protein 0.0 57
BJAB0868_00350 YP_008211676.1 DNA-directed RNA polymerase, beta subunit/140 kD subunit Not tested AbaR26 Protein 0.0 57

• Homologs from CARD and BacMet (resistance genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta NC_002516.2.881699.p Protein 0.0 59
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta CP000034.1.gene3741. Protein 0.0 58
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta CP001138.1.gene4362. Protein 0.0 58
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta NC_002695.1.914942.p Protein 0.0 58
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta CP001918.1.gene250.p Protein 0.0 58
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta CP000647.1.gene4402. Protein 0.0 58
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta NC_011035.1.6448762. Protein 0.0 58
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta CP000675.2.gene392.p Protein 0.0 58
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta CP004022.1.gene2794. Protein 0.0 57
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta NC_009085.4918494.p0 Protein 0.0 57
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta NC_010611.6233925.p0 Protein 0.0 57
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta NC_011595.7060572.p0 Protein 0.0 57
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta NC_010400.5987325.p0 Protein 0.0 57
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta NC_010410.6003841.p0 Protein 0.0 57
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta NC_011586.7046027.p0 Protein 0.0 57
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta NC_008702.1.4609796. Protein 0.0 57
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta CP002695.1.gene18.p0 Protein 0.0 57
Hoch_4016 YP_003268408.1 DNA-directed RNA polymerase subunit beta NC_012469.1.7686402. Protein 0.0 47