Gene Information

Name : rpoB (WGLp522)
Accession : NP_871525.1
Strain : Wigglesworthia glossinidia endosymbiont of Glossina brevipalpis
Genome accession : NC_004344
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : K : Transcription
COG ID : COG0085
EC number : -
Position : 599101 - 603129 bp
Length : 4029 bp
Strand : +
Note : Transcription
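
The Position and Length fields above are related by simple inclusive-coordinate arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(599101, 603129)

# A protein-coding gene should span a whole number of codons.
codons = length_bp // 3

# Assumption: the last codon is a stop codon and is not translated.
residues = codons - 1

print(length_bp, codons, residues)  # 4029 1343 1342
```

Under these assumptions the computed length agrees with the Length field (4029 bp), and the gene would encode a protein of 1342 residues.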

DNA sequence :
ATGGTTTATTCTTATACGGAAAGAAAAAGAATTAGGAAAGACTTTGGAAAACGTCCGCAAGTTTTAGATATACCATATTT
ACTTGCTATACAAATTAATTCATTTAAAAAATTTATAGAAAAAGATCCAGAAGGTTTGTATGGATTAGAAGCAGCATTTA
AATCCATTTTTCCTATTAAAAGTTATAGTGGAAATGCAGAATTAAAATATATAAGTTATAGACTTGGATGTTCTGTTTTT
AATGTAAAAGAATGCCAAACAAGAGGAACAACTTTTTCTGCTCCTTTAAGAGTAATATTACAATTAATAATTTATTGCGA
TGAATCAAAACATGTTGTAAAAAATATTAAAGAACAAGAAGTATATATGGGAGAAATTCCATTAATGACAGATAATGGAA
CATTTATAATAAATGGAACAGAAAGAGTTGTAGTTTCTCAATTGCATCGTAGTCCAGGTGTATTTTTTGACAGTGATAAG
GGTAAAACCCACTCTTCTGGAAAGGTATTGTATAATGCAAGAATTATCCCATATAGAGGATCTTGGTTAGATTTTGAATT
TGATGCAAAAGATCATTTATTTATTAGAATTGATAGAAGAAGAAAATTACCTGTAACAGTTCTTTTAAAAGCGTTAAATT
TTAGTGATGGTGAAATTTTAAATATATTTTTTGAAAAGGTTAATTTTTTTATTAGAAAAAAAAATTTATTAATGGAATTG
ATACCAAAAAGATTAAGAGGAGAAACAGCTCTATTTGATATATGTGAAAATGGAATAACATATATAAAAAAGGGTCGTAG
AATAACTGCAAAACATATTAGAAATCTAGAAAATGATAAAATATCTCAAATAACAGTTCCATTTGAATATATTATAGGAA
AAGTTTCTGCTAAAAATTATTTTGACAAAAAAACAGAAAAACCAATTATTACAGCGAATACAGAACTTACTGTAGACTTA
ATGCTTAATTTATTTAAATCTGGATATAAAAGCATCGAGACTTTGTTTACAAATGATTTAGATCATGGATCATATATCTC
AGAAACATTAAGAATTGATTCTACTACAGATAAAACTAGTGCTTTAATAGAAATATATCGTATGATGAGACCAGGAGAGC
CTCCTACTAAAGAGGCTGCAGAAAATTTATTTTATAATTTATTCTTCTCAGAAGATAGATATGATTTGTCTTCTGTAGGA
AGAATGAAATTTAACAAATCTTTATCAATAAATTCATCTGAAGGTTCAAGTTTATTGGATAAATTTGATATCATAGAAGT
AACAAAAAAATTGATAGATATAAGAAATGGTAAAGGAGACGTAGATGACATTGATCATCTAGGAAATAGAAGAATTAGAT
CAGTAGGAGAGATGGCTGAGAATCAATTCAGAATAGGATTAGTTAGAGTGGAACGTGCAGTTAAAGAAAGGTTATCTTTA
GGAGATCTAGATACAATAATGCCACAAGACATGATTAATGCTAAGCCCATTTCAGCTGCAGTAAAAGAATTCTTTGGATC
TAGTCAATTGTCACAATTCATGGATCAAAATAATCCATTATCAGAAATTACTCATAAACGTCGAATATCGGCTCTAGGTC
CTGGAGGATTAACTAGAGAAAGAGCTGGTTTTGAAGTAAGAGATGTGCATCCTACACATTATGGTAGAGTATGTCCAATA
GAAACTCCAGAAGGTCCAAATATTGGATTAATCAATTCTCTATCTGTTTATGCTAGAACAAATGAATATGGTTTTTTAGA
AACTCCATACAGATGTGTTTTAAATGGAATAGTGACTAACAATATACATTATCTATCTGCGATAGAAGAAGGAAAATTTA
TTATAGCTCAAGCAAATACTAACTTGGATAAAAATGGATATTTTATAAATGAATTTGTTACTTGTAGAAACAAAGGAGAA
TCAAGTTTATTCAACAGAAATCAAGTAAATTATATGGATGTATCTACTCAACAAATAGTTTCAGTTGGAGCATCTTTAAT
TCCTTTTTTAGAACATGATGATGCAAATAGAGCTTTAATGGGAGCTAATATGCAAAGACAAGCAGTTCCAACTTTAATGA
CAGAAAAACCATTGATAGGAACTGGTATGGAAAGAGCAGTAGCAGTAGATTCTGGTGTAACTGCAGTAGCAAAAAGAGGG
GGAATAGTTCAATTTTTAGATTCATCCAAAATAATAATAAAGGTCAACCAAGAAGAAATAATAAAAGAAAAAATAGGGAT
AGATATTTACCATTTAACAAAATATGTTAGATCAAATCAAAATACATGCATAAATCAAACACCTTGTGTATGCTTAAATG
ATGTAGTAGAACGTGGAGACGTTTTAGCAGACGGTCCATCTACTGATTTAGGAGAGCTAGCCTTAGGTCAAAATATGAGA
ATAGCATTTATGCCGTGGAATGGATATAATTTTGAAGATTCTATGTTAGTTTCCGAAAAAGTTGTTCATGAAGATAGATT
TACAACAATACATATTCAAGAATTAGCCTGTATGTCTAGAGATACCAAATTAGGATCAGAAGAAATAACCTCAGATATTC
CAAATGTAAGTGAAACTTCTTTGTTGAAATTAGATGAATCTGGAATTGTTTATATTGGAGCAGAAGTAAAAGGAGGAGAT
ATATTGGTAGGAAAAGTTACTCCTAAAGGCGAAACTCAATTAACTCCAGAAGAAAAATTATTACGAGCTATTTTTGGAGA
AAAAGCTTCAGATGTAAAAGACTCATCATTAAGAGTCCCAAATGGAGTTTCAGGAACAGTTATAGATGTAGAAATATTCA
CTAGAGATGGAGTTAAAAAAGATAAAAGAGCTTTAGAGATTGAGTATATGCAAATCAAAGAAGCAAAAAAAGATATATAT
GAAGAACTAGAAATTTTTAAATCTAGCTTAAAAATACAAATAGAATATTTTCTAAAAGAAAACAATATAGAATATGATTC
TCTGTCAGAATTATTAAAAGGAAATATAAAAAATTTAATATTTAAAAATAATAATTTAAATAATATATTTGAAGAACTAA
TAAATAAATTTTTGAGATTAAAAGAAGAATTTGAAAAAAAATTAGAAATAAAAATAAAAAAAATTACTCAAGGAGACGAT
TTAGCACCAGGAGTTTTAAAAATTGTTAAAGTTTACTTGGCCGTAAAACGGCAAATACAGCCAGGTGATAAAATGGCAGG
ACGTCATGGAAACAAAGGAGTTATTTCTAAAATAAATCCAATAGAAGATATGCCTTATGATGAGAACGGAGTTCCTGTAG
ACATGGTATTAAATCCTTTAGGAGTTCCTTCTAGAATGAATATTGGACAAATTTTAGAGACTCATTTAGGACTAGCTGCA
AAAGGAATAGGAAATATTATAGATAATATGTTAAAAAATAACAAAAAAATTTATAAAATAAAAAAATTTATACAAAATGC
ATATAATTTAGGGATTGGAATTAGGCAAAAGGTTGAATTAGATCATTTTTCTGATAAAGAAATTATTAAACTTGCAAATA
ATCTTAGAAAAGGAATGCCTATAGCTACTCCTGTATTTGATGGAGCTCAAGAAATAGAAATAAAAGAACTTTTGAAATTT
AGTGGAAATCCTGAATCTGGTCAAATTACATTATTTGATGGACAAACAGGTGAAAAGTTTGATAGACCAGTTACTGTAGG
ATATATGTATATGTTAAAATTAAATCATTTGGTTGACGATAAAATGCATGCTAGATCTACTGGATCTTATAGCTTAGTAA
CCCAACAACCATTGGGTGGAAAAGCCCAGTTTGGAGGGCAAAGATTTGGAGAAATGGAAGTATGGGCTTTAGAAGCATAT
GGAGCTGCTTATTCTTTACAAGAAATGCTTACAGTAAAATCAGATGACGTTAATGGGCGTACTAAGATGTATAAAAATAT
AGTAGATGGCAGCCATTTAATGGAACCTGGAATGCCAGAATCTTTTAATGTACTTTTAAAAGAAATTCGTTCTTTAGGAA
TTAATATTGAATTAGAAGAAAATAATTAA

Protein sequence :
MVYSYTERKRIRKDFGKRPQVLDIPYLLAIQINSFKKFIEKDPEGLYGLEAAFKSIFPIKSYSGNAELKYISYRLGCSVF
NVKECQTRGTTFSAPLRVILQLIIYCDESKHVVKNIKEQEVYMGEIPLMTDNGTFIINGTERVVVSQLHRSPGVFFDSDK
GKTHSSGKVLYNARIIPYRGSWLDFEFDAKDHLFIRIDRRRKLPVTVLLKALNFSDGEILNIFFEKVNFFIRKKNLLMEL
IPKRLRGETALFDICENGITYIKKGRRITAKHIRNLENDKISQITVPFEYIIGKVSAKNYFDKKTEKPIITANTELTVDL
MLNLFKSGYKSIETLFTNDLDHGSYISETLRIDSTTDKTSALIEIYRMMRPGEPPTKEAAENLFYNLFFSEDRYDLSSVG
RMKFNKSLSINSSEGSSLLDKFDIIEVTKKLIDIRNGKGDVDDIDHLGNRRIRSVGEMAENQFRIGLVRVERAVKERLSL
GDLDTIMPQDMINAKPISAAVKEFFGSSQLSQFMDQNNPLSEITHKRRISALGPGGLTRERAGFEVRDVHPTHYGRVCPI
ETPEGPNIGLINSLSVYARTNEYGFLETPYRCVLNGIVTNNIHYLSAIEEGKFIIAQANTNLDKNGYFINEFVTCRNKGE
SSLFNRNQVNYMDVSTQQIVSVGASLIPFLEHDDANRALMGANMQRQAVPTLMTEKPLIGTGMERAVAVDSGVTAVAKRG
GIVQFLDSSKIIIKVNQEEIIKEKIGIDIYHLTKYVRSNQNTCINQTPCVCLNDVVERGDVLADGPSTDLGELALGQNMR
IAFMPWNGYNFEDSMLVSEKVVHEDRFTTIHIQELACMSRDTKLGSEEITSDIPNVSETSLLKLDESGIVYIGAEVKGGD
ILVGKVTPKGETQLTPEEKLLRAIFGEKASDVKDSSLRVPNGVSGTVIDVEIFTRDGVKKDKRALEIEYMQIKEAKKDIY
EELEIFKSSLKIQIEYFLKENNIEYDSLSELLKGNIKNLIFKNNNLNNIFEELINKFLRLKEEFEKKLEIKIKKITQGDD
LAPGVLKIVKVYLAVKRQIQPGDKMAGRHGNKGVISKINPIEDMPYDENGVPVDMVLNPLGVPSRMNIGQILETHLGLAA
KGIGNIIDNMLKNNKKIYKIKKFIQNAYNLGIGIRQKVELDHFSDKEIIKLANNLRKGMPIATPVFDGAQEIEIKELLKF
SGNPESGQITLFDGQTGEKFDRPVTVGYMYMLKLNHLVDDKMHARSTGSYSLVTQQPLGGKAQFGGQRFGEMEVWALEAY
GAAYSLQEMLTVKSDDVNGRTKMYKNIVDGSHLMEPGMPESFNVLLKEIRSLGINIELEENN
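
The relationship between the DNA and protein sequences above can be checked with a short translation sketch. It assumes the standard genetic code (whose amino-acid assignments match the bacterial table) and reading frame 1; `CODON_TABLE` and `translate` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumption: no
# non-standard codon usage in this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the DNA sequence listed in this record.
print(translate("ATGGTTTATTCTTATACGGAAAGAAAAAGA"))  # MVYSYTERKR
print(translate("GAAGAAAATAATTAA"))                 # EENN
```

Both fragments match the start (MVYSYTERKR) and end (EENN) of the listed protein sequence; the full DNA sequence can be passed to `translate` in the same way after its line breaks are removed.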

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
BJAB07104_00346 | YP_008207811.1 | DNA-directed RNA polymerase, beta subunit/140 kD subunit | Not tested | AbaR25 | Protein | 0.0 | 65
BJAB0868_00350 | YP_008211676.1 | DNA-directed RNA polymerase, beta subunit/140 kD subunit | Not tested | AbaR26 | Protein | 0.0 | 65

• Homologs from CARD and BacMet (resistance genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-value | Identity (%)
rpoB | NP_871525.1 | hypothetical protein | NC_002695.1.914942.p | Protein | 0.0 | 79
rpoB | NP_871525.1 | hypothetical protein | CP001138.1.gene4362. | Protein | 0.0 | 79
rpoB | NP_871525.1 | hypothetical protein | CP004022.1.gene2794. | Protein | 0.0 | 79
rpoB | NP_871525.1 | hypothetical protein | CP001918.1.gene250.p | Protein | 0.0 | 79
rpoB | NP_871525.1 | hypothetical protein | CP000034.1.gene3741. | Protein | 0.0 | 79
rpoB | NP_871525.1 | hypothetical protein | CP000647.1.gene4402. | Protein | 0.0 | 79
rpoB | NP_871525.1 | hypothetical protein | NC_002516.2.881699.p | Protein | 0.0 | 68
rpoB | NP_871525.1 | hypothetical protein | CP000675.2.gene392.p | Protein | 0.0 | 66
rpoB | NP_871525.1 | hypothetical protein | NC_010400.5987325.p0 | Protein | 0.0 | 65
rpoB | NP_871525.1 | hypothetical protein | NC_010611.6233925.p0 | Protein | 0.0 | 65
rpoB | NP_871525.1 | hypothetical protein | NC_011595.7060572.p0 | Protein | 0.0 | 65
rpoB | NP_871525.1 | hypothetical protein | NC_010410.6003841.p0 | Protein | 0.0 | 65
rpoB | NP_871525.1 | hypothetical protein | NC_011586.7046027.p0 | Protein | 0.0 | 65
rpoB | NP_871525.1 | hypothetical protein | NC_009085.4918494.p0 | Protein | 0.0 | 64
rpoB | NP_871525.1 | hypothetical protein | NC_008702.1.4609796. | Protein | 0.0 | 63
rpoB | NP_871525.1 | hypothetical protein | CP002695.1.gene18.p0 | Protein | 0.0 | 63
rpoB | NP_871525.1 | hypothetical protein | NC_011035.1.6448762. | Protein | 0.0 | 61
rpoB | NP_871525.1 | hypothetical protein | NC_012469.1.7686402. | Protein | 0.0 | 45