PAI Gene Information


Name : bglE (CMM_0099)
Accession : YP_001220838.1
PAI name : chp/tomA region
PAI accession : NC_009480_P1
Strain : Clavibacter michiganensis NCPPB 382
Virulence or Resistance: Not determined
Product : hypothetical protein
Function : -
Note : putative beta-galactosidase/beta-glucuronidase (ZP_00229015.1| COG3250: Beta-galactosidase/beta-glucuronidase [Kineococcus radiotolerans SRS30216]; ZP_00061173.1| COG3250: Beta-galactosidase/ beta-glucuronidase [Clostridium thermocellum ATCC 27405]). Inte
Homologs in the searched genomes :   4 hits    ( 4 protein-level )  
Publication :
    -Gartemann,K.H., Abt,B., Bekel,T., Burger,A., Engemann,J., Flugel,M., Gaigalat,L., Goesmann,A., Grafen,I., Kalinowski,J., Kaup,O., Kirchner,O., Krause,L., Linke,B., McHardy,A., Meyer,F., Pohle,S., Ruckert,C., Schneiker,S., Zellermann,E.M., Puhler,A., Eiche, "The genome sequence of the tomato-pathogenic actinomycete Clavibacter michiganensis subsp. michiganensis NCPPB382 reveals a large island involved in pathogenicity", J. Bacteriol. 190 (6), 2138-2149 (2008) PUBMED 18192381.

    -Gartemann,K.H., Abt,B., Bekel,T., Burger,A., Engemann,J., Flugel,M., Gaigalat,L., Goesmann,A., Grafen,I., Kalinowski,J., Kaup,O., Kirchner,O., Krause,L., Linke,B., McHardy,A., Meyer,F., Pohle,S., Ruckert,C., Schneiker,S., Zellermann,E.M., Puhler,A., Eiche, "Direct Submission", Submitted (23-MAY-2007) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Linke,B., "Direct Submission", Submitted (10-MAY-2007) Linke B., Center For Biotechnology, Bielefeld University, Universitaetsstrasse 25, 33501 Bielefeld, GERMANY.


DNA sequence :
ATGACCGAGGCCGCCCTGCACACGATCGCCATGACTCTCACCGAGTGGGACTTCACGGTGAAGGGCGGGGGAGCCGGACA
ACCGGTTCGTCTACCGCATGACGCGATGATCCACGAGCACCGTGATCCTCGCGCGCCGGGAGGCGCGGACACGGCGTACT
TTCCCGGTGGTGCGTACCGCTACAGCACCAACTGGGACGCACCTGCGGACCGCTCATCGTCGGTCGCTCTGCGCTTCGAA
GGGGTCCAGGGCGACGCGACCGTGACGGTCAACGGGACCGTCGTCGGCTCGATCCGGAGCGGCTACACCGAGTTCGAGTT
CGAGATCGGCGAACACCTCGCTTGGGGAGCGTCGAACACGTTCGTGGTCGACGTCGACAACGCCGCACAACCCACGGGCC
GCTGGTATCCCGGGTCCGGCCTGTACCGCCCTGTCTCGCTCCTCCTGCGACCGGCAGTCCGCTTCGCCCCCGACGGACTA
AAGGTGCGCACGCTGAGCATCACCGCGGCGACCGCCGAGGTCCAGATCGGATACTCGGTGCTCGGCCTCGAGGATCGGAC
AGCGCGCGTCGCCGTGGAGTTGCGCGACGGGAAGACGCTGGTGAGCTCGGCCGACGGAGTCGGTGTCGAGGGGGTGCTCT
CCCTGGTCGTCGACCGGCCCCGTGCCTGGTCCGCCGACTCGCCCCATCTGTACGAGCTGATCGCTCGGGTCGACAGCGGT
GACGCTACCTACGAACGGCGGGAACGGGTGGGGCTGCGCACGATTGCGGTCGATTCCCGGAACGGGCTGCGCATCAACGG
GCGCAGGGAGCTCTTGCGGGGGGCGTGCATGCATCACGACAACGGCGTGCTCGGTGCCGCGACCCATCGTGCGGCGGAGT
ACCGGCGCATCCGCCTGCTCAAGGGAGCGGGGTTCAATGCCGTCCGCAGCGCGCACCACCCGATGTCACGACACCTGCTC
GACGCCTGCGACGAGCTCGGCATGTACGTGGTCGAGGAGCTCGCCGACTACTGGGTGGCGTCGAAGTCCGCGCACGACGC
AGCCGACCGCTTCCACGAGACGTGGCGCGAGGACGCGGACCGCATGATCCGAAAGGACCGCAACCGTCCGTCGGTCATCA
TGTACGCCGCAGGGAACGAGATCCCCGAGACCGCAACCCCACAGGGAGTCGAACTGACGCGCGAGATCACGGCTCATCTT
CATGCCGCGGATCCTGATCGACCTGTGACTCTCGCGATCAACCTGTTCCTGAACACCCTGGTCTCGTTCAACAGGTCGCC
CTACAAGGAGGCCGCCGCCGCCGGCGACGCGGGAACCTCCATGGCCGGGAGCACCGAGGCGAACGTGATGATCAATCAGA
TCGGCCGCATGATGGATGTCGTCTCTCGGCTGCCGCAGGCGGACAAGGCCTCGAGGGACGCCTTCGCCGAAGTGGATGTC
GCCGGATACAACTACGGCATCGGCCGCTACGACCGCGACGTCCGGGCATACCCGGATCGGGTCATCCTCGGGACGGAGAC
GCTGCCCGGTGATGTCGCCCGCGCCTGGGGCCGAGTCCTGAAGCATCCTGCCGTGATCGGCGACTTCGTGTGGGCCGGGT
GGGAATACCTCGGTGAGGCCGGGGTGGCTGTCTGGGTCCCCGGGAAGAAGGCCGGCCTGTCCAAGCCCTACCCGTACCTG
ATCGCCGGGCCCGGGATGTTCGACCTGATCGGGCAGCCCGACATCACCCTCCGCCTCGCTCAATTGGCTTGGGGAGACCT
GCACCGGCCGGCCATAGGGGTGCGACCCCTCGACCGGAGCGGCATGCCCATGGTGCGATCGGCATGGCGCGTGACGGATG
CGGTCGAGAGCTGGTCGTGGCGCGGGTCTGACGGCAGGAAGTCAGAGGTCCAGATCTACTCCGCCGACGACCATGTCGAG
CTGCTCCTCAACGGTCGACGCGTCGGGCGTCGACGTGCCGGACGCCGAAGAGGCTTCCGAGCCGACTTCACCGTCCCGTA
CGAGCCGGGCACCCTCGAGGCTGTCGGGTACCGCAACGGGATCGAGGTGTCGCGGTCATCGCTTCGCAGCGCGACCGGTC
CGTTGCGACTGCGAGTCGAGGCAGAGTCGCGCGACATGATCGCGGACGGCGATGATCTGCTCTTCGCAGAGATCACCGTC
GTCGGCGAGAACGGAGTCGTCGAGATGCTCGCCGACGAGCAGGTCGCTGTCACAGTCGACGGTCCAGGCGAAGTCATCGG
ATTCGGATCGTCGGCACCCTCCTCGGAGGAGTCGTACACGACGGCGACGCACCGCACCTTCCGAGGTCGTGTCCTCGCCG
TGATCCGCTCTACCGGTGCACCGGGGACCGTCACCGTCACCGCTCGTGGCCGCACGCTCGGCACAGCCGAACTCGCTATC
GATGCCCGAGAGCCCGGGCAGGCCCCGCTCCCCGCTTCCCCTCTGAGCGGTGACACCAACCGTCTTGCCACCACTGAACC
GTCCCACTGA

Protein sequence :
MTEAALHTIAMTLTEWDFTVKGGGAGQPVRLPHDAMIHEHRDPRAPGGADTAYFPGGAYRYSTNWDAPADRSSSVALRFE
GVQGDATVTVNGTVVGSIRSGYTEFEFEIGEHLAWGASNTFVVDVDNAAQPTGRWYPGSGLYRPVSLLLRPAVRFAPDGL
KVRTLSITAATAEVQIGYSVLGLEDRTARVAVELRDGKTLVSSADGVGVEGVLSLVVDRPRAWSADSPHLYELIARVDSG
DATYERRERVGLRTIAVDSRNGLRINGRRELLRGACMHHDNGVLGAATHRAAEYRRIRLLKGAGFNAVRSAHHPMSRHLL
DACDELGMYVVEELADYWVASKSAHDAADRFHETWREDADRMIRKDRNRPSVIMYAAGNEIPETATPQGVELTREITAHL
HAADPDRPVTLAINLFLNTLVSFNRSPYKEAAAAGDAGTSMAGSTEANVMINQIGRMMDVVSRLPQADKASRDAFAEVDV
AGYNYGIGRYDRDVRAYPDRVILGTETLPGDVARAWGRVLKHPAVIGDFVWAGWEYLGEAGVAVWVPGKKAGLSKPYPYL
IAGPGMFDLIGQPDITLRLAQLAWGDLHRPAIGVRPLDRSGMPMVRSAWRVTDAVESWSWRGSDGRKSEVQIYSADDHVE
LLLNGRRVGRRRAGRRRGFRADFTVPYEPGTLEAVGYRNGIEVSRSSLRSATGPLRLRVEAESRDMIADGDDLLFAEITV
VGENGVVEMLADEQVAVTVDGPGEVIGFGSSAPSSEESYTTATHRTFRGRVLAVIRSTGAPGTVTVTARGRTLGTAELAI
DAREPGQAPLPASPLSGDTNRLATTEPSH