Gene Information

Name : BURPS1106A_A0143 (BURPS1106A_A0143)
Accession : YP_001074186.1
Strain :
Genome accession: NC_009078
Putative virulence/resistance : Unknown
Product : Rhs element Vgr protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3501
EC number : -
Position : 128568 - 131258 bp
Length : 2691 bp
Strand : +
Note : identified by match to protein family HMM PF04524; match to protein family HMM TIGR01646

DNA sequence :
ATGCCAAACCATTTTTCGAACGGACGGACGAATCAAAGCCGCACGGTAGTGATCCGCAGCGGTGCGATGCCGCGGCTGCT
CGGTCAGCCCGCGCTCGAGTTCCTGTCGCTGCGCGGTGAAGAGCACCTCGGAAAACTCTACACGTACGAATTGCTCCTGC
GCACGCCGGACGATTTTCATGTGCCGTTGGCAACGAGCGCGAATCTCGACCTGAAGGCGATGATCGGCACGGAGATGACG
GTCTGCATTCAGCTCGACGGAATCGGGACGGGCGCGCAAGGCGGCGTTGGCGCGGGTGCGCGCGAAATCAGCGGGCTCGT
GGTCAAGGCGGGCTTCCTGCGCTGCGAGGGGCGCTACAACGTCTATCGCATCGAGCTGCGCCCCTGGCTGTGGCTCGCGA
CTTTGACGAGCGACTACAAGATTTTTCAGGACAAGAGCGTCGTCGAAATCATCGATACGGTCTTGCACGATTACCCTTAC
CCGGTCGAGAAGCGGCTCGACATCGACAAGTATTCGGTGGCGGGCGAGAGCGCTCGAAACGAGCCGCGCGCGTTCCAGGT
GCAATATGGCGAAACCGATTTCGACTTCGTTCAGCGTTTGATGGAGGAGTGGGGGATCTACTGGTTCTTCGAGCATTCGG
ACAACAAGCATCGCCTGGTCTTGTGCGATCACATCGGCGGGCATCGCAAGGCGCCGAGCGAGGCCTATCACGAGATCGCG
CATCACCCGGAAGGCGGGAAGATCGACATCGAGTACATCAACTATTTCTCGACGGACGAAGCGCTGCGGCCCGGCCGCGT
CGTGATAGACGATTTCGACTTCACGCGTCCGCTCGCGAGCCTCGTCACGTCCAATCACCAGCCGCGGGAGACGAACTGGG
GGGAGGGCGAGCTGTTCGAATGGCCGGGCGACTATACCGATAGCAAGCATGGCGATCTCATCAGCCGCGTGCGCATGGAA
GAGCGCCGCGCGACCGGGTCGCGCGCATACGGTCGGGGCAACGTGCGCGGCCTCGCCTGCGGTCATACGTTCGTGCTGTC
GAAGCACAAGCACGACGGCGCGAACCGCGAGTACCTCGTCATCGAATCGGCGTTGATGCTGACCGAAGTCGCGGACGAAA
CGGGCAGCGGCTACCGCTACGAATGCGATAACGAACTGGTCGTGCAGCCGTCGAACGAGGTGTTTCGAATGCCGCGCGAA
ACGCCCAAGCCGACGACGAGCGGGCCACAGTCCGCGATCGTGGTCGGGCCGCCGGGCCACGAGGTATGGACCGACGAATT
CGGCCGCGTGAAGATCCGTTTTCTGTGGGATCGCTACGCACGCAATGACGCAACGGATTCCTGCTGGGTACGCGTGAGCC
AGGCGTGGGCCGGCGTGAACTTCGGCGGCATCTACATTCCGCGGATCGGACAGGAAGTGATCGTCGGATTCATGAACGGC
GATCCGGACCGTCCGCTGATTCTCGGCAGCCTCTACAACACCATTACGCCGCCGCCTTGGGATCTGCCCGGCGACGCGAC
GAAGAGCGGATTCAAGAGCAAGTCGATCACGGGCGGGCGCGAGAACTATAACGGCATCCGCTTCGAGGACAAGCTGGGGG
CCGAGGAATTTCACATGCAGGCGGAAAAGGACATGAACCGCCTGACGAAGAACGACGAGTCGCATACGGTCGGCGCGAAT
TTTTCGATCGGCGTCGGGCTTACCCATACGCGCGCGGTGGGCGCCATGTTCAGCAGCATCGTCGGCGGAGCCGCCAGCTA
TGCGGTGGGGGGCGCGGAATCGACGATGATCGGCGGCGCGTATGCGTTGAACGTCGGCGGCGCGCACGCGGTTGCGGTGG
GCGGCGCGTCGTCCGTTTCCGTTGGCGGCGCCTACGCGCGCAACGTGGGCGGCGCGTATGCGCTGACAGTCGGCGGCGTG
CTGTCGATCGTCTGCGGTGCGTCTTCGATCACCATGACGGCTTGCGGCTCGATCAAGATCGTCGGCAAGAATATTCGCAT
CATCGGCAGCGACGAAGTCGTCGTGCAAGGCGCGCCCCTGCAACTGAATCCGGGCGATTCGGATTGCGGCGGAGGGGGCG
GCGGCGGAGGCGGCGGCGGCGCGATTCCGCCGATTCCGTTGCCGTCGTTCTTCCTCGATATCACGAAGCCGATTCTTCCG
CCGCCGCCGCCGCCACCGACGGAGGTGCCACCGGATCCGACGCCGACGCCCACGCCGACGCCGACGCCGACGCCCACGCC
GACGCCGACGCCGACGCCCACGCCGACGCCGACGCCGACGCCCACGCCGACGCCGACGCCGACGCCCACGCCGACGCCCA
CGCCCACGCCCACGCCGACGCCGACGCCGACGCCGACGCCGACGCCGACGCCCACGCCGACGCCCACGCCCACGCCGACG
CCCACGCCCACGCCGACGCCCACGCCGACGCCCACGCCGACGCCAACGCCAACGCCAACGCCAACGCCGACGCCCACGCC
CACGCCGACGCCCACGCCGACGCCAACGCCGACGCCGACGCCGACGCCGACGCCGACGCCGACGCCCACGCCGACGCCCA
CGCCGACGCCCACGCCCACGCCCACGCCGACGCCAACGCCAACGCCCACGCCGACGCCCACGCCCACGCCGACGCCGACG
CCGACGCCAACGCCAACACCAACGCCAACTCCGACTAGTTCCGAGATTTAA

Protein sequence :
MPNHFSNGRTNQSRTVVIRSGAMPRLLGQPALEFLSLRGEEHLGKLYTYELLLRTPDDFHVPLATSANLDLKAMIGTEMT
VCIQLDGIGTGAQGGVGAGAREISGLVVKAGFLRCEGRYNVYRIELRPWLWLATLTSDYKIFQDKSVVEIIDTVLHDYPY
PVEKRLDIDKYSVAGESARNEPRAFQVQYGETDFDFVQRLMEEWGIYWFFEHSDNKHRLVLCDHIGGHRKAPSEAYHEIA
HHPEGGKIDIEYINYFSTDEALRPGRVVIDDFDFTRPLASLVTSNHQPRETNWGEGELFEWPGDYTDSKHGDLISRVRME
ERRATGSRAYGRGNVRGLACGHTFVLSKHKHDGANREYLVIESALMLTEVADETGSGYRYECDNELVVQPSNEVFRMPRE
TPKPTTSGPQSAIVVGPPGHEVWTDEFGRVKIRFLWDRYARNDATDSCWVRVSQAWAGVNFGGIYIPRIGQEVIVGFMNG
DPDRPLILGSLYNTITPPPWDLPGDATKSGFKSKSITGGRENYNGIRFEDKLGAEEFHMQAEKDMNRLTKNDESHTVGAN
FSIGVGLTHTRAVGAMFSSIVGGAASYAVGGAESTMIGGAYALNVGGAHAVAVGGASSVSVGGAYARNVGGAYALTVGGV
LSIVCGASSITMTACGSIKIVGKNIRIIGSDEVVVQGAPLQLNPGDSDCGGGGGGGGGGGAIPPIPLPSFFLDITKPILP
PPPPPPTEVPPDPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPT
PTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPT
PTPTPTPTPTPTSSEI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0319 NP_454896.1 Rhs-family protein Not tested SPI-6 Protein 8e-120 43