Gene Information

Name : BURPS668_A0166 (BURPS668_A0166)
Accession : YP_001061172.1
Strain :
Genome accession: NC_009075
Putative virulence/resistance : Unknown
Product : Rhs element Vgr protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3501
EC number : -
Position : 148425 - 151217 bp
Length : 2793 bp
Strand : +
Note : identified by match to protein family HMM PF04524; match to protein family HMM TIGR01646
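
Coordinate check : the Position and Length fields above agree if the coordinates are read as 1-based and inclusive, which is an assumption here (the record does not state its convention). A minimal Python sketch of that arithmetic follows; the function name feature_length is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 148425, 151217
length_bp = feature_length(start, end)

# A complete coding sequence should be a whole number of codons.
codon_count, remainder = divmod(length_bp, 3)

print(length_bp)    # length in bp
print(remainder)    # 0 when the length is a multiple of 3
print(codon_count)  # number of codons, including any stop codon
```

Under that assumption the computed length matches the Length field (2793 bp), which corresponds to 931 codons.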

DNA sequence :
ATGCCAAACCATTTTTCGAACGGACGGACGAATCAAAGCCGCACGGTAGTGATCCGCAGCGGTGCGATGCCGCGGCTGCT
CGGTCAGCCCGCGCTCGAGTTCCTGTCGCTGCGCGGTGAAGAGCACCTCGGAAAACTCTACACGTACGAATTGCTCCTGC
GCACGCCGGACGATTTTCATGTTCCGTTGGCAACGAGCGCGAATCTCGACCTGAAGGCGATGATCGGCACGGAGATGACG
GTCTGCATTCAGCTCGACGGAATCGGGACGGGCGCGCAAGGCGGCGTTGGCGCGGGTGCGCGCGAAATCAGCGGGCTCGT
GGTCAAGGCGGGCTTCCTGCGCTGCGAGGGGCGCTACAACGTCTATCGCATCGAGCTGCGCCCCTGGCTGTGGCTCGCGA
CTCTGACGAGCGACTACAAGATTTTTCAGGACAAGAGCGTCGTCGAAATCATCGATACGGTCTTGCACGATTACCCTTAC
CCGGTCGAGAAGCGGCTCGACATCGACAAGTATTCGGTGGCGGGCGAGAGCGCTCGAAACGAGCCGCGCGCGTTCCAGGT
GCAATATGGCGAAACCGATTTCGACTTCGTTCAGCGTTTGATGGAGGAGTGGGGGATCTACTGGTTCTTCGAGCATTCGG
ACAACAAGCATCGCCTGGTCTTGTGCGATCACATCGGCGGGCATCGCAAGGCGCCGAGCGAGGCCTATCACGAGATCGCG
CATCACCCGGAAGGCGGGAAGATCGACATCGAGTACATCAACTATTTCTCGACGGACGAAGCGCTGCGGCCCGGCCGCGT
CGTGATAGACGATTTCGACTTCACGCGTCCGCTCGCGAGCCTCGTCACGTCCAATCACCAGCCGCGGGAGACGAACTGGG
GGGAGGGCGAGCTGTTCGAATGGCCGGGCGACTATACCGATAGCAAGCATGGCGATCTCATCAGCCGCGTGCGCATGGAA
GAGCGCCGCGCGACCGGGTCGCGCGCATACGGTCGGGGCAACGTGCGCGGCCTCGCCTGCGGTCATACGTTCGTGCTGTC
GAAGCACAAGCACGACGGCGCGAACCGCGAGTACCTCGTCATCGAATCGGCGTTGATGCTGACCGAAGTCGCGGACGAAA
CGGGCAGCGGCTACCGCTACGAATGCGATAACGAACTGGTCGTGCAGCCGTCGAACGAGGTGTTTCGAATGCCGCGCGAA
ACGCCCAAGCCGACGACGAGCGGGCCACAGTCCGCGATCGTGGTCGGGCCGCCGGGCCACGAGGTATGGACCGACGAATT
CGGCCGCGTGAAGATCCGTTTTCTGTGGGATCGCTACGCACGCAATGACGCAACGGATTCCTGCTGGGTACGCGTGAGCC
AGGCGTGGGCCGGCGTGAACTTCGGCGGCATCTACATTCCGCGGATCGGACAGGAAGTGATCGTCGGATTCATGAACGGC
GATCCGGACCGTCCGCTGATTCTCGGCAGCCTCTACAACACCATTACGCCGCCGCCTTGGGATCTGCCCGGCGACGCGAC
GAAGAGCGGATTCAAGAGCAAGTCGATCACGGGCGGGCGCGAGAACTATAACGGCATCCGCTTCGAGGACAAGCTGGGGG
CCGAGGAATTTCACATGCAGGCGGAAAAGGACATGAACCGCCTGACGAAGAACGACGAGTCGCATACGGTCGGCGCGAAT
TTTTCGATCGGCGTCGGGCTTACCCATACGCGCGCGGTGGGCGCCATGTTCAGCAGCATCGTCGGCGGAGCCGCCAGCTA
TGCGGTGGGGGGCGCGGAATCGACGATGATCGGCGGCGCGTATGCGTTGAACGTCGGCGGCGCGCACGCGGTTGCGGTGG
GCGGCGCGTCGTCCGTTTCCGTTGGCGGCGCCTACGCGCGCAACGTGGGCGGCGCGTATGCGCTGACAGTCGGCGGCGTG
CTGTCGATCGTCTGCGGTGCGTCTTCGATCACCATGACGGCTTGCGGCTCGATCAAGATCGTCGGCAAGAATATTCGCAT
CATCGGCAGCGACGAAGTCGTCGTGCAAGGCGCGCCCCTGCAACTGAATCCGGGCGATTCGGATTGCGGCGGAGGGGGCG
GCGGCGGAGGCGGCGGCGGCGCGATTCCGCCGATTCCGTTGCCGTCGTTCTTCCTCGATATCACGAAGCCGATTCTTCCG
CCGCCGCCGCCGCCACCGACGGAGGTGCCACCGGATCCGACGCCGACGCCGACGCCGACGCCGACGCCAACGCCAACGCC
GACGCCGACGCCGACGCCAACGCCAACGCCTACGCCAACGCCAACGCCGACGCCGACGCCGACGCCAACGCCAACGCCAA
CGCCAACGCCAACGCCAACGCCTACGCCTACGCCAACGCCTACGCCAACGCCAACGCCGACGCCAACGCCAACGCCAACG
CCAACGCCAACGCCAACGCCAACGCCAACGCCAACGCCAACGCCAACGCCAACGCCAACGCCAACGCCAACGCCAACGCC
AACGCCGACGCCGACGCCGACGCCGACGCCGACGCCGACGCCCACGCCAACGCCCACACCGACGCCAACGCCAACGCCAA
CGCCAACGCCAACGCCGACGCCGACGCCAACGCCAACGCCAACGCCAACGCCAACGCCAACGCCAACGCCGACGCCGACG
CCGACGCCAACGCCAACGCCAACGCCAACGCCAACGCCAACGCCAACGCCGACGCCAACGCCAACGCCGACGCCGACGCC
GACGCCGACGCCCACGCCGACGCCGACGCCAACACCAACACCAACGCCAACTCCGACTAGTTCCGAGATTTAA

Protein sequence :
MPNHFSNGRTNQSRTVVIRSGAMPRLLGQPALEFLSLRGEEHLGKLYTYELLLRTPDDFHVPLATSANLDLKAMIGTEMT
VCIQLDGIGTGAQGGVGAGAREISGLVVKAGFLRCEGRYNVYRIELRPWLWLATLTSDYKIFQDKSVVEIIDTVLHDYPY
PVEKRLDIDKYSVAGESARNEPRAFQVQYGETDFDFVQRLMEEWGIYWFFEHSDNKHRLVLCDHIGGHRKAPSEAYHEIA
HHPEGGKIDIEYINYFSTDEALRPGRVVIDDFDFTRPLASLVTSNHQPRETNWGEGELFEWPGDYTDSKHGDLISRVRME
ERRATGSRAYGRGNVRGLACGHTFVLSKHKHDGANREYLVIESALMLTEVADETGSGYRYECDNELVVQPSNEVFRMPRE
TPKPTTSGPQSAIVVGPPGHEVWTDEFGRVKIRFLWDRYARNDATDSCWVRVSQAWAGVNFGGIYIPRIGQEVIVGFMNG
DPDRPLILGSLYNTITPPPWDLPGDATKSGFKSKSITGGRENYNGIRFEDKLGAEEFHMQAEKDMNRLTKNDESHTVGAN
FSIGVGLTHTRAVGAMFSSIVGGAASYAVGGAESTMIGGAYALNVGGAHAVAVGGASSVSVGGAYARNVGGAYALTVGGV
LSIVCGASSITMTACGSIKIVGKNIRIIGSDEVVVQGAPLQLNPGDSDCGGGGGGGGGGGAIPPIPLPSFFLDITKPILP
PPPPPPTEVPPDPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPT
PTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPT
PTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTSSEI
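
Translation check : the protein sequence above appears to be the forward-strand translation of the DNA sequence. The sketch below shows how the first and last codons map to residues, assuming the standard codon assignments (for sense and stop codons these are the same in the bacterial table). The helper name translate is hypothetical, and only short fragments copied from the record are used rather than the full sequence.

```python
from itertools import product

# Standard codon table, built in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 18 bases of the DNA sequence in this record.
head = "ATGCCAAACCATTTTTCGAACGGACGGACG"
tail = "ACTAGTTCCGAGATTTAA"

print(translate(head))  # compare with the start of the protein sequence
print(translate(tail))  # compare with the end of the protein sequence
```

The first fragment translates to MPNHFSNGRT and the last to TSSEI followed by a stop, matching the start and end of the listed protein.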

Homologs from PAI DB :

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
STY0319 | NP_454896.1 | Rhs-family protein | Not tested | SPI-6 | Protein | 9e-120 | 43