Gene Information

Name : W5S_0142 (W5S_0142)
Accession : YP_006281153.1
Strain : Pectobacterium sp. SCC3193
Genome accession: NC_017845
Putative virulence/resistance : Unknown
Product : YD repeat protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 167185 - 171471 bp
Length : 4287 bp
Strand : +
Note : -

DNA sequence :
ATGCTTGATGACATCCTCTCGCGCATTGCGCGGGTTGGCGCGATGCATGCGGGGAATCGCCCCACCCTGCCACCCGATTT
GCCTGAACCGGGGCAGGGTAAACCGCCGACCTCACCGGGCAAGCCTATCAAACACAGCAGCTTTCTGGGGGCGTTGCTCG
GTGCGGTGGCCGGGGCACTCGTGGCCGCCGCCGTCGCGGCGGTGGCCGTTGCCCTGGTGGGCGTCACTGGCGGGCTGGCC
ATTGCTCTGGTTGGTGGGCTGGCGGCACTGGGTGCAGGGAGTCTGATATCTGCCGTCAGTGGTCGGGTGTCGGCCATGGT
CGACAGCGCTTCGCCGCCATCAGGGCAGGTTGATGGCGGTTCGAAGACCGTTTTTGTTGAAGGCAATCCGGTCTCCCGTG
CGGAAATCGATGCGGTGAAGTGCGCTAAACACAATGGTCCGCAACTGATTGCGCAGGGCAGTGAAACGGTGTTTGTCGAG
GGATACTATGCCGCACGGGTGGACGACAAGGCCGTGTGCGGTGCGACCATCAAGGAAGGGGCCTCGACGGTGTTCTTTGG
CTCCGGGCAGGCCTCGCCCCTCAAGGTGCAGGAGGAGTTCAGCGGCTGGCAGAAAGCGTTACTGATTGCGGTGGAGTTTC
TGGTGCCGCCGAGCAAGGGACTATTCAGAGGATTGGGCCGGTTATTTACCGGAAAAGGGCTTGTCGGGGTACTCCGGGGG
GCTAAAGCGGCGGCTAAATATCTAGCCCGGGTACCCGGAAAAACCCGCTGTGCGGCAACGGCATTTATTCACAGCAAAGG
ACTGGCCCGCTTTTCTCATGCCAAAGCGGCCTTCAAAGCCGACCCGGTCTACATCGCCAGCGGCGAGGTGATTGAAAGCC
GTACCGATATCGAACTGGGGCAGACCATTCCTCTGGTATTTGAGCGCACCTATCGCTCCGGTAGCCCTCACACCGGCCTG
CTGGGACAGGGCTGGCACGATAGCTGGAGTGAAGTGGCCACCGTTGACCGCACCGACAAGGGTGACATCCAGGTGACGCT
CACGCTGGCGCAGGGTTACGACATCGACTTCACCTTCGGGATAGGGGCAACGGTGGTGTACTGCGCCGAATACCCCGAAT
TCAAACTGGTGAAACGCCATAGCGGCTTTCATCTGTGGCACCGCGACAGCCAGACCTGGCGGGCGTTTACCGTCAAACAA
GATAACCGACTGCTGCTGTCGGCGATAACCGACAACCACCATAACCGCATTGACTTTCTGCGCGACCCCAAAGGCTACCT
GCGCAAGATTAGGCACGGCGACGGCATCGAGCTGCTGCTGGTGTGGCAGGGAGAATTTCTCAGCCAGATACAGCGTATCG
ACGGCGGGCAAAAAACCGTGCTGGCCGAATACCGGCAGGATGAACGGGGGCGACTGGTTGAGGCCGATGCGGCCCATGCC
TATCACCTGTTCTACGACTACGACAGCCATCACCGCCTGACGCGCTGGCACGATAACGACCAGACCTGGGCGCGGTATGA
ATACGACCATCAGGGACGCTGTATCTACACCACCTGTGCCGATGGCTACCTGACGGCGAATTTCGAGTACCTGGCTGACC
GCGTGGTGATGATTGACGGGCTGGGACAGCGCCATGAATACGGCTTCAACGACCTGTACCTGATGGCGTGGGAAAAATCT
CCGCTCGGCCATCTCACCCGCTATGAGTATGACGATGTGGGCAACCTGCTGCGGGAAATTTCCCCGGCGGGTCGGGCAGT
GGAATTTGCCTATCTGGGCGACAGCGGGCTGGTCAGCACCTTTACCGACGGCAGCGGCCACGCGTGGCACTACGCCTATG
ACGACCACGAGCGACTTATCGGTATTACCGACCCGCTGGGGCGCCGTTGGGTCTGGCAGTACGACAATAACGGCAACCCG
CTGAGTCTGACCGGGCCGGATGCGAGCGAAATGCGGTTTGCCTGGAACCGCTACGGCCTGCTGACCGAAGTGAGCGACCA
GAATGGACACGTGCAGGCCAGCCTGTTTTACGACCACCGTCAGCGCCTGTTGAGCGCCACCGATGCGGAAGCCCGCACTC
AGCAACTGCGCTACGACCAGCAGGACAGACTGACCACCTGGACACGCCCGGATGGTGCCACCTATCGGCTGGGTTATCGC
CGGGCAAGCTGGCGACTGCCGGAGCAACTGATACGCCCGGACGAGAAAGAGGAAAAACGCCAGTACGACAAGCATAACAA
CCTGCTGAACTACACCGACGGCAACGGTGCGGTCTGGCAGCAAACCTACGGCCCGTTCGACCTGCTGACTTCACGCACCG
ACGCTGAAGGCCGCACCTGGCAGTACGACTATGATAAAGACAGTCAGCAACTGATTGGTGTCACCGCCCCGGATGGCAAC
CGCTGGCAGTGGTGGCTGGATGCGGACGGACGGGTTATCCGTGAGCGGGACATGGCCGGTACGGAAACCCACTACGGCTA
TGACGAAGACGGGCACTGCATCAGCGTGCGCAATGGCGAAGGGGAAATCCGCCACTTTCTGTACGACGGGCGCGGCCTGC
TTATCAAAGAAACCGCGCCGGACGATACGCTGTATTACCGCTATGATGCGGCCGGTCGACTGACCGACGTGACATCGACC
ACCAGTCATGTCCAGCTTGAATATGACGAGCGTGACCGGGTGGTACAGGAGCATAACAGCGGCACGGCAATCCGACGCCA
CTATCAGGACGCATCGCACACCGTTACCCGCAGCCTGCACTGGGAAGGCGAAGAGGACAGCGCGGCACTGACCAGCACCT
TCTGCTACCGCGCCACGGGCGAACTGCATCAGGTGCAGTTGCCGGATGGCGCCGAGCTGACACTGACTCACGATGCGGCT
GGGCGGGAAGCCATCCGCCACAGCCCGGGCGGTTTTATACAGCGGCGTGAATACGACACGATGGGCTGGCTGACGCGGGA
GATGAGCGGGCAGACGGTTGACGGTCGCCTGCAGCCGGTACAGACGCGGGAATACCTCTACGACAGCGCGGGTAACCTGA
CGGGCACCCGGCGCAACCGTGAGGCTGCAGGTTACCGACTGGATGCCAGCGGACGGGTGCTGTCGGTACTGAGCGGTGGT
GCGGGGCGCACCGTCAACACCGAAGAGGAATACCGCTACACCCGCAGCGGGCTGCCGCAGGATGCGACAAGGCTGACCGA
CTGGCAGGCCGGACGCCTGACCCAGCAGGACAACACTCACTATCAGTACGACAAGGCCGGGCGACTCATTCGCCGGCAGG
TGGTGCAACCGGGCTATCGGCCACAGGTGTGGCACTACCGCTGGGACAGCCGCAATCAGCTCAGAGTAGTGGACACCCCA
AACGGTGAACGCTGGCTCTACCGCTACGACCCGTTCGGACGGCGCATCGGCAAACGCTGCGACCAGACAGCCGAAGCTAT
CCGTTATCTGTGGGACGGTGACCAGATAGCGGAGGTGCGGCACTATCGCGACAATCAACTTGTCTCACGGCGCCACTGGG
TGCACAACGGCTGGGAACTGCTGGTGCAGCAGCGTCAGAACGCCGACGAAAGCTGGGAAACGGACTTTGTCACCAGCGGC
CATAACGGTGAACCGCAGGCTATTTTCAACCAGACCGGTGAACTGCGCTGGCAGGCCCCCCGCGCTAACTTATGGGGCCA
GCGCTATACGGAAAACGCTGAAAAATATGATCCGGGGCTGGCGTTTGCCGGACAATACCGTGACGACGAAAGTGGCTTAT
GCTATAACCGTTTCAGGTATTATGACCCGAACGGCGGTTGTTATATCTCGCCTGACCCTATAGGGGTGTTGGGTGGGGAA
AGTAATTACGGGTATGTTCATAATCCTGTAAGTTGGATCGATCCGAAAGGGTTAGCTGGATGTAATATAGTCCATAGGGC
GGTGACTCCTGAACAAGCAGCAAGTATCCGGGCAGGCAATGGAATATCAAGGCCGACTCCATATCATAGAACGACACCGA
CTCAGCATGTAGCAGGAGCACCGCACTCGCGTGATCCGTGGATATCAACAACTAGAAGTCAATCAACGGCAGAATACTTC
GCCACTCATGGTGGAACCCAAGCAGCAAATCCAATTGTTAAGATAGATCTGTCTAAAATTCCGAGCGATAAGATTTTAGA
TGTATCAAACGCTCAAAAGGCTGCGGAACATTTGCAGACTCCATTCACCCGAAATGTAGCAGCAGCCCACCAAGAGGTAT
TAATCTTTGGAGAAATACCATCGGAAGCGATAATTGGATTTTTATAG

Protein sequence :
MLDDILSRIARVGAMHAGNRPTLPPDLPEPGQGKPPTSPGKPIKHSSFLGALLGAVAGALVAAAVAAVAVALVGVTGGLA
IALVGGLAALGAGSLISAVSGRVSAMVDSASPPSGQVDGGSKTVFVEGNPVSRAEIDAVKCAKHNGPQLIAQGSETVFVE
GYYAARVDDKAVCGATIKEGASTVFFGSGQASPLKVQEEFSGWQKALLIAVEFLVPPSKGLFRGLGRLFTGKGLVGVLRG
AKAAAKYLARVPGKTRCAATAFIHSKGLARFSHAKAAFKADPVYIASGEVIESRTDIELGQTIPLVFERTYRSGSPHTGL
LGQGWHDSWSEVATVDRTDKGDIQVTLTLAQGYDIDFTFGIGATVVYCAEYPEFKLVKRHSGFHLWHRDSQTWRAFTVKQ
DNRLLLSAITDNHHNRIDFLRDPKGYLRKIRHGDGIELLLVWQGEFLSQIQRIDGGQKTVLAEYRQDERGRLVEADAAHA
YHLFYDYDSHHRLTRWHDNDQTWARYEYDHQGRCIYTTCADGYLTANFEYLADRVVMIDGLGQRHEYGFNDLYLMAWEKS
PLGHLTRYEYDDVGNLLREISPAGRAVEFAYLGDSGLVSTFTDGSGHAWHYAYDDHERLIGITDPLGRRWVWQYDNNGNP
LSLTGPDASEMRFAWNRYGLLTEVSDQNGHVQASLFYDHRQRLLSATDAEARTQQLRYDQQDRLTTWTRPDGATYRLGYR
RASWRLPEQLIRPDEKEEKRQYDKHNNLLNYTDGNGAVWQQTYGPFDLLTSRTDAEGRTWQYDYDKDSQQLIGVTAPDGN
RWQWWLDADGRVIRERDMAGTETHYGYDEDGHCISVRNGEGEIRHFLYDGRGLLIKETAPDDTLYYRYDAAGRLTDVTST
TSHVQLEYDERDRVVQEHNSGTAIRRHYQDASHTVTRSLHWEGEEDSAALTSTFCYRATGELHQVQLPDGAELTLTHDAA
GREAIRHSPGGFIQRREYDTMGWLTREMSGQTVDGRLQPVQTREYLYDSAGNLTGTRRNREAAGYRLDASGRVLSVLSGG
AGRTVNTEEEYRYTRSGLPQDATRLTDWQAGRLTQQDNTHYQYDKAGRLIRRQVVQPGYRPQVWHYRWDSRNQLRVVDTP
NGERWLYRYDPFGRRIGKRCDQTAEAIRYLWDGDQIAEVRHYRDNQLVSRRHWVHNGWELLVQQRQNADESWETDFVTSG
HNGEPQAIFNQTGELRWQAPRANLWGQRYTENAEKYDPGLAFAGQYRDDESGLCYNRFRYYDPNGGCYISPDPIGVLGGE
SNYGYVHNPVSWIDPKGLAGCNIVHRAVTPEQAASIRAGNGISRPTPYHRTTPTQHVAGAPHSRDPWISTTRSQSTAEYF
ATHGGTQAANPIVKIDLSKIPSDKILDVSNAQKAAEHLQTPFTRNVAAAHQEVLIFGEIPSEAIIGFL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
rhs-core AAN64198.1 Rhs Not tested macrophage toxin pathogenicity island Protein 0.0 42