Gene Information

Name : YPK_0478 (YPK_0478)
Accession : YP_001719234.1
Strain : Yersinia pseudotuberculosis YPIII
Genome accession: NC_010465
Putative virulence/resistance : Virulence
Product : virulence plasmid 65kDa B protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 522649 - 527118 bp
Length : 4470 bp
Strand : -
Note : PFAM: virulence plasmid 65kDa B protein; KEGG: yps:YPTB3553 insecticidal toxin complex

DNA sequence :
ATGGAAAACAGTAAACAGCAAGTGGCCGTTGCTCCCTTGTCGCTCCCTAAAGGGGGCGGTGCGATTACCGGTATGGGCGA
TAGTTTAGGTCCTATCGGCCCCAGTGGCATGGCAACACTGACGCTGCCCCTGCCGATCTCCGCAGGCCGCGGTTACGCCC
CCTCGCTCACGCTAAGTTACAGTAGCGGCAGTGGTAACGGCCCGTTTGGCCTTGGCTGGCAACTCGGCACCATGGCGATT
CGCCGCCGAACCAACGCCCAAGTGCCACGTTACGATGAGTATGATGAATTTCTGGCTCCCAATGGGGAAGTCATGGTGGT
TGCCGCGGATCCGCAGGGCAATATCGAACGCACTGAACAGTCACTAAATGGGGAACAATTCAGCGTAATTCGTTACTTAC
CACGTATTGAAGGCAATTTTCATCGCATTGAGTATTGGCGACCCCGGACAAATAACAGCCAGGCACCGTTTTGGTTAGTC
CACAGTTCAGATGGCCAAAAACACGGTTTGGGATATTCCGCCTCAGCCCGAATTGCCGATCCACTGCACCCTGAGCATAT
TGCTGAATGGCTACTGGAAGAGTCGGTGTCTCTGAGCGGTGAGCATATCTGCTACCAGTATCAGGCAGAAGATGAACAGG
ATATTGACGAATCTGAGAAACAAAATCATCCGGCAGCCAGTGCGCAACGCTATCTAAGCACCGTGGTTTATGGCAATAGA
GAAGTGGCGCATGAACTCTATTGTCTGACTCAGCGACCTGCGCCGACAAGCTGGTTATTTAGCCTGATCTTCGATCACGG
CGAATACAGTAATATTGCGGAGCAGGTGCCGGTAATCATAAAAGGAAAATCTTGGAATTTCCGTCAGGACGCCTTCTCAC
GCTTTAGTTGCGGCTTTGAAGTGCGTACCCGCCGCCTGTGTCAGCAAGTGTTGATGTACCATAATTTGTCAGCGCTTAAA
GGTGATGAACCCGATGCTCAAGCCACGCTGGTTAGCCGTCTGCGTTTACACTATCAGCACGATGCTTATGCCACCCAACT
GGTGGGTTGTCAGCAGTTAGCCCATGAACCCGATGGCACCAAACGTAGCCTGCCACCGCTAGAATTTGATTATCAGGATT
TTTCAACGCGTGACGCCCTTGGTTGGCAACCACTTACTGATTGGGCTGAATTTAACTATCAGTACCAAATGGTGGATCTC
AACGGTGAAGGGATACCGGGGATGTTGTATCAGGACAGCGGTCACTGGATTTATCGCCCACCGGTTCGCCAACCAGGCAC
CGCCGACGGCATCACTTTTGGCGCGGCCCAACGGTTACCGAGCCTACCCGCGATGCGCGAAAATGCCATGTTGATGGATA
TCAATGGTGACGGCAAGCTGGATTGGGTGATCAGTCAACCGGGCTTAGCGGGCTACTTTAGCCGTGATCCTGACCTTAGC
CGTGATCCTGATCTTAGCTGGACACAATTCATCCCGTTATCTACCTTGCCAGCGGAGTATTTCCATCCACAAGCGCAACT
GGTCGATTTAGCCGGCAGCGGGTTATCTGATCTGGCGTTAGTCGGGCCAAAAAGTGTCCGCGTGTATACCAATCTGTGTG
ACAGTTTTGCTGCCGCAACCCAAGTAGCACAAGATGATGACATCACGCTACCACTGCCCGGAGTTCACTTCACTGAACTG
GTGGCGTTCAGCGATGTGATGGGGTCGGGCCAACAACATCTGGTGCGAATACGCCACAATAGCGTCACCTGCTGGCCCAA
TCTCGGCCATGGCCGTTTTGGTCACCCCCTCTCTTTACCGGGGTTTAACCAGCCCATTGAGCAATTTAATCCGCTAGCGA
TCTATCTGGCGGATATTGATGGCTCCGGTACCATTGACCTTATTTATGCCACCACCAGCCAACTGCTGATTTACCGCAAC
CAGAGTGGTAACCGCTTTGCCGAACCAGTGGCTATCGCATTGCCAACAGGCATTCGCTTTGATAATAGCTGCCAACTGTC
ACTGGCAGATATTCAGGGGCTGGGCGTGGCCAGCATCATGTTGAGTGTGCCGCACCCCACCACACAACATTGGCGCTATG
ATTTTGTCGCCAGTAAACCCTATTTACTGTGCACCACCAACAATAATATGGGCGCAGAGAGTCAGTTGTTTTACCGCAGT
TCAGCCCAATTCTGGCTGGATGAAAAAGCGCAGGCGGCCAAACAGGGCCGATCACTGGCCTGCCAGTTACCCTTTCCGAT
CCATCTATTAGCGCAAACCACACAACTTGATGAGATCACCGGAAACAGCCTGAGTCAAACCGCCCGCTACTTCCATGGTT
TTTATGATGGCGTACAACGTGAATTCTGTGGTTTTGGCCGTGTCGATACGCTGGATACCGATACCTCCGCACAAGGCAGC
GCCGCTGAACGCACTGCGCCGACCAAAAGTAGTAACTGGTTCCATACCGGCCGGGCAGACAATGAGACACTATGGCAAAG
TGAGTACTGGCAGGGCGATGACCAAGACTACCCATTACTTCCCACCCGTCTGACAAAATTTATTAACGATACCCAAGGTG
ACGACGTACTCAGTGAACTCGATGATAATCAAACGTTTTGGTTACACCGGGCACTGAAAGGTTCATTGTTGCGCAGCGAA
CTCTATGGTCTGGACGGTAGCGAACTGGCCACACAACCCTATAGCGTCAACAGTGCTCGCTATCAGGTGCGGCAAATTCA
ATCCTCTACGGATGAAATAACCTCCCCGGTAGCATTACCGATGATGCTGGAACAACTCAACTATCACTACGAACGCATAG
TGCAAGACCCGCAATGCAACCAACAGATTGTGCTACGTTGCGATGAATTTGGTCACCCATTGCACAGTGCGACAATCTAT
TATCCACGCCGCGATAAAGCCAGTATTCCTCCGTATTCCTGGCTGGCTGAAGGACATTGGGACAGCCATTTTGACGAGCA
GCAGCAGCAGTTGCGCATCACTGAAAGCCAGCAATCTTATCACCATGAGATCAGTGATAAATTCTATGTGCTAGGCCTGC
CGGCCGGGCAGCGCAGTGATGTACTGACCTATCCAGACAATTTCGTGCCTACAGCGGGGATTCATTGGGAAGAATTACAA
CAACCAGAGGGCCTGCTCGGTACTAAAGCAGAGCGAACCTTTGCCGGGCAGCAGCAGGTTTTCTACGCCTCGGACACCAT
TCCGGGCCTGGTTGCCTATAGCCAACAGGCGGAATTTGACGATCAAACCTTGGCCGCGTTGGATGAATTATTACCAACGA
ATGAGCGTGAACAACAGCTTATCGAGGCAGGTTATCAACGAGCACCCCGTCTATTTTCTCGCCCCGGAGAAACAGATATT
TGGGCCGCCCAACGTGGGTTTACTGACTATGGCGATGCATCTCGTTTCTACCGCCCCATCAGTCAACGTAGCTCACCATT
GGTAGGAAAAACAGCCTTAGAATGGGATAAAAATAGCTGTGCCATTACCCAAATGATATTGGCTGATGGCAGTACCACCC
AGGCTGAATATGATTATCGTTTTATCACCCCTTATCACCTCACGGATATCAACGATAACTCCCGTCATATTGAACTGGAT
GCGCTGGGGCGCGTCACCTCCAGCCGTTTTTGGGGCACCGAGCTTGACTCACAAACCGGTGAGGTCAGCACAACCGGTTT
CCCTTTAATCGCTGAGCCCCCCTTTACCGTACCCAATTCAGTTGATGCCGCTATCAGCATGGAGAATACCCAGGTTCCCG
TCGCGCAATTCTCTGTCTATCAACCTCAAAGCTGGATGATTTCGTTACAGCTTGATGACATTGAAACATTGTCTGAAACC
AATAACGTCACTCTAGAATATCTATTCCAGAATCATATTCTGATCGATAACTATTATCTTTGCCCCCTTGCGCTACGTCG
CTGGATAAGACAAAGCAACCCTCTCATCACCGAATACGTCGGCCTGACATTGAAAAATCCCGTGCGCCAACCCCCCCATG
TCTTAACCGTCGTCGTGGATAACTACTTTTCTGCTGCTGAGCCACAACAACATCAACAAACTCTCGCTTTCAGCGACGGC
TTTGGCCGCGTGTTGCAAAGCGCGCAACGGGTGGAGGCAGGAACCGCTTATTTCCACACAGGAGAGGGCGGTCTGGAGAT
GGATCAACAAGGTCATCTCACGCAAGACGAGAGCGATCAACGCTGGGCAGTTTCTGGCCGTACCGAGTACGACAATAAAG
GTCTGCCTATCCGCCGCTATCAACCCTATTTCCTCGATGACTGGCGTTATATCGCGGATGACAACGCCCGCAAAGAAGCC
TGGGCCGACACCCATATTTACGACCCACTCGGACGAGAAATAAAAGTGATCACCGCCAAAGGTTACCTGCGACGAGCACA
CTATTTCCCGTGGTTTGTTATCAGCGAGGATGAAAATGACACGGCGTCAGAAATCACGCCGAATCCCTAA

Protein sequence :
MENSKQQVAVAPLSLPKGGGAITGMGDSLGPIGPSGMATLTLPLPISAGRGYAPSLTLSYSSGSGNGPFGLGWQLGTMAI
RRRTNAQVPRYDEYDEFLAPNGEVMVVAADPQGNIERTEQSLNGEQFSVIRYLPRIEGNFHRIEYWRPRTNNSQAPFWLV
HSSDGQKHGLGYSASARIADPLHPEHIAEWLLEESVSLSGEHICYQYQAEDEQDIDESEKQNHPAASAQRYLSTVVYGNR
EVAHELYCLTQRPAPTSWLFSLIFDHGEYSNIAEQVPVIIKGKSWNFRQDAFSRFSCGFEVRTRRLCQQVLMYHNLSALK
GDEPDAQATLVSRLRLHYQHDAYATQLVGCQQLAHEPDGTKRSLPPLEFDYQDFSTRDALGWQPLTDWAEFNYQYQMVDL
NGEGIPGMLYQDSGHWIYRPPVRQPGTADGITFGAAQRLPSLPAMRENAMLMDINGDGKLDWVISQPGLAGYFSRDPDLS
RDPDLSWTQFIPLSTLPAEYFHPQAQLVDLAGSGLSDLALVGPKSVRVYTNLCDSFAAATQVAQDDDITLPLPGVHFTEL
VAFSDVMGSGQQHLVRIRHNSVTCWPNLGHGRFGHPLSLPGFNQPIEQFNPLAIYLADIDGSGTIDLIYATTSQLLIYRN
QSGNRFAEPVAIALPTGIRFDNSCQLSLADIQGLGVASIMLSVPHPTTQHWRYDFVASKPYLLCTTNNNMGAESQLFYRS
SAQFWLDEKAQAAKQGRSLACQLPFPIHLLAQTTQLDEITGNSLSQTARYFHGFYDGVQREFCGFGRVDTLDTDTSAQGS
AAERTAPTKSSNWFHTGRADNETLWQSEYWQGDDQDYPLLPTRLTKFINDTQGDDVLSELDDNQTFWLHRALKGSLLRSE
LYGLDGSELATQPYSVNSARYQVRQIQSSTDEITSPVALPMMLEQLNYHYERIVQDPQCNQQIVLRCDEFGHPLHSATIY
YPRRDKASIPPYSWLAEGHWDSHFDEQQQQLRITESQQSYHHEISDKFYVLGLPAGQRSDVLTYPDNFVPTAGIHWEELQ
QPEGLLGTKAERTFAGQQQVFYASDTIPGLVAYSQQAEFDDQTLAALDELLPTNEREQQLIEAGYQRAPRLFSRPGETDI
WAAQRGFTDYGDASRFYRPISQRSSPLVGKTALEWDKNSCAITQMILADGSTTQAEYDYRFITPYHLTDINDNSRHIELD
ALGRVTSSRFWGTELDSQTGEVSTTGFPLIAEPPFTVPNSVDAAISMENTQVPVAQFSVYQPQSWMISLQLDDIETLSET
NNVTLEYLFQNHILIDNYYLCPLALRRWIRQSNPLITEYVGLTLKNPVRQPPHVLTVVVDNYFSAAEPQQHQQTLAFSDG
FGRVLQSAQRVEAGTAYFHTGEGGLEMDQQGHLTQDESDQRWAVSGRTEYDNKGLPIRRYQPYFLDDWRYIADDNARKEA
WADTHIYDPLGREIKVITAKGYLRRAHYFPWFVISEDENDTASEITPNP

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
tcaC'(2) CAI77376.1 putative insecticidal toxin complex protein Not tested tc-PAIYe Protein 0.0 58
tcdB2 AAO17202.1 TcdB2 Virulence tcd island Protein 0.0 50
tcdB1 AAL18487.1 TcdB1 Virulence tcd island Protein 0.0 50