PAI Gene Information


Name : YE3511 (YE3511)
Accession : YP_001007673.1
PAI name : YAPI
PAI accession : NC_008800_P2
Strain : Yersinia enterocolitica 8081
Virulence or Resistance: Not determined
Product : hypothetical protein
Function : -
Note : Similar in parts to Salmonella typhi hypothetical protein Sty4528 SWALL:Q8Z1L7 (EMBL:AL627282) (447 aa) fasta scores: E(): 1.5e-49, 46.48 38d in 398 aa and to Xylella fastidiosa hypothetical protein Xf1781 SWALL:Q9PCJ8 (EMBL:AE004000) (418 aa) fasta score
Homologs in the searched genomes :   37 hits    ( 37 protein-level )  
Publication :
    -Delihas,N., "Annotation and evolutionary relationships of a small regulatory RNA gene micF and its target ompF in Yersinia species", BMC Microbiol. 3, 13 (2003) PUBMED 12834539 REMARK Publication Status: Online-Only.

    -Delihas,N., "Direct Submission", Submitted (19-JAN-2007) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Thomson,N.R., "Direct Submission", Submitted (30-JUN-2006) Thomson N.R., Pathogen Sequencing Unit, The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, CB10 1SA, UNITED KINGDOM.

    -Thomson,N.R., Howard,S., Wren,B.W., Holden,M.T., Crossman,L., Challis,G.L., Churcher,C., Mungall,K., Brooks,K., Chillingworth,T., Feltwell,T., Abdellah,Z., Hauser,H., Jagels,K., Maddison,M., Moule,S., Sanders,M., Whitehead,S., Quail,M.A., Dougan,G., Parkh, "The complete genome sequence and comparative genome analysis of the high pathogenicity Yersinia enterocolitica strain 8081", PLoS Genet. 2 (12), E206 (2006) PUBMED 17173484.


DNA sequence :
ATGATGGGGCTACCGCCAGACAGCATCGTTGAATTCACGTTATCACGAATGCAGACAAGCCTTGCCAAACGCACATCAGA
GAATGATGTCACCAACGAGCGCAGTGGCTTACTTTTTCTCGGCAATGTACATGATGCATTTCCGCGCCAATTATTTATGG
ACATTCGGCTCTCTCCTTTAGATAAAACGGCGTGGGTCATGATCCGTCTTTACGCGCAGCAGCATGAAGGCGCTATCTTC
CCGACTTATGACGAGTTGCAGTTACAACTGGCATCACCACACGCGGAAAAAGCCTCACGTGAAACCATTAGCCGCGCGTT
GTTAATGTTGAGAATTACCGGATGGCTGAGCCTCTGTAAGCGAGTCAGGGATGCACATGGCCGTGTCCGGGGAAATATTT
ATGCCCAGCATGACGAGCCTCTCAGTTGCAGAGATGCTGAAACCTTTGACCCTGGCTGGTTGGATCTCGTGGCATCAAGC
TGTCAGCACAAGAACAAAGCAGTGCGAATGACCGCACTGGCAGTACTAAATGAGATTCGCAACGATCCGACCATGCGTCA
TCGACATTCTCGCTTAAGCATGATTGAAAACCGGTTGGGGAGTTCAGCTACACCGCGACAAATGGCAGCGTTACGCCGAG
AGGCTGTACCCAGTTCGAATTCCGAACTCAGTAAAAAAGGAATACAGAATGGCTCAAAAACACTGGGTTCGAATTTCGAA
CTCTGTAATAAATCAGCCATTTATCCCGAAGTTCGAAAACCGAACTGTAACGTACGTAATTTCACACTAAGTGTGAATAA
AAAAACATACGTATCTGATGCGAACAGTGTTAAGGACTCGCCCGCACTGTCCTTGCCAGATTCTCTGCTAAGCAGAATAT
CTGCAGAGGATGCGGACATGCTGATGAGGCAACTGAGTGCATTGCCGGAAAATCAGTCGTTAGCGTTGCTGACGCAGCTC
AGGACACAATTACGCCGTGGTCAGTTGAGTAATCCGCTTGGGTGGATGCTTAGTATGCTCAAGCGTGCTCGTGAAGGGCA
GATGACTATGCCGGAAGCGGCGATTTCACCCAAACAGTCTCAACGACCACTTTCTGCTAGTGAGCCGGTTGGCCGTGGTT
TATTACCGATAGAACGAGCCTCTGCAGGACAGGTAGCCCGTGTTATTGCAGATATAAAAAGCAAATTCTTCCATGGTGGA
TAA

Protein sequence :
MMGLPPDSIVEFTLSRMQTSLAKRTSENDVTNERSGLLFLGNVHDAFPRQLFMDIRLSPLDKTAWVMIRLYAQQHEGAIF
PTYDELQLQLASPHAEKASRETISRALLMLRITGWLSLCKRVRDAHGRVRGNIYAQHDEPLSCRDAETFDPGWLDLVASS
CQHKNKAVRMTALAVLNEIRNDPTMRHRHSRLSMIENRLGSSATPRQMAALRREAVPSSNSELSKKGIQNGSKTLGSNFE
LCNKSAIYPEVRKPNCNVRNFTLSVNKKTYVSDANSVKDSPALSLPDSLLSRISAEDADMLMRQLSALPENQSLALLTQL
RTQLRRGQLSNPLGWMLSMLKRAREGQMTMPEAAISPKQSQRPLSASEPVGRGLLPIERASAGQVARVIADIKSKFFHGG