PAI Gene Information


Name : irp1 (YE2618)
Accession : YP_001006816.1
PAI name : HPI
PAI accession : NC_008800_P1
Strain : Yersinia enterocolitica 8081
Virulence or Resistance: Virulence
Product : yersiniabactin biosynthetic protein
Function : -
Note : -
Homologs in the searched genomes :   49 hits    ( 48 protein-level,   1 DNA-level )  
Publication :
    -Delihas,N., "Annotation and evolutionary relationships of a small regulatory RNA gene micF and its target ompF in Yersinia species", BMC Microbiol. 3, 13 (2003) PUBMED 12834539 REMARK Publication Status: Online-Only.

    -Delihas,N., "Direct Submission", Submitted (19-JAN-2007) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Thomson,N.R., "Direct Submission", Submitted (30-JUN-2006) Thomson N.R., Pathogen Sequencing Unit, The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, CB10 1SA, UNITED KINGDOM.

    -Thomson,N.R., Howard,S., Wren,B.W., Holden,M.T., Crossman,L., Challis,G.L., Churcher,C., Mungall,K., Brooks,K., Chillingworth,T., Feltwell,T., Abdellah,Z., Hauser,H., Jagels,K., Maddison,M., Moule,S., Sanders,M., Whitehead,S., Quail,M.A., Dougan,G., Parkh, "The complete genome sequence and comparative genome analysis of the high pathogenicity Yersinia enterocolitica strain 8081", PLoS Genet. 2 (12), E206 (2006) PUBMED 17173484.


DNA sequence :
ATGGATAACTTGCGCTTCTCTTCTGCGCCGACAGCAGATTCCATTGATGCATCGATCGCTCAACACTACCCGGACTGCGA
ACCTGTCGCGGTTATCGGCTACGCCTGCCATTTTCCTGAATCGCCGGATGGCGAAACGTTCTGGCAAAATCTGCTGGAAG
GTCGTGAATGCAGCCGACGCTTTACGCGCGAAGAACTTCTGGCCGTCGGTCTGGATGCCGCCATCATTGACGATCCTCAT
TATGTCAATATCGGTACGGTGTTAGACAACGCCGACTGCTTCGACGCCACCCTGTTTGGCTATTCGCGACAGGAAGCGGA
ATCGATGGACCCACAGCAGCGCCTGTTTTTGCAGGCGGTCTGGCATGCGCTGGAACATGCCGGTTATGCCCCCGGCGCCG
TCCCCCATAAGACCGGCGTTTTCGCCTCTTCCCGGATGAGTACCTACCCCGGTCGCGAAGCATTGAACGTGACAGAAGTC
GCGCAGGTAAAAGGTCTGCAATCTCTGATGGGCAATGATAAAGACTATATTGCCACCCGCGCCGCGTACAAACTCAACCT
GCACGGCCCAGCGTTATCGGTACAGACCGCCTGCTCCAGTTCGCTGGTTGCCGTGCATCTGGCCTGTGAAAGCCTGCGCG
CAGGCGAATCCGATATGGCGGTTGCCGGCGGCGTGGCGCTCTCTTTCCCCCAGCAGGCGGGCTACCGCTACCAGCCCGGA
ATGATTTTCTCTCCCGATGGTCACTGTCGTCCTTTTGACGCCTCGGCTGAGGGCACCTGGGCCGGTAACGGTCTCGGCTG
CGTGGTACTGCGTCGCCTGAGAGACGCGCTGCTGTCAGGCGATCCGATTATCTCGGTGATCCTCTCCAGCGCGGTCAACA
ACGACGGCAACAGAAAGGTCGGCTATACCGCCCCTTCCGTCGCAGGGCAACAGGCAGTCATCGAAGAGGCGTTAATGCTG
GCGGCCATCGACGACAGGCAGGTAGGTTACATTGAAACCCACGGCACCGGCACACCGCTGGGCGACGCGATTGAAATTGA
AGCGTTACGCAACGTCTATGCGCCTCGCCCGCAGGATCAGCGCTGTGCGCTCGGTTCCGTGAAAAGTAATATGGGCCATC
TGGATACCGCAGCGGGCATTGCCGGACTGCTGAAAACCGTTCTGGCAGTCAGCCGCGGGCAAATTCCACCCTTACTGAAT
TTTCATACCCCCAACCCGGCGCTGAAACTTGAAGAGAGTCCCTTTACCATACCGATGTCGGCGCAGGCGTGGCAGGATGA
AATGCGCTATGCGGGCGTCTCCTCCTTTGGTATTGGCGGCACCAACTGCCATATGATCGTCGCCTCGCTGCCCGACGCGC
TCAACGCGCGCCTCCCCAATACGGATAGCGGCAGAAAAAGTACCGCGCTGCTGCTCAGCGCCGCCAGCGACAGCGCGTTG
CGGCGGCTGGCGACGGATTATGCCGGGGCGCTGAGAGAGAATACGGATGCCAGCGATCTGGCCTTCACGGCCCTGCACGC
GCGCCGTCTCGATCTTCCCTTTCGCCTGGCGGCGCCATTAAACCGTGAAACCGCCGCGGCGCTCAGCGACTGGGCCGGTG
AGAAATCGGGGGCGCTGGTTTATAGCGGCCACGGCGCCAGCGGCAAGCAGGTGTGGCTGTTTACCGGCCAGGGCTCGCAC
TGGCGCACTATGGGTCAAACGATGTACCAGCACTCAACGGCGTTTGCCGACATGCTGGATCGCTGTTTTTCCGCCTGTAG
CGAAATGCTCACGCCGTCACTGCGCGAAGCGATGTTTAACCCCGATTCGGCGCAGCTGGACAATATGGCCTGGGCGCAGC
CGGCGATTGTCGCGTTTGAAATCGCGATGGCGGCGCACTGGCACGCTGAAGGACTGAAGCCAGACTTCGCCATTGGGCAT
TCCGTCGGTGAATTTGCCGCTGCCGTCGTCTGCGGACACTATACGATTGAACAGGTCATGCCACTGGTTTGTCGACGCGG
CGCACTGATGCAGCAGTGCGCAAGCGGCGCAATGGTGGCGGTATTTGCAGACGAAGACACGCTGATGCCGCTGGCTCGCC
AGTTTGAGCTGGATCTCGCCGCCAACAACGGTACGCAACATACGGTATTTTCCGGGCCGGAAGCCCGTCTCGCGGTATTT
TGCACCACGCTCTCGCAGCATAACATTAACTATCGTCGCCTGAGCGTAACCGGCGCGGCGCACTCCGCTTTACTGGAACC
GATACTCGATCGGTTCCAGGACGCCTGCGCGGGGCTGCACGCGGAGCCGGGGCAAATACCGATTATTTCCACGCTCACCG
CCGACGTCATTGATGAGTCAACGCTCAACCAGGCGGATTACTGGCGCCGACACATGCGCCAGCCGGTGCGTTTTATCCAG
AGTATTCAGATGGCGCATCAGCTCGGCGCCCGCGTTTTTCTGGAGATGGGGCCCGATGCCCAGTTGGTTGCTTCCGGGCA
GCGCGAATACCGCGATAACGCATACTGGATAGCCAGCGCCCGGCGTAACAAAGAGGCGAGCGATGTCCTCAATCAGGCCC
TGCTCCAGCTTTACGCTGCCGGTGTCGCCTTACCGTGGACCGACCTACTGGCGGGTGATGGACAACGTATCGCTGCGCCA
TGTTATCCGTTTGATACTGAGCGTTACTGGAAAGAGCGCGTCTCCCCGGCCTGCGAACCTGCCGACGCAGCGCTGTCTGC
CGGGCTGGAGGTGGCGAGTCGCGCCGCGACAGCGCTCGATCTCCCCCGTCTGGAAGCGCTTAAACAGTGCGCCACGCGAC
TGCACGCCATCTACGTCGATCAACTGGTACAACGCTGTACCGGCGATGCCATTGAAAACGGCGTGGACGCCATAACCATC
ATACGCCGTGGACGTCTGCTGCCCCGCTACCAGCAGCTACTCCAGCGCCTGCTGAATAACTGCGTGGTCGACGGCGATTA
CCGCTGCACCGACGGGCGATACGTCCGCGCCCACCCCATTGAACATCAACAGCGGGAATCACTGCTGACGGAACTTGCCG
GTTATTGTGAAGGTTTTCAGGCTATTCCCGACACCATCGCCCGTGCCGGCGATCGGTTATATGACATGATGAGCGGCGCG
GAAGAACCGGTGGCGATTATCTTCCCGCAAAGCGCCTCCGACGGCGTGGAAGTGCTGTATCAGGAATTCAGCTTTGGCCG
CTATTTCAACCAAATCGCCGCCGGGGTATTACGCGGCATTGTCCAGACGCGTCAGCCCCGCCAGTCGTTGCGTATTCTTG
AAGTTGGCGGCGGAACCGGCGGCACCACCGCGTGGCTGCTGCCGGAACTCAACGGCGTTCCGGCGCTGGAGTACCACTTC
ACCGATATCTCAGCGCTGTTCACCCGCCGCGCCCAGCAGAAATTCGCCGACTATGATTTTGTGAAGTATAGCGAGCTGGA
TCTCGAAAAAGAGGCGCAGTCTCAGGGTTTCCAGGCACAGTCTTACGATCTTATCGTGGCGGCGAACGTGATTCACGCCA
CCCGCCATATTGGCCGCACGCTCGATAATCTGCGCCCCCTGCTCAAGCCGGGCGGGCGCCTGCTGATGCGCGAAATCACC
CAGCCAATGCGTCTGTTTGACTTCGTTTTCGGCCCGCTGGTTCTTCCGCTACAGGATCTCGACGCCCGCGAAGGTGAATT
ATTCCTCACCACCGCTCAGTGGCAGCAACAGTGCCGCCACGCCGGATTCAGCAAAGTGGCATGGCTACCGCAGGATGGCA
GCCCAACCGCCGGGATGAGCGAACATATCATTCTCGCCACGCTGCCCGGTCAGGCGGTTAGTGCCGTAACATTCACCGCG
CCATCAGAACCCGTGTTGGGGCAGGCGCTGACGGATAACGGTGATTATCTCGCCGACTGGTCTGATTGCGCAGGTCAGCC
CGAACGGTTTAACGCCCGCTGGCAGGAGGCCTGGCGTCTGCTCTCACAGCGTCATGGCGACGCTCTGCCTGTGGAACCGC
CCCTCGTCGCCGCCCCGGAGTGGCTGGGGGAGGTTCGCTTAAGCTGGCAAAACGAAGCCTTTTCCCGCGGTCAGATGCAC
GTTGAAGCCCGTCATCCTGATGGCGAGTGGCTGCCGCTATCGCCCGCCGCGCCTCTTCCTGCGCCGCAGACGCATTATCA
ATGGCGCTGGACGCCCCTCAACGTCGCCAGCGTTGACCATCCGCTTACCTTCAGCGCCGGTACGCTTGCGCGCAGCGACG
AGCTGGCGCAATACGGCATCATTCACGATCCGCACGCCTCTTCGCGACTGATGATTGTTGAGGAGAGCGAGGATACGCTG
GCCTTAGCGGAGAAAGTGATAGCAGCGCTCATCGCCAGCGCAGCCGGATTGATTGTGGTCACTCGCCGCGCGTGGCGAGT
CGAGGAAAATGAGGCACTCTCTGCGTCCCATCACGCGCTATGGGCCTTGCTTCGCGTCGCGGCCAACGAACAGCCGGAAC
GGTTGATTGCCGCCATCGATCTCGCCGAAAACACCCCGTGGGAAACACTGCATCAAGGGTTGAGCGCAGTCTCACTATCA
CAGCGCTGGCTCGCCGCGCGGGGTAACACCCTCTGGCTCCCTTCACTGGCGCTCAATACGGGATGCGCCGCTGAATTACC
AGCAAACGTGTTTACCGGCGATAACCGCTGGCATCTGGTGACCGGAGCGTTTGGCGGATTAGGCCGCCTTGCCGTGAACT
GGCTCAGAGAAAAAGGCGCGCGACGCATCGCCCTGCTGGCGCAGCGCGTGGATGAGTCATGGCTACGCGACGTGGAGGGC
GGGCAGACGCGCGTCTGCCGTTGTGATGTGGGCGATGCCGGGCAACTGGCCACGGTTCTTGACGATCTGGCGGCCAACGG
CGGCATTGCCGGAGCGATTCATGCCGCTGGCGTATTGGCTGACGCGCCCTTGCAGGAGCTTGATGACCACCAGCTGGCCA
CCGTTTTTGCGGTAAAAGCGCAGGCGGCAAACCAGCTGTTGCAAACCCTGCGCAACCACGACGGACGCTATCTTATTCTC
TACTCTTCCGCTGCCGCCACCCTCGGCGCGCCGGGTCAGAGCGCCCATGCGCTGGCCTGCGGCTACCTGGACGGGCTGGC
ACAGCAGTTTTCCACCCTTGATGCGCCGAAAACGCTCTCTGTCGCCTGGGGCGCATGGGGAGAAAGCGGTCGGGCGGCCA
CGCCGGAAATGCTGGCGACGCTCGCCAGCCGTGGTATGGGCGCGTTAAGCGATGCCGAAGGCTGCTGGCACCTGGAACAG
GCGGTGATGCGCGGCGCCCCGTGGCGACTGGCGATGCGCGTTTTTACCGACAAAATGCCCCCGTTACAACAGGCTCTGTT
TAACATCAGCGCCACAGAAAAAGCCGCAACGCCTGTCATTCCTCCTGCTGATGACAACGCCTTTAACGGCAGCCTGAGCG
ATGAAACGGCGGTGATAGCATGGCTGAAAAAGCGGATTGCGGTTCAGCTAAGGCTGAGCGATCCGGCGTCACTGCGCCCA
AACCAGGATCTGTTGCAACTCGGCATGGACTCGCTGCTCTTCCTTGAACTCAGTAGCGATATTCAGCACTACCTGGGTGT
ACGCATCAATGCGGAACGGGCGTGGCAGGATCTGTCTCCTCATGGACTCACGCAGCTTATCTGTTCTAAGCCAGAGACGA
CGCCTGCCGCTTCGCAGCCGGAAGTGTTGCAGCACGACGCCGACAAGCGTTATGCGCCCTTCCCTTTGACGCCCATTCAG
CACGCCTACTGGCTGGGGCGAACCCACCTCATTGGCTATGGCGGCGTCGCCTGTCACGTCCTGTTTGAGTGGGATAAACG
CCACGATGAGTTCGATCTCGCCATACTGGAGAAAGCATGGAACCAGCTTATCGCACGCCACGATATGTTGCGCATGGTGG
TTGATGCCGACGGGCAGCAGCGAGTCCTGGGGACAACGCCGGAGTATCACATCCAGCGTGACGATCTGCGCGCGCTTTCC
CCGGAAGAACAACGCATCGCGCTGGAAAAACGGCGGCATGAAATGAGCTATCGCGTTTTGCCTGCCGACCAGTGGCCTCT
TTTTGAGCTGGTGGTCAGCGAAATCGACGATTGCCATTACCGCCTGCACATGAACCTCGACCTTTTGCAGTTCGATGTGC
AGAGTTTTAAAGTCATGATGGACGATCTGGCGCAGGTCTGGCGCGGTGAAACGCTGGCGCCGCTCGCTATTACCTTCCGT
GATTATGTGATGGCTGAACAGGCGCGCCGACAGACATCGGCATGGCACGATGCCTGGGATTACTGGCAGGGAAAACTGCC
GCAACTGCCCTTAGCGCCAGAGCTGCCGGTGGTTGAGACGCGCCCGGAAACGCCACACTTCACCACCTTCAAATCGACGA
TCGGCAAGACAGAATGGCAGGCCGTGAAACAGCGCTGGCAGCAGCAAGGCGTCACACCGTCTGCCGCGCTGCTCACGCTG
TTTGCCGCCACCCTTGAGCGCTGGAGCCGCACCACGGCATTTACGCTGAACCTGACGTTCTTCAATCGCCAGCCGATCCA
TCCGCAAATCAACCAGTTGATTGGTGATTTTACCTCCGTCACGCTGGTTGATTTTAACTTCTCAACGCTGGTGACGTTGC
AAGAGCAGATGCAACAGACCCAACAGCGCCTCTGGCAAAACATGGCGCACAGTGAAATGAACGGTGTTGAGGTGATCCGT
GAGCTGGGCCGCCTGCGCGGGTCACAACGTCAACCGCTGATGCCGGTGGTGTTTACCAGTATGCTGGGGATGACGCTGGA
AGGCATGACTATCGATCAGGCGATGAGCCATCTGTTCGGCGAACCCTGCTATGTATTCACGCAAACACCGCAGGTCTGGC
TGGATCATCAGGTCATGGAGAGCGACGGCGAGTTGATGTTTAGCTGGTACTGCATGGACAACGTGCTGGAACCCGGCGCT
GCCGAGGCGATGTTTAATGACTACTGCGCCATCCTGCAAGCCGTCATCGCCGCCCCTGAAAGCCTGAAGACTCTCGCCAG
CGGTATCGCCGGGCACATTCCCCGTCGACGCTGGCCGCTGAACGCACAGGCGGACTACGACCTGCGGGATATTGAGCAGG
CGACGCTCGAATACCCCGGCATCCGGCAGGCCAGAGCGGAAATAACCGAACAGGGCGCGTTGACGCTGGATATCGTAATG
GCCGACGATCCGTCGCCATCAGCGGCGACGCCTGATGAGCACGAACTTACCCAACTGGCGCTGTCGTTGCCTGAGCAGGC
GCAGCTTGATGAGCTGGAGGCGACCTGGCGCTGGCTGGAGGCGCGTGCGCTACAGGGGATCGCGGCTACGCTAAATCGTC
ACGGCCTGTTTACCACGCCGGAGATCGCCCATCGCTTTAGCGCAATAGTACAGGCGCTGTCCGCGCAAGCGTCTCACCAG
CGTCTGCTGCGCCAGTGGCTACAGTGTCTGACGGAAAGAGCGTGGTTAATCCGCGAGGGTGAAAGCTGGCGCTGCCGCGT
TCCGCTCAGCGAGATTCCTGAGCCTCAGGAAGCGTGCCCGCCAAGCCAATGGAGCCAGGCGCTGGCGCAGTATCTGGAAA
CCTGCATCGCCCGGCACGACGCCCTCTTCTCCGGGCAGTGTTCACCGCTGGAATTGCTGTTCAACGAGCAGCATCGTGTG
ACCGACGCGCTGTATCGCGACAACCCCGCCAGCGCCTGTCTGAATCGCTATACCGCGCAGATTGCCGCCTTGTGCGGCGC
AGAACGGATTCTGGAGGTTGGCGCCGGAACCGCAGCCACTACCGCGCCGGTGCTGAAGGCCACGCGGAACACGCGGAAGT
CGTACCACTTCACGGACGTCTCCGCGCAGTTCCTCAATGACGCCAGAGCCCGTTTCCATGATGAATCGCGGGTGTCTTAT
GCCTTGTTCGACATCAACCAGCCGCTGGATTTCACCGCCCACCCGGAGGCGGGTTACGACCTGATCGTTGCCGTCAATGT
GCTCCACGACGCCAGCCATGTCGTCCAGACGTTGCGCAGATTAAAACTGTTGCTGAAAGCCGGCGGACGTTTGCTGATCG
TTGAAGCGACGGAGCGAAACAGCGTATTCCAGCTGGCAAGCGTGGGCTTTATTGAGGGATTAAGCGGATACCGCGATTTC
CGCCGCCGGGATGAGAAACCGATGCTCACCCGCTCCGCATGGCAGGAGGTTCTCGTTCAGGCCGGTTTTGCAAACGAGCT
GGCGTGGCCCGCGCAGGAATCGTCGCCGCTGCGCCAGCATCTGCTGGTGGCGCATTCGCCTGGCGTAAATCGCCCGGATA
AAGAAGCCGTGAGCCGCTATTTACAGCAGCGCTTTGGCACCGGTCTTCCCGTTTTACAGATCCGGCAAAGAGAAGCGTTA
TTTACGCCGCTGCATGCCCCGTCTGATGCGCTGATTGAGCCAGCCAAACCCACGCCAGTTGCCGGGGGGAATCCGGCGCT
GGAAAAACAGGTGGCTGAACTCTGGCAATCGCTGCTGTCTCGCCCCGTGGCAAGGCATCACGACTTTTTCGAACTGGGCG
GCGACAGCCTGATGGCGACAAGGATGGTCGCGCAGCTAAACCGGAGAGGGATTGCCAGGGCTAACCTTCAGGATCTGTTC
AGCCATTCGACGCTGAGCGACTTCTGCGCCCATCTACAGGCGGCTACGTCAGGAGAGGACAACCCGATTCCCCTTTGCCA
GGGCGACGGCGACGAAACCCTGTTCGTCTTCCACGCTTCGGACGGCGATATCAGCGCCTGGCTGCCGCTCGCCAGCGCGC
TGAACAGGCGTGTTTTCGGCCTGCAAGCAAAATCGCCGCAGCGCTTTGCCACGCTTGACCAGATGATCGATGAGTATGTC
GGGTGCATCCGTCGCCAGCAGCCTCACGGCCCTTATGTACTGGCGGGTTGGTCGTATGGCGCGTTTCTCGCGGCGGGCGC
CGCACAGCGCCTGTACGCCAAAGGCGAGCAGGTTAGGATCGCGTTAATCGATCCCGTGTGCCGACAGGATTTCTGTTGCG
AAAACCGGGCGGCCCTGCTGCGCCTGTTAGCCGAAGGACAAACGCCTCTGGCACTGCCCGAACATTTCGACCAGCAGACG
CCCGACAGCCAGCTTGCCGACTTTATCAGCCTCGCTAAAACGGCCGGTATGGTGTCGCAAAACCTGACGCTGCAAGCGGC
AGAAACGTGGCTCGACAACATCGCGCATCTGCTGCGTTTACTGACTGAGCATACGCCGGGCGAAAACGTTCCTGTCCCCT
GTCTCATGGTGTATGCCGCCGGGAGACCCGCGCGCTGGACGCCAGCAGAAACCGAGTGGCAGGGCTGGATAAACAACGCC
GACGACGCTGTGATTGAAGCCAGCCACTGGCAAATCATGATGGAAGCCCCTCACGTTCAGGTTTGTGCGCAACACATTAC
GCGCTGGCTTTGCGCAACCTCAACGCAACCGGAGAACACGTTATGA

Protein sequence :
MDNLRFSSAPTADSIDASIAQHYPDCEPVAVIGYACHFPESPDGETFWQNLLEGRECSRRFTREELLAVGLDAAIIDDPH
YVNIGTVLDNADCFDATLFGYSRQEAESMDPQQRLFLQAVWHALEHAGYAPGAVPHKTGVFASSRMSTYPGREALNVTEV
AQVKGLQSLMGNDKDYIATRAAYKLNLHGPALSVQTACSSSLVAVHLACESLRAGESDMAVAGGVALSFPQQAGYRYQPG
MIFSPDGHCRPFDASAEGTWAGNGLGCVVLRRLRDALLSGDPIISVILSSAVNNDGNRKVGYTAPSVAGQQAVIEEALML
AAIDDRQVGYIETHGTGTPLGDAIEIEALRNVYAPRPQDQRCALGSVKSNMGHLDTAAGIAGLLKTVLAVSRGQIPPLLN
FHTPNPALKLEESPFTIPMSAQAWQDEMRYAGVSSFGIGGTNCHMIVASLPDALNARLPNTDSGRKSTALLLSAASDSAL
RRLATDYAGALRENTDASDLAFTALHARRLDLPFRLAAPLNRETAAALSDWAGEKSGALVYSGHGASGKQVWLFTGQGSH
WRTMGQTMYQHSTAFADMLDRCFSACSEMLTPSLREAMFNPDSAQLDNMAWAQPAIVAFEIAMAAHWHAEGLKPDFAIGH
SVGEFAAAVVCGHYTIEQVMPLVCRRGALMQQCASGAMVAVFADEDTLMPLARQFELDLAANNGTQHTVFSGPEARLAVF
CTTLSQHNINYRRLSVTGAAHSALLEPILDRFQDACAGLHAEPGQIPIISTLTADVIDESTLNQADYWRRHMRQPVRFIQ
SIQMAHQLGARVFLEMGPDAQLVASGQREYRDNAYWIASARRNKEASDVLNQALLQLYAAGVALPWTDLLAGDGQRIAAP
CYPFDTERYWKERVSPACEPADAALSAGLEVASRAATALDLPRLEALKQCATRLHAIYVDQLVQRCTGDAIENGVDAITI
IRRGRLLPRYQQLLQRLLNNCVVDGDYRCTDGRYVRAHPIEHQQRESLLTELAGYCEGFQAIPDTIARAGDRLYDMMSGA
EEPVAIIFPQSASDGVEVLYQEFSFGRYFNQIAAGVLRGIVQTRQPRQSLRILEVGGGTGGTTAWLLPELNGVPALEYHF
TDISALFTRRAQQKFADYDFVKYSELDLEKEAQSQGFQAQSYDLIVAANVIHATRHIGRTLDNLRPLLKPGGRLLMREIT
QPMRLFDFVFGPLVLPLQDLDAREGELFLTTAQWQQQCRHAGFSKVAWLPQDGSPTAGMSEHIILATLPGQAVSAVTFTA
PSEPVLGQALTDNGDYLADWSDCAGQPERFNARWQEAWRLLSQRHGDALPVEPPLVAAPEWLGEVRLSWQNEAFSRGQMH
VEARHPDGEWLPLSPAAPLPAPQTHYQWRWTPLNVASVDHPLTFSAGTLARSDELAQYGIIHDPHASSRLMIVEESEDTL
ALAEKVIAALIASAAGLIVVTRRAWRVEENEALSASHHALWALLRVAANEQPERLIAAIDLAENTPWETLHQGLSAVSLS
QRWLAARGNTLWLPSLALNTGCAAELPANVFTGDNRWHLVTGAFGGLGRLAVNWLREKGARRIALLAQRVDESWLRDVEG
GQTRVCRCDVGDAGQLATVLDDLAANGGIAGAIHAAGVLADAPLQELDDHQLATVFAVKAQAANQLLQTLRNHDGRYLIL
YSSAAATLGAPGQSAHALACGYLDGLAQQFSTLDAPKTLSVAWGAWGESGRAATPEMLATLASRGMGALSDAEGCWHLEQ
AVMRGAPWRLAMRVFTDKMPPLQQALFNISATEKAATPVIPPADDNAFNGSLSDETAVIAWLKKRIAVQLRLSDPASLRP
NQDLLQLGMDSLLFLELSSDIQHYLGVRINAERAWQDLSPHGLTQLICSKPETTPAASQPEVLQHDADKRYAPFPLTPIQ
HAYWLGRTHLIGYGGVACHVLFEWDKRHDEFDLAILEKAWNQLIARHDMLRMVVDADGQQRVLGTTPEYHIQRDDLRALS
PEEQRIALEKRRHEMSYRVLPADQWPLFELVVSEIDDCHYRLHMNLDLLQFDVQSFKVMMDDLAQVWRGETLAPLAITFR
DYVMAEQARRQTSAWHDAWDYWQGKLPQLPLAPELPVVETRPETPHFTTFKSTIGKTEWQAVKQRWQQQGVTPSAALLTL
FAATLERWSRTTAFTLNLTFFNRQPIHPQINQLIGDFTSVTLVDFNFSTLVTLQEQMQQTQQRLWQNMAHSEMNGVEVIR
ELGRLRGSQRQPLMPVVFTSMLGMTLEGMTIDQAMSHLFGEPCYVFTQTPQVWLDHQVMESDGELMFSWYCMDNVLEPGA
AEAMFNDYCAILQAVIAAPESLKTLASGIAGHIPRRRWPLNAQADYDLRDIEQATLEYPGIRQARAEITEQGALTLDIVM
ADDPSPSAATPDEHELTQLALSLPEQAQLDELEATWRWLEARALQGIAATLNRHGLFTTPEIAHRFSAIVQALSAQASHQ
RLLRQWLQCLTERAWLIREGESWRCRVPLSEIPEPQEACPPSQWSQALAQYLETCIARHDALFSGQCSPLELLFNEQHRV
TDALYRDNPASACLNRYTAQIAALCGAERILEVGAGTAATTAPVLKATRNTRKSYHFTDVSAQFLNDARARFHDESRVSY
ALFDINQPLDFTAHPEAGYDLIVAVNVLHDASHVVQTLRRLKLLLKAGGRLLIVEATERNSVFQLASVGFIEGLSGYRDF
RRRDEKPMLTRSAWQEVLVQAGFANELAWPAQESSPLRQHLLVAHSPGVNRPDKEAVSRYLQQRFGTGLPVLQIRQREAL
FTPLHAPSDALIEPAKPTPVAGGNPALEKQVAELWQSLLSRPVARHHDFFELGGDSLMATRMVAQLNRRGIARANLQDLF
SHSTLSDFCAHLQAATSGEDNPIPLCQGDGDETLFVFHASDGDISAWLPLASALNRRVFGLQAKSPQRFATLDQMIDEYV
GCIRRQQPHGPYVLAGWSYGAFLAAGAAQRLYAKGEQVRIALIDPVCRQDFCCENRAALLRLLAEGQTPLALPEHFDQQT
PDSQLADFISLAKTAGMVSQNLTLQAAETWLDNIAHLLRLLTEHTPGENVPVPCLMVYAAGRPARWTPAETEWQGWINNA
DDAVIEASHWQIMMEAPHVQVCAQHITRWLCATSTQPENTL