PAI Gene Information


Name : HH0275 (HH0275)
Accession : NP_859806.1
PAI name : HHGI1
PAI accession : NC_004917_P1
Strain : Helicobacter hepaticus ATCC 51449
Virulence or Resistance: Not determined
Product : hypothetical protein
Function : -
Note : virD4-like protein
Homologs in the searched genomes :   1 hits    ( 1 protein-level )  
Publication :
    -Drescher,B. and Suerbaum,S., "Direct Submission", Submitted (27-MAY-2003) Institute of Hygiene and Microbiology, University of Wuerzburg, Josef Schneider Str. 2, Wuerzburg D-97080, Germany.

    -Suerbaum,S., Josenhans,C., Sterzenbach,T., Drescher,B., Brandt,P., Bell,M., Droge,M., Fartmann,B., Fischer,H.P., Ge,Z., Horster,A., Holland,R., Klein,K., Konig,J., Macko,L., Mendz,G.L., Nyakatura,G., Schauer,D.B., Shen,Z., Weber,J., Frosch,M. and Fox,J.G., "The complete genome sequence of the carcinogenic bacterium Helicobacter hepaticus", Proc. Natl. Acad. Sci. U.S.A. 100 (13), 7901-7906 (2003) PUBMED 12810954.

    -Suerbaum,S., Josenhans,C., Sterzenbach,T., Drescher,B., Brandt,P., Bell,M., Droge,M., Fartmann,B., Fischer,H.P., Ge,Z., Horster,A., Holland,R., Klein,K., Konig,J., Macko,L., Mendz,G.L., Nyakatura,G., Schauer,D.B., Shen,Z., Weber,J., Frosch,M. and Fox,J.G., "Direct Submission", Submitted (26-JUN-2003) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
ATGTTTTGCGGAATTATATTAAGATGTAAGTCAATAAATCAAACAAGCACACATCATTCCACAGACAATGTAACCTTTAA
GGGCGGAGAAGCAGCAACACCCTCTTGGGAGATGTTCCTTAAGCAATTCATATTTCTTTGCATTATAATGTTTGTTTTTC
TTAATCCTGCATTTAAAGCAACAGTTACTATAGAGGACGAGCGATTTTATTATGCAGCACAAACAGGCGGCACTGCGAGT
AAAGTTCGTGTAGATAATGTCCCCGTAATTCTCGCCTTTGGTGCTTCACTTACATCAACTTTTGGGAAAGGGCTTGTTCA
AACGGTAGATTCTGCATTTGGTAGCATACAAGGAGCAAGATTCTCAGATATAGGATTCCTTGAAAACTTTGCTCAGTTGC
GTGAGGACAGGGCAGCACCAAAATTTGCAAAGACAGAAAAAGATTTTGTAGAATATTTCCGCACTTTTGTTCAAAAATGT
GCCTTGCGTTATCAAGGTATATCAGGTGCTGCCATTGAAGCCGTTGAGGTAGGGAAAACTTCTTTGCAGAATCTTAATCC
AAGGATTTGGCAACCTCAATTAGAGGCAACACTCTATGTTAATGGATATTCCTGTGAGGAATATTACGATAAAATGATGG
CTGAATTTAATAAGATTGAAAAACAATGGGAAAATAGAACTATCAAAGCTATGGGGCTTAATCCTGCTACTGCAGCCCCT
GCGCTCAATGCTATGGCAAATGCTATGGTAGCTGCTCCTAATAATGTATCTGCACAGGGCGATAATATGAATCCTTTTGT
GCAGTTTAAACTAAGAGCTTCGCTTGATAAGATTGTAGGCGAGGCAGTGTATGCAGATAGCTTAGGGATTCAAGGTGCTT
ATGGGCAGTATGTTAATCAATCAGTCAAAGACGCTAGAGCTAAGCTTACAGATATAAGCAGTCAATCTAAGTTTATATTC
TCTCTCAAAAGCGTTCCGCAAGCCTTGCACTTTCTTTTAGCACTATTATATGCAGCATTTCCATTTATGCTTGCGTGGAT
TGCCATTCAAGGCTATCCTAGTGGATTAAAAGTGCTCTACTATTTCGTGGGTAGCTTATTCTCATTTGAACTTGTTAAAA
TGTCTTGTGCTCTTACACACGGACTTATTACTGAATATACTTCAAGGAATGCGGCTGAGACATTGCACGGAATGACTGCG
GGAGCAATTAATATGAATGATGTGATTCACGGCACTGCATATTATGAATATATGGCTACACAATCAGAAATTGCTTCTAC
AATAGCAGCAGGGGCAATATTTATTATCCCCATTGTTGTTGCCACAGGACAAGCCAAACTCCTTGCTGCACTCGCACAAC
AAGCCTCTGCCGCTCCTCAAATGGGTGGTATTGAATCATCAGGGGCAGCAATGGCAAAAGCCAATGACGCAAACAATAGA
GATTTAGAAACGGTATCACGAGGCATAGTGTCTGCTTCTGTTATGAAGGGCATTGAGGAAAACAATGCTGCTATCGCTGC
ACTTCACAAAATGGATAATTATAATGAGTTCAACGCAGGACAAGTAGCACAACAAATGAATCAAATTGGCTCTACGGCAG
GATATGGCAAAGAAACGATGTTTAGAGGCGCAGATGGATTCGCAAAAGTTTCAGATGGCGGTAAATTCCAAGGCGTTCAA
TCAGCAGCTACGGCAAAGGGACTTGGTGAAGCCTATACTGATACACACGACTTTAGAGACAGAACAGGTATCTCGGGTAA
TGCCCTTGCTAGAGGACTAGGTAATGACTCTGCTGTAAAAACAATCGCTTCTGCTAGAGGACTTATGAGTGGTGATTTCT
ATACACGAGAGGGAGACCTCTCGGGCGGTAGAGAAGGTAGTTTCTACGCACAAGGCTTAGAAACTAATGCTCGTATAGCT
GCTAATCAAGCCGCAGGTGTAGGAAAGCTTGGAGAAACTACTAGAGAACAAATGAATGCTGTTCAATATGGTGCTGAAGC
TCAAGTCCTTAATCAAATAGCACAGGGTAGAAGTTTGCAAAGGACTTATGGACACGGAGCAGGATTCCAAAGGAGTTTTG
ATGCTATGGCTGAAACAAGTGCTGATATACAAAATATTACTGCAAGAGAAACTGCTGCAAGTGCAAGGGAGCATTTTAAT
GGTATTAGTGGTGGCAAAGGTGAGGATAGTTTCTCAAGTATAACAAAATCTAATGCAGATTTGAGACACGATAGCACCAT
AGGAACTGCAAAAGGTTATGATGGAGGTGTTGGGTTTGATGTGGCAAATGGAAGTGATGGCAAAGTATTTCAAAGAGCAG
CAGAGATTCAAACATATGCTCAAGGAATGACGCAAAGTCAAACCGAAAAACAAATAGGACAAGGCATTAAAGACTCATTT
GGAAATACTTTCAATGCAACAGAAGGACAAACATTTGATAAAAAAGCAGAAGACATAAATAAAAGCCACTCCCAAGCACT
GAAACAAGTTTATGATGGACATTCTAGCTCTATGGGGGCAACAGGTGATATAGGTAATGAAGTAATATCACTTGGAGAAG
CAGCAAAAAAATCTCACACACATCTTGCTAATGAGAGAGCAGCAGCTCAAGAATCGTATGTAAATAATGCTATGAATGCA
CTCGGCAATGATGGATTAAAACAATCTATTATGGCAGAACAAGTTTCAACAATGGCTTCAGCAGGTGTAATGAATTACTC
CTCTATGTCTGCTAATGTTCAAGGAGCACAGGCTTCAGGAATGATGAGTAATGGCATAGTAACACATACAGGTATGCAAT
CAATGATTGCAGATTCTACAATGAGACATCATAATACATTGGCTCTAGGTGATAGATTTGGTAATAGCAATCTCACACAG
CTTACTCAATCATTAAGAGAAGCAGGTATGAGTGAAGTGGGTATCGCCTCTATTGCAAGTGCTTCTAGTCCTGCTGAAAG
AGCAAATAGATTTGCAGCCCTTACACGAAGTTCTACACTTAGTGGAACAATTAATGGGAGAAGCTTCAATCTTGCTGCGG
GTATTGATGGGGGCTATCACGGAAGCTTTGATAGTAGCTTAAATGTTCGAGGTGGATATAATGTAGATATGGGTAGTGTA
AGTGCAGCAACAGGATATGGAATAGGTGGTGTATCAGGTATGAGAGCAATAGGTATGGCAAATACAGGTATGAGCTTTAT
AAGTAATACTGTGAGCAATGTAGGTATGTTTGCTCCTGCTGGTGGTATAGCTAAACTTGGTGCAAATACAGGACTTATTA
ATCCAAATAGTATGTATGCTCAGGGTGTTGGTAAGTTTAAAGAAGCAATGAGAGAAAAGAAAAGTATTGGAGAGAATCAA
GCTACGAGTATGGGGAATGGGGTAAAGGATAATGTAAGAAATGAAATTATGGGGAATGGAACACACTAA

Protein sequence :
MFCGIILRCKSINQTSTHHSTDNVTFKGGEAATPSWEMFLKQFIFLCIIMFVFLNPAFKATVTIEDERFYYAAQTGGTAS
KVRVDNVPVILAFGASLTSTFGKGLVQTVDSAFGSIQGARFSDIGFLENFAQLREDRAAPKFAKTEKDFVEYFRTFVQKC
ALRYQGISGAAIEAVEVGKTSLQNLNPRIWQPQLEATLYVNGYSCEEYYDKMMAEFNKIEKQWENRTIKAMGLNPATAAP
ALNAMANAMVAAPNNVSAQGDNMNPFVQFKLRASLDKIVGEAVYADSLGIQGAYGQYVNQSVKDARAKLTDISSQSKFIF
SLKSVPQALHFLLALLYAAFPFMLAWIAIQGYPSGLKVLYYFVGSLFSFELVKMSCALTHGLITEYTSRNAAETLHGMTA
GAINMNDVIHGTAYYEYMATQSEIASTIAAGAIFIIPIVVATGQAKLLAALAQQASAAPQMGGIESSGAAMAKANDANNR
DLETVSRGIVSASVMKGIEENNAAIAALHKMDNYNEFNAGQVAQQMNQIGSTAGYGKETMFRGADGFAKVSDGGKFQGVQ
SAATAKGLGEAYTDTHDFRDRTGISGNALARGLGNDSAVKTIASARGLMSGDFYTREGDLSGGREGSFYAQGLETNARIA
ANQAAGVGKLGETTREQMNAVQYGAEAQVLNQIAQGRSLQRTYGHGAGFQRSFDAMAETSADIQNITARETAASAREHFN
GISGGKGEDSFSSITKSNADLRHDSTIGTAKGYDGGVGFDVANGSDGKVFQRAAEIQTYAQGMTQSQTEKQIGQGIKDSF
GNTFNATEGQTFDKKAEDINKSHSQALKQVYDGHSSSMGATGDIGNEVISLGEAAKKSHTHLANERAAAQESYVNNAMNA
LGNDGLKQSIMAEQVSTMASAGVMNYSSMSANVQGAQASGMMSNGIVTHTGMQSMIADSTMRHHNTLALGDRFGNSNLTQ
LTQSLREAGMSEVGIASIASASSPAERANRFAALTRSSTLSGTINGRSFNLAAGIDGGYHGSFDSSLNVRGGYNVDMGSV
SAATGYGIGGVSGMRAIGMANTGMSFISNTVSNVGMFAPAGGIAKLGANTGLINPNSMYAQGVGKFKEAMREKKSIGENQ
ATSMGNGVKDNVRNEIMGNGTH