PAI Gene Information


Name : nanH (VPI2_0032)
Accession : AAW31751.2
PAI name : VPI-2
PAI accession : EU272902
Strain : Vibrio cholerae IEC224
Virulence or Resistance : Not determined
Product : neuraminidase
Function : -
Note : sialidase precursor; energy metabolism; NANase; similar to AA sequence (same species):INSD:AAW31751.2, protein motif:InterPro:IPR015344, AA sequence similar to TIGR locus VC1784 in Vibrio cholerae strain N16961
Homologs in the searched genomes : 8 hits (8 protein-level)
Publication :
    -Coelho,A., "Direct Submission", Submitted (09-NOV-2007) Genetica, Universidade Federal do Rio de Janeiro, Rua Prof. Rodolpho Rocco, CCS, Bl. A-Ilha do Fundao, Rio de Janeiro, RJ 21949-0, Brazil. Remark: Sequence update by submitter.

    -Figueiredo,S.C., Neves-Borges,A.C. and Coelho,A., "The neuraminidase gene is present in the non-toxigenic Vibrio cholerae Amazonia strain: a different allele in comparison to the pandemic strains", Mem. Inst. Oswaldo Cruz 100 (6), 563-569 (2005) PUBMED 16302067.

    -Figueiredo,S.C.A. and Coelho,A., "Direct Submission", Submitted (14-NOV-2004) Genetica, Universidade Federal do Rio de Janeiro, Rua Prof. Rodolpho Rocco, CCS, Bl. A-Ilha do Fundao, Rio de Janeiro, RJ 21949-0, Brazil.

    -Figueiredo,S.C.A. and Coelho,A., "Direct Submission", Submitted (03-MAR-2005) Genetica, Universidade Federal do Rio de Janeiro, Rua Prof. Rodolpho Rocco, CCS, Bl. A-Ilha do Fundao, Rio de Janeiro, RJ 21949-0, Brazil. Remark: Sequence update by submitter.

    -Figueiredo,S.C.A., Reis,R.C., Goncalves,M.S.M., Beltrao,P.J.M.S.I. and Coelho,A., "The VPI-2 pathogenicity island of Vibrio cholerae Amazonia", Unpublished.


DNA sequence :
TTGTCAATCAAGATGACTTCACAACGAAGAAGAGCATCGATTCACAAGGAAACAGATTCTAATATAAAGGGAGTAGATAT
GCGTTTCAAAAACGTAAAGAAAACCGCTTTAATGCTTGCTATGTTCGGTATGGCGACAAGCTCAAACGCCGCACTTTTTG
ACTATAACGCAACGGGTGACACTGAGTTTGACAGCCCTGCCAAGCAGGGATGGATGCAAGACAACACGAATAATGGCAGT
GGTGTTTTAACCAATGCAGATGGAATGCCCGCTTGGTTGGTGCAAGGTAATGGAGGGAGAGCTCAATGGACATATTCTCT
CTCTACTAATCAACATGCCCAAGCATCAAGTTTCGGTTGGCGAATGACGACAGAAATGAAAGTGCTCAGTGGTGGAATGA
TCACAAACTACTACGCCAACGGCACTCAGCGTGTCTTACCCATCATTTCATTAGACAGCAGTGGTAACTTAGTTGTTGAG
TTTGAAGGGCAAACTGGACGTACCATTTTGGCAACTGGCACAGCAGCAACGGAATATCATAAATTTGAATTGGTATTCCT
TCCTGGAAGTAACCCATCCGCTAGCTTTTACTTTGATGGCAAACTCATTCGAGACAACATCCAGCCAACGGCATCAAAAC
AAAATATGATCGTATGGGGAAATGGCTCATCAAATACGGATGGTGTCGCCGCCTATCGTGATATTAAGTTTGAAATTCAA
GGCGACGTCATCTTCAGAGGCCCAGACCGTATACCATCCATCGTGGCAAGTAGTGTAACACCAGGGGTTGTAACCGCGTT
TGCAGAGAAACGTGTGGGCGGTGGCGACCCCGGTGCTCTGAGTAATACCAATGACATCATCACTCGTACCTCACGAGATG
GCGGCATAACTTGGGATACAGAGCTCAACCTCACTGAGCAAATCAATGTCAGTGATGAATTTGATTTCTCGGATCCTCGG
CCTATCTATGACCCTTCCACCAACACAGTTCTTGTCTCTTACGCTCGATGGCCGACCGATGCCGCTCAAAACGGAGATCG
AATAAAACCATGGATGCCAAACGGTATTTTTTACAGCGTCTATGATGTTGCATCAGGGAATTGGCGAGCGCCTATCGATG
TTACCGATCAGGTGAAAGAACGCAGTTTCCAGATCGCAGGTTGGGGTGGTTCAGAACTGTATCGCCGAAATACCAACCTA
AATAGCCAGCAAGACTGGCAATCAAACGCTAAGATCCGAATTGTTGATGGCGCTGCAAACCAGATACAAGTTGCCGATGG
TGGCCGAAAATATGTTTTCACACTGAGTATTGATGAATCAGGCAGTCTAGTCGCTAATCTAAATGGTGTTAGTGATCCAA
TTATCCTGCAATCTGAACGCGCGAAGGTACATTCTTTCCATGACTACGAACTTCAATATTCGGCGTTAAACCGCAGCACA
ACGTTATTCGTGGATGGTCAGGCAATCACCACTTGGACTGGCGAAGTATCGCAGGAGAACAACATTCAGTTTGGTAATGC
GGATGCCCAAATTGATGGCAGACTGCATGTGCAAAACATTGCTCTCACACAACAAGGCCAAAACCTCGTGGAGTTTGATG
CTTTCTATTTGGCACAACAAACCCCTGAAGTAGAGAAAGACCTTGAAAAGCTTGGTTGGACAAAAATTAAAACGGGCAAC
ACCATGAGTTTGTATGGGAATGCCAGTGTCAACCCAGGACCGGGTCATGGCATCACCCTTACTCGACAACAAAATATCAG
TGGCAGCCAAAACGGCCGCTTGATCTACCCAGCGATTGTGCTTGATCGTTTCTTCTTGAACGTCATGTCTATTTACAGTG
ATGATGGCGGTTCAAACTGGCAAACCGGTTCAACACTCCCTATCCCTTTTCGCTGGAAGAGTTCGAGTATCCTAGAAACT
CTCGAACCTAGTGAAGCTGATATGGTTGAGCTCCAAAACGGTGATCTACTCCTTACTGCACGCCTTGATTTTAACCAAAT
CGTTAATGGTGTGAACTATAGCCCACGCCAGCAATTTTTGAGTAAAGATGGTGGAATCACGTGGAGCCTACTTGAGGCTA
ACAACGCTAACGTCTTTAGCAATATCAGTACTGGTACCGTTGATGCTTCTATTACTCGGTTCGAGCAAAGTGACGGTAGC
CATTTCTTACTCTTTACTAACCCACAAGGAAACCCTGCGGGGACAAATGGCAGGCAAAATCTAGGCTTATGGTTTAGCTT
CGATGAAGGGGTGACATGGAAAGGACCAATTCAACTTGTTAATGGTGCATCGGCATATTCTGATATTTATCAATTGGATT
CGGAAAATGCGATTGTCATTGTTGAAACGGATAATTCAAATATGCGAATTCTTCGTATGCCTATCACATTGCTAAAACAG
AAGCTGACCTTATCGCAAAACTAA

Protein sequence :
MSIKMTSQRRRASIHKETDSNIKGVDMRFKNVKKTALMLAMFGMATSSNAALFDYNATGDTEFDSPAKQGWMQDNTNNGS
GVLTNADGMPAWLVQGNGGRAQWTYSLSTNQHAQASSFGWRMTTEMKVLSGGMITNYYANGTQRVLPIISLDSSGNLVVE
FEGQTGRTILATGTAATEYHKFELVFLPGSNPSASFYFDGKLIRDNIQPTASKQNMIVWGNGSSNTDGVAAYRDIKFEIQ
GDVIFRGPDRIPSIVASSVTPGVVTAFAEKRVGGGDPGALSNTNDIITRTSRDGGITWDTELNLTEQINVSDEFDFSDPR
PIYDPSTNTVLVSYARWPTDAAQNGDRIKPWMPNGIFYSVYDVASGNWRAPIDVTDQVKERSFQIAGWGGSELYRRNTNL
NSQQDWQSNAKIRIVDGAANQIQVADGGRKYVFTLSIDESGSLVANLNGVSDPIILQSERAKVHSFHDYELQYSALNRST
TLFVDGQAITTWTGEVSQENNIQFGNADAQIDGRLHVQNIALTQQGQNLVEFDAFYLAQQTPEVEKDLEKLGWTKIKTGN
TMSLYGNASVNPGPGHGITLTRQQNISGSQNGRLIYPAIVLDRFFLNVMSIYSDDGGSNWQTGSTLPIPFRWKSSSILET
LEPSEADMVELQNGDLLLTARLDFNQIVNGVNYSPRQQFLSKDGGITWSLLEANNANVFSNISTGTVDASITRFEQSDGS
HFLLFTNPQGNPAGTNGRQNLGLWFSFDEGVTWKGPIQLVNGASAYSDIYQLDSENAIVIVETDNSNMRILRMPITLLKQ
KLTLSQN
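
The DNA sequence above begins with TTG while the protein sequence begins with M, which is consistent with TTG serving as an alternative start codon under the bacterial genetic code (NCBI translation table 11). The following is a minimal Python sketch of how the two sequences relate, assuming the standard codon assignments and that table's start codons. The function name translate_cds is illustrative and is not part of the database record.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons of NCBI translation table 11 (assumed to apply to this gene).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading an initial start codon as M."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":              # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the record's DNA sequence.
print(translate_cds("TTGTCAATCAAGATGACTTCACAACGAAGA"))  # MSIKMTSQRR
# Final 24 nucleotides, ending in the TAA stop codon.
print(translate_cds("AAGCTGACCTTATCGCAAAACTAA"))        # KLTLSQN
```

Passing the full DNA sequence (line breaks are stripped by the function) should reproduce the 807-residue protein listed above, since the 2424 nucleotides correspond to 807 codons plus the stop codon.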