PAI Gene Information


Name : kpsC (ORF_61)
Accession : AAZ04470.1
PAI name : PAI I APEC-O1
PAI accession : DQ095216
Strain : Escherichia coli 042
Virulence or Resistance: Virulence
Product : capsule polysaccharide export protein
Function : -
Note : similar to NP_755565
Homologs in the searched genomes :   108 hits    ( 108 protein-level )  
Publication :
    -Kariyawasam,S., Johnson,T.J. and Nolan,L.K., "The pap Operon of Avian Pathogenic Escherichia coli Strain O1:K1 Is Located on a Novel Pathogenicity Island", Infect. Immun. 74 (1), 744-749 (2006) PUBMED 16369033.

    -Kariyawasam,S., Johnson,T.J. and Nolan,L.K., "Direct Submission", Submitted (15-JUN-2005) Veterinary Microbiology and Preventive Medicine, Iowa State University, 1802 Elwood Drive, VMRI 2, Ames, IA 50011, USA.


DNA sequence :
ATGATTGGCATTTACTCGCCTGGCATCTGGCGTATTCCGCATCTGGAGAAATTTCTGGCGCAACCGTGCCAGAAACTTTC
TCTGCTGCGCCCTGTTCCGCAAGAAGTTAATGCTATCGCCGTGTGGGGACATCGTCCCAGCGCGGCGAAACCAGTCGCCA
TCGCCAAAGCAGCGGGAAAACCCGTCATTCGTCTGGAAGATGGATTTGTGCGTTCGCTGGATCTTGGCGTCAATGGCGAG
CCGCCGCTTTCTCTGGTGGTGGATGATTGTGGCATTTACTACGATGCCAGCAAGCCTTCGGCGCTGGAGAAACTGGTACA
GGATAAAGCCGGAAATACAGCTCTGATAAGCCAGGCCAGAGAAGCGATGCACACCATCGTGACCGGGGATATGTCGAAAT
ATAATCTGGCGCCTGCGTTTGTGGCTGATGAGTCAGAACGTACAAACATCGTTCTGGTTGTCGATCAGACATTTAATGAT
ATGTCAGTGACGTATGGCAATGCTGGCCCGCATGAGTTTGCTGCCATGCTGGAAGCCGCGATGGCGGAAAATCCTCAAGC
CGAAATTTGGGTGAAGGTGCACCCAGATGTACTGGAAGGAAAGAAAACAGGTTATTTCGCCGATCTGCGCGCCACGCAAC
GAGTACGTTTAATTGCCGAGAATGTCAGCCCGCAGTCGCTGTTGCGACACGTTTCCCGGGTTTACGTCGTGACCTCCCAA
TACGGCTTTGAAGCCTTGCTGGCAGGAAAACCAGTAACATGTTTCGGCCAGCCCTGGTATGCAAGCTGGGGCTTAACCGA
CGATCGCCATCCGCAGTCCGCTTTGTTATCTGCCCGACGCGGTTCTGCCACGCTGGAGGAACTTTTTGCCGCTGCATACC
TGCGTTACTGTCGCTATATCGATCCGCAAACGGGAGAAGTAAGCGATCTATTTACCGTGCTGCAATGGCTGCAATTACAA
CGTCGACATCTGCAACAGCGTAATGGTTATTTATGGGCGCCAGGCTTAACGCTGTGGAAGTCGGCGATCCTGAAACCCTT
CTTACGAACGCCAACAAACCGGCTGAGTTTTTCACGTCGCTGTACTGCGGCGAGCGCCTGCGTGGTATGGGGTGTAAAGG
GGGAACAGCAATGGCGAGCCGAAGCGCAGCGAAAATCACTGCCATTATGGCGAATGGAAGATGGTTTTCTGCGTTCATCC
GGACTTGGCTCTGACCTGCTGCCGCCGCTATCGTTGGTACTGGATAAACGCGGGATCTACTATGACGCCACGCGCCCCAG
CGACCTGGAAGTGCTGCTTAATCATAGCCAGCTAACGCTGGCGCAGCAGATGCGAGCTGAAAAATTACGCCAGCGACTGG
TTGAAAGTAAACTGAGCAAGTACAACCTGGGAGCCGATTTCTCTCTACCAGCCAAAGCCAAAGATAAAAAAGTTATCCTG
GTGCCGGGTCAGGTAGAGGACGATGCCTCTATTAAAACAGGCACAGTCTCGATTAAGAGCAACCTTGAGTTATTACGCAC
AGTACGCGAGCGTAATCCGCACGCCTACATTGTTTATAAACCGCACCCGGATGTACTGGTGGGGAATCGCAAGGGCGATA
TTCCGGCAGAACTGACTGCTGAACTCGCTGATTATCAGGCACTGGACGCCGATATTATTCAATGCATTCAGCGCGCAGAT
GAAGTGCATACCATGACGTCGCTGTCGGGGTTTGAAGCGTTATTACATGGCAAGCACGTACATTGTTACGGCCTGCCCTT
CTATGCCGGTTGGGGTTTAACCGTCGATGAACATCGTTGCCCGCGTCGCGAGCGAAAATTAACGTTAGCGGATTTGATCT
ATCAGGCGCTGATTGTTTATCCAACCTATATCCACCCAACACGGCTACAACCTATTACGGTTGAAGAGGCGGCGGAATAT
TTGATCCAGACACCGCGCAAGCCGATGTTTATTACCCGAAAAAAAGCGGGGCGAGTAATACGTTATTACCGCAAATTAAT
TATGTTCTGTAAGGTCAGATTTGGCTAA

Protein sequence :
MIGIYSPGIWRIPHLEKFLAQPCQKLSLLRPVPQEVNAIAVWGHRPSAAKPVAIAKAAGKPVIRLEDGFVRSLDLGVNGE
PPLSLVVDDCGIYYDASKPSALEKLVQDKAGNTALISQAREAMHTIVTGDMSKYNLAPAFVADESERTNIVLVVDQTFND
MSVTYGNAGPHEFAAMLEAAMAENPQAEIWVKVHPDVLEGKKTGYFADLRATQRVRLIAENVSPQSLLRHVSRVYVVTSQ
YGFEALLAGKPVTCFGQPWYASWGLTDDRHPQSALLSARRGSATLEELFAAAYLRYCRYIDPQTGEVSDLFTVLQWLQLQ
RRHLQQRNGYLWAPGLTLWKSAILKPFLRTPTNRLSFSRRCTAASACVVWGVKGEQQWRAEAQRKSLPLWRMEDGFLRSS
GLGSDLLPPLSLVLDKRGIYYDATRPSDLEVLLNHSQLTLAQQMRAEKLRQRLVESKLSKYNLGADFSLPAKAKDKKVIL
VPGQVEDDASIKTGTVSIKSNLELLRTVRERNPHAYIVYKPHPDVLVGNRKGDIPAELTAELADYQALDADIIQCIQRAD
EVHTMTSLSGFEALLHGKHVHCYGLPFYAGWGLTVDEHRCPRRERKLTLADLIYQALIVYPTYIHPTRLQPITVEEAAEY
LIQTPRKPMFITRKKAGRVIRYYRKLIMFCKVRFG