Gene Information

Name : BPSS2099 (BPSS2099)
Accession : YP_112098.1
Strain :
Genome accession: NC_006351
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3517
EC number : -
Position : 2840542 - 2842050 bp
Length : 1509 bp
Strand : -
Note : Similar to Pseudomonas aeruginosa hypothetical protein pa0084 SWALL:Q9I748 (EMBL:AE004447) (498 aa) fasta scores: E(): 3.9e-153, 72.83% id in 497 aa, and to Yersinia pestis hypothetical protein ypo2938 SWALL:Q8ZCP0 (EMBL:AJ414154) (500 aa) fasta scores: E

DNA sequence :
ATGAATGAACGTGCCCAAACCCAGGCCGACACGCGCGCCGCCGCCCAGCCCGTGGTCGCGCGCGACGAGTTCGCCGCGCT
GCTGCAAAAGGAGTTCAAGCCGAAGACGGCGGAGGCGCGCGAATCGGTCGAGCGCGCGGTGCGCACGCTCGCGCAGCAGG
CGCTCGAGCACACGGTCGGCATGACGACCGACGCTTACGGCAGCGTCAAGCAGATCATCGCCGAGATCGACCGCAAGCTC
TCCGAGCAGATCAACCTGATCCTGCATCATCAGGAGTTCCAGACGCTCGAAGGCGCGTGGCGCGGCCTGCACTACCTCGT
CACGAACACCGAGACCGACGAGCTGCTGAAGATCAAGGCACTGCCCGCGTCGCGCAACGAGCTCGCGCGCACGCTCAAGC
GCTACAAGGGCGTCGCGTGGGATCAGAGCCCGCTGTTTCGCAAGGTCTACGAAGAAGAGTACGGCCAGTTCGGCGGCGAG
CCGTTCGGCTGCCTCGTCGGCGATTTCCATTTCAACCACAGTCCGCCCGACGTCGAGATGCTCGGCGAGCTGTCGAAGAT
CGCGGCAGCCGCGCATGCGCCGTTCATCGCGGGCGCGTCGCCCGAGCTGATGCAGATGGATTCGTGGCAGGAGCTCGCCA
ATCCGCGTGATCTGACGAAGATCTTCCAGAACACCGAATACGCCGCGTGGCGCAGCCTGCGCCAGTCCGAGGATTCGCGC
TACGTCGGGCTCGCGATGCCGCGCTTTCTCGCGCGGCTGCCGTACGGCGCGCGCACGAATCCCGTCGACGAATTCGACTT
CGAGGAGGATACCGGCGCCGCGAGCCACGACCGCTACACGTGGGCGAATTCCGCGTACGCGATGGCGGCGAACATCAACC
GCTCGTTCAAGCTGTACGGCTGGTGCTCGTCGATCCGCGGCGTCGAATCGGGCGGCGCGGTGCAAGGGCTGCCGTGCCAC
ACGTTCCCCACCGACGACGGCGGCGTCGACCAGAAATGCCCGACCGAGATCGCGATCAGCGACCGCCGCGAGGCCGAGCT
CGCGAAGAACGGCTTCATGCCGTTCGTACATCGAAAGAATTCGGATTTCGCGGCGTTCATCGGCGCGCAGTCGCTGTATC
AGCCCGCCGAGTATCACGATCCCGACGCGACCGCGAACGCGCGGCTCTCGGGCCGCCTGCCGTACCTGTTCGCGTGCTGC
CGCTTCGCGCATTACCTGAAGTGCATCGTGCGCGACAAGATCGGCTCGTTTCGCGAGCGCGACGACATGGAGCGCTGGCT
CAACGACTGGATCATGAACTACGTCGACGGCGATCCGGCGAACTCGTCGCAGGAGACGAAGGCGCGCAAGCCGCTCGCGG
CCGCGCAGGTCGTCGTCGAGGAGATCGACGACAACCCCGGCTATTACGCGTCGAAATTCTTCCTGCGGCCGCATTACCAA
CTGGAAGGGCTCACCGTGTCGCTGCGGCTCATCTCGAAGCTGCCGTCGGCGAAGGCGGCGGGCGAATGA

Protein sequence :
MNERAQTQADTRAAAQPVVARDEFAALLQKEFKPKTAEARESVERAVRTLAQQALEHTVGMTTDAYGSVKQIIAEIDRKL
SEQINLILHHQEFQTLEGAWRGLHYLVTNTETDELLKIKALPASRNELARTLKRYKGVAWDQSPLFRKVYEEEYGQFGGE
PFGCLVGDFHFNHSPPDVEMLGELSKIAAAAHAPFIAGASPELMQMDSWQELANPRDLTKIFQNTEYAAWRSLRQSEDSR
YVGLAMPRFLARLPYGARTNPVDEFDFEEDTGAASHDRYTWANSAYAMAANINRSFKLYGWCSSIRGVESGGAVQGLPCH
TFPTDDGGVDQKCPTEIAISDRREAELAKNGFMPFVHRKNSDFAAFIGAQSLYQPAEYHDPDATANARLSGRLPYLFACC
RFAHYLKCIVRDKIGSFRERDDMERWLNDWIMNYVDGDPANSSQETKARKPLAAAQVVVEEIDDNPGYYASKFFLRPHYQ
LEGLTVSLRLISKLPSAKAAGE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec18 YP_851426.1 hypothetical protein Not tested PAI II APEC-O1 Protein 2e-79 41
aec18 AAQ96712.1 Aec18 Not tested AGI-1 Protein 1e-79 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
BPSS2099 YP_112098.1 hypothetical protein VFG2070 Protein 7e-179 75
BPSS2099 YP_112098.1 hypothetical protein VFG2475 Protein 3e-83 43
BPSS2099 YP_112098.1 hypothetical protein VFG2093 Protein 2e-85 42