Gene Information

Name : BPSS1497 (BPSS1497)
Accession : YP_111504.1
Strain :
Genome accession: NC_006351
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3517
EC number : -
Position : 2041333 - 2042832 bp
Length : 1500 bp
Strand : +
Note : Similar to Pseudomonas aeruginosa hypothetical protein Pa1658 SWALL:Q9I367 (EMBL:AE004593) (491 aa) fasta scores: E(): 9.7e-131, 66.03% id in 474 aa, and to Vibrio cholerae hypothetical protein Vca0108 SWALL:Q9KN57 (EMBL:AE004353) (492 aa) fasta scores: E

DNA sequence :
ATGGAAGGCGAACACCTGCAATCCCCGAAGCACGACGACGCGCCGGACGCGACGCCGCCCGAGTCCCCCGCCTCGCTGCT
CGACGAGCTGATCGAGGCCGCGCGCGTGAAGCGCGACGAAGACGCATACCCGATCACGCGCCACGGCATCCAGGCGTTCG
TCGCGCATCTGGCGAAGCCCAAGCGCCCGATCGAGACCGTGAGCCAGGCGACGATCGACGACATGATCGCCGAGATCGAC
CGCAAGCTGTGCCGGCAGATCGACGCGATCCTCCATGACCCGGCATTCCAGCAACTCGAATCGACGTGGCGCTCGCTGAA
GTTTCTCGTCGATCGAACGGATTTCCGCGAGAACGTGAAGGTTCAGATTCTCGACGTCGGCAAAACGGCGCTGTTCGACG
ACTTCGAGGATTCGCCCGACATCACGAAATCCGGGCTGTACCAGAAGGTCTATACGGCCGAGTACGGCCAATTCGGCGGC
CAGCCGATCGGCGCGATCGTCGCGAACTACACGTTCGGGCCCGGCGCGCAGGACGTCAAGCTGCTGCAGTACGTCGCGAG
CACGTCGGCGATGGCGCATACGCCGTTCATCGCGGCGGCGGGCCCCGCGTTCTTCGGCATCGATTCGTTCGGCAAGCTGC
CGAACGTAAAGGATCTCGCCTCGCTGTTCGAGGGGCCGCAATACGCGAAATGGAATGCGTTTCGCGAAAGCGAGGACGCG
CGCTACGTCGGCCTCACGCTGCCGCGCTTCTTGCTGCGGCTGCCTTACGGCGCGAACACGACGCCCGTCAAGCGCTTCAA
CTACGAGGAGCGCGTCGACGGCGGCGACGCGCATTTTCTGTGGGGCAACGCGGCGTTCGCGTTCGCGACGCGCCTCACCG
CGAGCTTCGCCGACTATCGCTGGTGCGCGAACGTGATCGGGCCGAAAGGCGGCGGAACGGTGACCGATCTGCCGCTCTAC
GCGTACGAATCGATGGGCGAGATCCAGAACAAGATCCCGACCGACGTGCTGATTTCCGAGCGCCGCGAGTTCGAGCTCGC
CGAACAGGGCTTCATCGCGCTGACGATGCGCAAGAACAGCGACAACGCCGCCTTTTTCTCCGCGAACTCCACGCAGAAGC
CGAAGTTCTTCGGCATCAGCAAGGAAGGCAAGGAGGCCGAGCTCAACTATCGGCTCAGCACGCAACTGCCGTACATCTTC
GTCGTCAACCGGCTCGCCCATTACATCAAGGTGATCCAGCGGGAAAACATCGGCTCGTGGAAGGAGCGCGGCGATCTCGA
GCAGGAGCTCAACCAGTGGATCCGCCAGTACGTCGTCGACATGGACAACCCGTCGCAGAGCGTGCGCAGCCGCCGCCCGC
TGCGGCAGGCGCAGATCGTCGTGTCGGACGTCGAGGGCGAACCCGGCTGGTATCGCGTGGACATGAAGGTGCGGCCGCAC
TTCAAGTACATGGGCGCGTTCTTCACGCTGTCGCTCGTCGGCAAGCTCGAAAAGCGCTAG

Protein sequence :
MEGEHLQSPKHDDAPDATPPESPASLLDELIEAARVKRDEDAYPITRHGIQAFVAHLAKPKRPIETVSQATIDDMIAEID
RKLCRQIDAILHDPAFQQLESTWRSLKFLVDRTDFRENVKVQILDVGKTALFDDFEDSPDITKSGLYQKVYTAEYGQFGG
QPIGAIVANYTFGPGAQDVKLLQYVASTSAMAHTPFIAAAGPAFFGIDSFGKLPNVKDLASLFEGPQYAKWNAFRESEDA
RYVGLTLPRFLLRLPYGANTTPVKRFNYEERVDGGDAHFLWGNAAFAFATRLTASFADYRWCANVIGPKGGGTVTDLPLY
AYESMGEIQNKIPTDVLISERREFELAEQGFIALTMRKNSDNAAFFSANSTQKPKFFGISKEGKEAELNYRLSTQLPYIF
VVNRLAHYIKVIQRENIGSWKERGDLEQELNQWIRQYVVDMDNPSQSVRSRRPLRQAQIVVSDVEGEPGWYRVDMKVRPH
FKYMGAFFTLSLVGKLEKR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec18 YP_851426.1 hypothetical protein Not tested PAI II APEC-O1 Protein 2e-148 58
aec18 AAQ96712.1 Aec18 Not tested AGI-1 Protein 1e-148 58
HH0247 NP_859778.1 hypothetical protein Not tested HHGI1 Protein 2e-146 53

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
BPSS1497 YP_111504.1 hypothetical protein VFG2475 Protein 0.0 100
BPSS1497 YP_111504.1 hypothetical protein VFG2093 Protein 7e-158 62
BPSS1497 YP_111504.1 hypothetical protein VFG2070 Protein 7e-82 41