Gene Information

Name : BURPS668_A2959 (BURPS668_A2959)
Accession : YP_001063950.1
Strain :
Genome accession: NC_009075
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3517
EC number : -
Position : 2811903 - 2813411 bp
Length : 1509 bp
Strand : -
Note : identified by match to protein family HMM PF05943

DNA sequence :
ATGAATGAACGTGCCCAAACCCAGGCCGACACGCGCGCCGCCGCCCAGCCCGTGGTCGCGCGCGACGAGTTCGCCGCGCT
GCTGCAAAAGGAGTTCAAGCCGAAGACGGCGGAGGCGCGCGAATCGGTCGAGCGCGCGGTGCGCACGCTCGCGCAGCAGG
CGCTCGAGCACACGGTCGGCATGACGACCGACGCTTACGGCAGCGTCAAGCAGATCATCGCCGAGATCGACCGCAAGCTC
TCCGAGCAGATCAACCTGATCCTGCATCATCAGGAGTTCCAGACGCTCGAAGGCGCGTGGCGCGGCCTGCACTACCTCGT
CACGAACACCGAGACCGACGAGCTGCTGAAGATCAAGGCACTGCCCGCGTCGCGCAACGAGCTCGCGCGCACGCTCAAGC
GCTACAAGGGCGTCGCGTGGGATCAGAGCCCGCTGTTTCGCAAGGTCTACGAAGAAGAGTACGGCCAGTTCGGCGGCGAG
CCGTTCGGCTGCCTCGTCGGCGATTTCCATTTCAACCACAGTCCGCCCGACGTCGAGATGCTCGGCGAGCTGTCGAAGAT
CGCGGCGGCCGCGCATGCGCCGTTCATCGCGGGCGCGTCGCCCGAGCTGATGCAGATGGATTCGTGGCAGGAGCTCGCCA
ATCCGCGTGATCTGACGAAGATCTTCCAGAACACCGAATACGCAGCGTGGCGCAGCTTGCGCCAGTCCGAGGATTCGCGC
TACGTCGGGCTCGCGATGCCGCGCTTTCTCGCGCGGCTGCCGTACGGCGCGCGCACGAATCCCGTCGACGAATTCGACTT
CGAGGAGGATACCGGCGCCGCGAGCCACGACCGCTACACGTGGGCGAATTCCGCGTACGCGATGGCGGCGAACATCAACC
GCTCGTTCAAGCTGTACGGCTGGTGCTCGTCGATCCGCGGCGTCGAATCGGGCGGCGCGGTGCAAGGGCTGCCGTGCCAC
ACGTTCCCCACCGACGACGGCGGCGTCGACCAGAAATGCCCGACCGAGATCGCGATCAGCGACCGCCGCGAGGCCGAGCT
CGCGAAGAACGGCTTCATGCCGTTCGTACATCGAAAGAATTCGGATTTCGCGGCGTTCATCGGCGCGCAGTCGCTGTATC
AGCCCGCCGAGTATCACGATCCCGACGCGACCGCGAACGCGCGACTCTCGGGCCGCCTGCCGTACCTGTTCGCGTGCTGC
CGCTTCGCGCATTACCTGAAGTGCATCGTGCGCGACAAGATCGGCTCGTTTCGCGAGCGCGACGATATGGAGCGCTGGCT
CAACGACTGGATCATGAACTACGTCGACGGCGATCCGGCGAACTCGTCGCAGGAGACGAAGGCGCGCAAGCCGCTCGCGG
CCGCGCAGGTCGTCGTCGAGGAGATCGACGACAACCCCGGCTATTACGCGTCGAAATTCTTCCTGCGGCCGCATTACCAG
CTGGAAGGGCTCACCGTGTCGCTGCGGCTCATCTCGAAGCTGCCGTCGGCGAAGGCGGCGGGCGAATGA

Protein sequence :
MNERAQTQADTRAAAQPVVARDEFAALLQKEFKPKTAEARESVERAVRTLAQQALEHTVGMTTDAYGSVKQIIAEIDRKL
SEQINLILHHQEFQTLEGAWRGLHYLVTNTETDELLKIKALPASRNELARTLKRYKGVAWDQSPLFRKVYEEEYGQFGGE
PFGCLVGDFHFNHSPPDVEMLGELSKIAAAAHAPFIAGASPELMQMDSWQELANPRDLTKIFQNTEYAAWRSLRQSEDSR
YVGLAMPRFLARLPYGARTNPVDEFDFEEDTGAASHDRYTWANSAYAMAANINRSFKLYGWCSSIRGVESGGAVQGLPCH
TFPTDDGGVDQKCPTEIAISDRREAELAKNGFMPFVHRKNSDFAAFIGAQSLYQPAEYHDPDATANARLSGRLPYLFACC
RFAHYLKCIVRDKIGSFRERDDMERWLNDWIMNYVDGDPANSSQETKARKPLAAAQVVVEEIDDNPGYYASKFFLRPHYQ
LEGLTVSLRLISKLPSAKAAGE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec18 YP_851426.1 hypothetical protein Not tested PAI II APEC-O1 Protein 2e-79 41
aec18 AAQ96712.1 Aec18 Not tested AGI-1 Protein 1e-79 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
BURPS668_A2959 YP_001063950.1 hypothetical protein VFG2070 Protein 7e-179 75
BURPS668_A2959 YP_001063950.1 hypothetical protein VFG2475 Protein 3e-83 43
BURPS668_A2959 YP_001063950.1 hypothetical protein VFG2093 Protein 2e-85 42