Gene Information

Name : EcHS_A2108 (EcHS_A2108)
Accession : YP_001458788.1
Strain : Escherichia coli HS
Genome accession: NC_009800
Putative virulence/resistance : Resistance
Product : hypothetical protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0534
EC number : -
Position : 2096806 - 2098446 bp
Length : 1641 bp
Strand : -
Note : identified by match to protein family HMM PF01554; match to protein family HMM TIGR00797

DNA sequence :
TTGAGGCACATCTTAACGGCGAAAAATCTTTTGTCAAACCCGATTTTTAAATTTCCCAACTGTTTGCCGTTTCTATCAAC
AGTTTGTTGCATTTGCAGACAATTTGTTGGCGAAAATCTTTGCAGCTTTGCTGATTCTCCCTCATTATTTGAAATGTGGT
TTCACTTTCTGCAATTAAGGTCGGCTTTGAATATCTCCTCTGCTTTACGCCAGGTTGTTCACGGCACTCGCTGGCACGCT
AAACGCAAGAGCTACAAAGTGTTGTTCTGGCGCGAGATAACCCCGCTTGCTGTTCCTATCTTCATGGAGAATGCCTGTGT
CCTGTTGATGGGTGTCCTGAGCACTTTTCTGGTCAGCTGGCTGGGAAAAGATGCGATGGCCGGCGTGGGATTGGCGGACA
GCTTCAATATGGTCATTATGGCTTTTTTTGCTGCTATCGATCTTGGTACTACTGTCGTTGTGGCATTTAGTCTCGGTAAG
CGGGATCGACGACGAGCGAGGGTGGCGACGCGGCAGTCATTGGTGATCATGACGTTGTTTGCCGTACTGTTGGCAACGCT
TATTCATCATTTTGGCGAACAAATTATTGATTTCGTCGCGGGTGATGCCACGACAGAAGTTAAAGCACTGGCGTTGACTT
ATCTGGAGCTGACGGTACTCAGTTATCCAGCAGTTGCCATCACTTTGATTGGTAGCGGGGCACTTCGTGGTGCAGGGAAT
ACGAAAATACCGCTACTGATTAACGGTAGCCTGAATATTCTTAATATTATTATTAGCGGCATATTGATTTACGGCCTTTT
CTCCTGGCCGGGACTGGGATTTGTCGGGGCAGGGCTGGGTTTAACCATTTCTCGTTATATTGGCGCAGTTGCAATTTTGT
GGGTGCTGGCGATTGGTTTTAATCCTGCGCTAAGGATTTCGTTAAAGAGCTATTTTAAACCGCTGAATTTTAGCATTATC
TGGGAAGTCATGGGGATTGGTATTCCCGCTAGTGTCGAATCAGTGTTATTTACCAGTGGTCGGTTATTAACCCAAATGTT
CGTTGCCGGGATGGGGACCAGTGTTATTGCCGGAAATTTTATCGCGTTTTCAATTGTGGCTCTTATCAACTTACCCGGAA
GTGCGCTCGGCTCTGCTTCTACGATCATTACAGGCCGAAGGTTGGGGGTAGGGCAGATAGCGCAAGCAGAGATTCAGTTG
CGGCATGTGTTCTGGCTGTCCACTCTTGGATTAACGGCCATCGCCTGGCTAACGGCTCCCTTTGCCGGGCTTATGGCATC
GTTTTACACCCAGGATCCACAGGTTAAACATGTCGTTGTGATTCTGATTTGGCTAAATGCTTTATTTATGCCTATTTGGT
CCGCCTCATGGGTGCTACCCGCTGGATTTAAAGGTGCTCGTGATGCCCGTTACGCCATGTGGGTTTCGATGTTGAGCATG
TGGGGTTGTCGGGTTGTAGTCGGTTATGTGCTGGGAATCATGCTTGGCTGGGGTGTGGTTGGTGTCTGGATGGGAATGTT
TGCCGACTGGGCTGTGCGGGCCGTGCTGTTTTACTGGCGAATGGTTACTGGACGTTGGCTATGGAAATACCCTCGACCCG
AGCCGCAAAAGTGTGAAAAAAAGCCAGTTGTGTCGGAATAA

Protein sequence :
MRHILTAKNLLSNPIFKFPNCLPFLSTVCCICRQFVGENLCSFADSPSLFEMWFHFLQLRSALNISSALRQVVHGTRWHA
KRKSYKVLFWREITPLAVPIFMENACVLLMGVLSTFLVSWLGKDAMAGVGLADSFNMVIMAFFAAIDLGTTVVVAFSLGK
RDRRRARVATRQSLVIMTLFAVLLATLIHHFGEQIIDFVAGDATTEVKALALTYLELTVLSYPAVAITLIGSGALRGAGN
TKIPLLINGSLNILNIIISGILIYGLFSWPGLGFVGAGLGLTISRYIGAVAILWVLAIGFNPALRISLKSYFKPLNFSII
WEVMGIGIPASVESVLFTSGRLLTQMFVAGMGTSVIAGNFIAFSIVALINLPGSALGSASTIITGRRLGVGQIAQAEIQL
RHVFWLSTLGLTAIAWLTAPFAGLMASFYTQDPQVKHVVVILIWLNALFMPIWSASWVLPAGFKGARDARYAMWVSMLSM
WGCRVVVGYVLGIMLGWGVVGVWMGMFADWAVRAVLFYWRMVTGRWLWKYPRPEPQKCEKKPVVSE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
yeeO YP_853088.1 hypothetical protein Not tested PAI IV APEC-O1 Protein 8e-178 94

• Homologs from CARD and BacMet (resistance genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EcHS_A2108 YP_001458788.1 hypothetical protein BAC0146 Protein 9e-155 82