Gene Information

Name : O3O_04915 (O3O_04915)
Accession : YP_006785883.1
Strain : Escherichia coli 2009EL-2071
Genome accession: NC_018661
Putative virulence/resistance : Virulence
Product : ATP-dependent Clp proteinase ATP-binding subunit
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4236338 - 4239097 bp
Length : 2760 bp
Strand : +
Note : COG0542 ATPases with chaperone activity, ATP-binding subunit
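
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state but which agrees with the listed length; the helper name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 4236338, 4239097

length_bp = feature_length(start, end)  # should agree with the Length field
codons = length_bp // 3                 # codon count, including the stop codon
residues = codons - 1                   # amino acids once the stop codon is dropped

print(length_bp, codons, residues)      # 2760 920 919
```

Under that assumption the gene spans 2760 bp, or 920 codons including the stop codon, which corresponds to a 919-residue protein.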

DNA sequence :
ATGATCCAGATTGATCTTCCCACGCTGGTAAAACGGCTGAACCTGTTCTCCCGCCAGGCGCTGGAGATGGCCGCCTCTGA
ATGTATGAGTCAGCAGGCAGCGGAAATTACCGTCAGCCATGTGCTCATTCAGATGCTCGCCATGCCACGCAGTGACCTGC
GGGTTATTACCCGCCAGGGCGATATTGGCATGGAAGAGTTGCGCCAGGCGCTGACGGTAGAGAACTACACAACCGCCCGT
TCTGCGGACAGCTACCCGGCGTTTTCCCCGATGCTGGTTGAGTGGCTTAAAGAGGGCTGGCTGCTGGCGTCGGCTGAGAT
GCAGCACAGCGAACTGCGCGGCGGCGTGTTGCTGCTGGCCCTGCTGCATTCGCCGCTGCGTTATATACCGCCTGCTGCCG
CCCGGCTGTTGACCGGCATTAACCGTGACCGTCTGCAACAGGACTTTGTGCAGTGGACACAGGAGTCGGCGGAATCAGTC
GTGCCGGATGCAGACGGTAAAGGCGCAGGCACAATGACGGACGCCTCTGACACCCTGCTTGCCCGCTATGCCAAAAACAT
GACCGCAGACGCCCGTAACGGCAGGCTTGACCCGGTACTGTGCCGCGACCACGAAATCGACCTGATGATCGACATTCTCT
GCCGCCGCCGTAAAAACAACCCGGTGGTGGTGGGCGAAGCGGGCGTGGGCAAAAGCGCACTGATTGAAGGGCTGGCGCTG
CGCATCGTGGCAGGCCAGGTGCCGGACAAGCTGAAAAACACCGATATCATGACCCTTGACTTGGGCGCATTGCAGGCCGG
GGCGTCGGTGAAGGGTGAATTCGAAAAACGTTTCAAAGGGCTGATGGCGGAGGTCATTTCCTCCCCGGTGCCGGTCATTC
TGTTTATCGACGAAGCACATACCCTGATTGGCGCGGGCAACCAGCAGGGCGGGCTGGATATCTCCAACCTGCTCAAACCG
GCGCTGGCGCGCGGCGAGCTGAAAACCATCGCCGCCACCACCTGGAGCGAGTACAAAAAATACTTCGAAAAAGATGCCGC
CCTGTCGCGCCGCTTCCAGTTGGTGAAGGTCAGCGAACCCAACGCTGCCGAAGCCACCATTATTCTGCGCGGTCTGTCGG
CGGTCTATGAACAGTCTCACGGCGTGCTGATTGATGATGACGCCTTGCAGGCCGCTGCGACATTAAGCGAGCGTTATCTC
TCCGGGCGTCAGTTACCGGACAAAGCGATTGATGTGCTGGATACCGCCTGCGCCCGTGTGGCCATCAACCTGTCGTCGCC
GCCGAAGCAAATCTCGGCGCTGACCACTCTGAGCCACCAGCAGGAGGCGGAAATTCGCCAGCTTGAGCGCGAGCTTCGCA
TCGGACTGCGTACCGACACATCACGGATGACCGAGGTGCTGGTGCAGTATGATGAAACGCTGACGGCGCTGGATGAACTG
GAAGCGGCCTGGCACCAGCAGCAGACGCTGGTCCGGGAGATTATTGCGCTGCGCCAGCAGTTACTGGGCGTGGCAGAGGA
CGATGCGGCGCCGTTGCCGGACGCAGATACCGTGGAGGATACGCAGCCAGAGTCAGAACAGGATAATACCGGTGCTAAAC
TGGCTGATGAAGCTGGCAGCGAACAGCCGGAAGAGACCGCAGAAACAGTTTCCCCGGTGCAGCGACTGGCACAGCTCACT
GCCGAACTGGACGCCCTGCATAACGACCGGTTGCTGGTCTCCCCGCACGTCGATAAAAAACAGATTGCGGCGGTGATTGC
CGAATGGACCGGCGTACCGCTTAACCGCCTGTCACAGAATGAGATGTCGGTCATCACCGACCTGCCGGTATGGCTGGGTG
ACACCATCAAAGGCCAGGACCTGGCGATTGCCAGCCTGCATAAACACCTGCTGACTGCACGCGCCGACCTGCGTCGTCCG
GGACGCCCACTCGGCGCGTTTCTGCTGGCCGGTCCCAGCGGCGTGGGTAAAACCGAAACCGTCCTGCAACTGGCAGAACT
GCTCTACGGCGGTCGCCAGTACCTGACCACCATCAATATGTCCGAGTTCCAGGAAAAACACACCGTCTCGCGGCTGATTG
GCTCCCCTCCGGGCTATGTCGGCTATGGCGAAGGCGGCGTACTGACCGAAGCGATTCGCCAGAAACCGTACTCGGTGGTG
CTGCTTGATGAAGTGGAAAAAGCGCACCCGGATGTCCTCAACCTGTTCTACCAGGCGTTCGACAAGGGCGAGATGGCAGA
CGGCGAAGGCCGACTGATTGACTGTAAGAATATCGTTTTCTTCCTCACCTCCAACCTTGGTTACCAGGTGATTGTTGAAC
ACGCGGATGACCCGGAAACCATGCAGGAAGTGCTGTATCCGGTGCTGGCGGACTTCTTTAAACCAGCCCTGCTGGCGCGT
ATGGAAGTGGTGCCGTACCTGCCTCTGTCGAAAGAGACGCTCACCACCATTATCGACGGGAAACTGGCCCGCCTGGATAA
CGTGCTGCGCAGCCGCTTTGATGCGGACGTGATTATTGAGTCGGAAGTGACGGACGAGATCATGAGCCGCGTCACCCGCG
CGGAAAACGGCGCAAGGATGCTGGAGTCCGTCATTGACGGCGACATGCTGCCCCCGCTCTCGCTGCTGCTGTTGCAGAAA
ATGGCGGCCAACACGGCGATTGCCCGCATCCGCCTGTCGGCGGCAGACGGCGCGTTCACGGCAGACGTGGAAGATGCTCT
GGACGACGAGTCTGTCACAGAGGATGAAACGGATTTATGA

Protein sequence :
MIQIDLPTLVKRLNLFSRQALEMAASECMSQQAAEITVSHVLIQMLAMPRSDLRVITRQGDIGMEELRQALTVENYTTAR
SADSYPAFSPMLVEWLKEGWLLASAEMQHSELRGGVLLLALLHSPLRYIPPAAARLLTGINRDRLQQDFVQWTQESAESV
VPDADGKGAGTMTDASDTLLARYAKNMTADARNGRLDPVLCRDHEIDLMIDILCRRRKNNPVVVGEAGVGKSALIEGLAL
RIVAGQVPDKLKNTDIMTLDLGALQAGASVKGEFEKRFKGLMAEVISSPVPVILFIDEAHTLIGAGNQQGGLDISNLLKP
ALARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEPNAAEATIILRGLSAVYEQSHGVLIDDDALQAAATLSERYL
SGRQLPDKAIDVLDTACARVAINLSSPPKQISALTTLSHQQEAEIRQLERELRIGLRTDTSRMTEVLVQYDETLTALDEL
EAAWHQQQTLVREIIALRQQLLGVAEDDAAPLPDADTVEDTQPESEQDNTGAKLADEAGSEQPEETAETVSPVQRLAQLT
AELDALHNDRLLVSPHVDKKQIAAVIAEWTGVPLNRLSQNEMSVITDLPVWLGDTIKGQDLAIASLHKHLLTARADLRRP
GRPLGAFLLAGPSGVGKTETVLQLAELLYGGRQYLTTINMSEFQEKHTVSRLIGSPPGYVGYGEGGVLTEAIRQKPYSVV
LLDEVEKAHPDVLNLFYQAFDKGEMADGEGRLIDCKNIVFFLTSNLGYQVIVEHADDPETMQEVLYPVLADFFKPALLAR
MEVVPYLPLSKETLTTIIDGKLARLDNVLRSRFDADVIIESEVTDEIMSRVTRAENGARMLESVIDGDMLPPLSLLLLQK
MAANTAIARIRLSAADGAFTADVEDALDDESVTEDETDL
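
The protein sequence above should be the in-frame translation of the DNA sequence. A minimal sketch of that check follows, assuming the standard codon assignments (shared by the bacterial translation table 11 for all sense and stop codons); the function name `translate` is hypothetical and only short excerpts of the record's sequence are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Step through complete codons only; trailing 1-2 bases are ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the DNA sequence in this record.
print(translate("ATGATCCAGATTGATCTTCCCACGCTG"))  # MIQIDLPTL
# Last six codons of the DNA sequence, ending in the TGA stop codon.
print(translate("GATGAAACGGATTTATGA"))           # DETDL
```

Both excerpts agree with the start (MIQIDLPTL) and end (DETDL) of the protein sequence listed above. Passing the full 2760 bp sequence to the same function would allow the whole protein to be compared.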

• Homologs from PAI DB

Gene   GenBank Accn  Product  Virulence or Resistance  PAI or REI      Alignment Type  E-val  Identity
aec27  YP_851418.1   ATPase   Not tested               PAI II APEC-O1  Protein         0.0    92
aec27  AAQ96721.1    Aec27    Not tested               AGI-1           Protein         0.0    92

• Homologs from VFDB (virulence genes)

Gene       GenBank Accn    Product                                           ID of source DB  Alignment Type  E-val   Identity
O3O_04915  YP_006785883.1  ATP-dependent Clp proteinase ATP-binding subunit  VFG2084          Protein         0.0     57
O3O_04915  YP_006785883.1  ATP-dependent Clp proteinase ATP-binding subunit  VFG2076          Protein         7e-137  42