Gene Information

Name : ECs4547 (ECs4547)
Accession : NP_312574.1
Strain : Escherichia coli Sakai
Genome accession: NC_002695
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3436
EC number : -
Position : 4586854 - 4588392 bp
Length : 1539 bp
Strand : +
Note : similar to L0015 [Escherichia coli EDL933] gi|3414883|gb|AAC31494.1|, hypothetical proteins e.g. [Escherichia coli plasmid pEAF] gi|4808945|gb|AAD30027.1|AF119170_2

DNA sequence :
ATGAACGACATCTCTTCTGACGACATCTTCCTGCTGAAACAGCGCCTGGCCGAACAGGAAGCGCTGATCCACGCCCTGCA
GGAAAAGCTGAGCAACCGGGAGCGCGAAATAGACCATCTGCAGGCGCAGCTGGATAAACTCCGCCGGATGAACTTCGGCA
GTCGTTCCGAAAAAGTCTCCCGCCGTATCGCACAAATGGAAGCCGATCTGAACCGGCTTCAGAAAGAGAGCGATACGCTG
ACTGGTAGGGTGTATGACCCGGCAGTACAGCGTCCGTTGCGTCAGACCCGCACCCGTAAGCCGTTCCCTGAATCACTACC
CCGTGACGAAAAGCGACTGTTGCCTGCGGCGCCGTGCTGCCCGAACTGCGGCGGTTCACTGAGCTATCTGGGCGAGGATA
CCGCCGAACAGCTGGAGTTGATGCGTAGCGCCTTCCGGGTTATCCGGACGGTACGGGAAAAACATGCCTGTACTCAGTGC
GATGCCATCGTGCAGGCACCTGCACCTTCGCGGCCCATCGAGCGGGGTATCGCCGGACCGGGGCTGCTGGCCCGCGTGCT
GACCTCGAAGTATGCAGAGCACACCCCGCTGTATCGCCAGTCAGAAATATACGGCCGGCAAGGTGTGGAGCTGAGGCGTT
CACTGCTGTCGGGCTGGGTGGATGCATGCTGCCGGCTGCTGTCTCCGCTGGAAGAGGCGCTTCATGGCTATGTCATGACT
GACGGCAAACTCCATGCCGATGATACCCCGGTCCAGGTACTGCTGCCGGGTAATAAGAAGACGAAGACCGGGCGGTTGTG
GGCGTATGTTCGTGATGACCGCAATGCAGGGTCAGCGTTGGCACCTGCAGTGTGGTTCGCTTACAGCCCGGACAGAAAAG
GCATCCATCCGCAGACTCATCTTGCCTGCTTCAGCGGTGTGCTGCAAGCGGATGCGTACGCCGGGTTCAACGAGCTGTAT
CGCAATGGTGGGATAACGGAAGCTGCCTGCTGGGCTCATGCCCGCCGAAAGATCCACGATGTGCACGTCCGCATCCCGTC
AGCACTGACGGAAGAAGCCCTGGAGCAGATCGGTCAGTTGTACGCCATAGAGGCGGATATAAGGGGAATGCCGGCAGAGC
AGCGGCTTGCTGAACGTCAGCGAAAAACGAAACCGTTGTTGAAATCCCTGGAAAGCTGGTTGCGTGAAAAGATGAAGACC
CTGTCGCGACACTCAGAGTTGGCGAAGGCGTTCGCGTACGCACTTAACCAGTGGCCGGCACTGACGTACTATGCGAACGA
TGGCTGGGTGGAAATCGACAACAACATCGCTGAAAATGCCCTGCGGGCGGTCAGTCTGGGTCGTAAAAACTTCCTGTTCT
TCGGCTCTGATCATGGTGGTGAGCGGGGAGCGCCACTGTACAGCCTGATCGGGACGTGCAAACTGAATGACGTGGATCCA
GAAAGCTACCTTCGCCATGTGCTTGGCGTCATAGCAGACTGGCCGGTCAACCGGGTCAGCGAACTGCTTCCGTGGCGCAT
AGCACTGCCAGCTGAATAA

Protein sequence :
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRMNFGSRSEKVSRRIAQMEADLNRLQKESDTL
TGRVYDPAVQRPLRQTRTRKPFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRTVREKHACTQC
DAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQSEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMT
DGKLHADDTPVQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTHLACFSGVLQADAYAGFNELY
RNGGITEAACWAHARRKIHDVHVRIPSALTEEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKNFLFFGSDHGGERGAPLYSLIGTCKLNDVDP
ESYLRHVLGVIADWPVNRVSELLPWRIALPAE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
ECs4547 NP_312574.1 hypothetical protein Not tested LEE Protein 0.0 100
unnamed ACU09439.1 IS66 family element transposase Not tested LEE Protein 0.0 100
unnamed CAC39285.1 hypothetical protein Not tested LPA Protein 0.0 99
unnamed AAC31494.1 L0015 Not tested LEE Protein 0.0 99
Z1131 NP_286666.1 hypothetical protein Not tested TAI Protein 0.0 99
st57 CAC81895.1 ST57 protein Not tested LEE II Protein 0.0 99
Z1570 NP_287074.1 hypothetical protein Not tested TAI Protein 0.0 99
Z4337 NP_289562.1 hypothetical protein Not tested OI-122 Protein 0.0 99
tnp AEA34686.1 transposase Not tested Not named Protein 0.0 99
Z5098 NP_290249.1 prophage-associated protein Not tested LEE Protein 0.0 99
unnamed AAL57570.1 unknown Not tested LEE Protein 0.0 99
unnamed CAI43806.1 hypothetical protein Not tested LEE Protein 0.0 98
unnamed AAL08460.1 unknown Not tested SRL Protein 0.0 96
c3563 NP_755438.1 hypothetical protein Not tested PAI I CFT073 Protein 8e-164 92
Z4340 NP_289564.1 hypothetical protein Not tested OI-122 Protein 2e-151 64
Z1161 NP_286696.1 hypothetical protein Not tested TAI Protein 6e-155 64
Z1600 NP_287104.1 hypothetical protein Not tested TAI Protein 6e-155 64
s0025 CAD33772.1 IS66-like transposase Not tested PAI I 536 Protein 2e-106 62
aec53 AAW51736.1 Aec53 Not tested AGI-3 Protein 7e-141 60
tnp AEA34687.1 IS66 family transposase Not tested Not named Protein 1e-101 59
ECO103_3554 YP_003223421.1 hypothetical protein Not tested LEE Protein 9e-118 56
Z4317 NP_289543.1 hypothetical protein Not tested OI-122 Protein 1e-117 55
BCAM0248 YP_002232880.1 putative transposase Not tested BcenGI11 Protein 1e-129 55

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECs4547 NP_312574.1 hypothetical protein VFG0793 Protein 0.0 99
ECs4547 NP_312574.1 hypothetical protein VFG1051 Protein 0.0 96
ECs4547 NP_312574.1 hypothetical protein VFG1700 Protein 3e-164 92
ECs4547 NP_312574.1 hypothetical protein VFG1736 Protein 3e-124 63
ECs4547 NP_312574.1 hypothetical protein VFG1513 Protein 1e-106 62