Gene Information

Name : c3564 (c3564)
Accession : NP_755439.1
Strain : Escherichia coli CFT073
Genome accession: NC_004431
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : T : Signal transduction mechanisms
COG ID : COG2205
EC number : -
Position : 3413218 - 3414690 bp
Length : 1473 bp
Strand : -
Note : Residues 169 to 490 of 490 are 26.31 pct identical to residues 162 to 473 of 475 from GenPept.129 : >gb|AAK81155.1|AE007818_1 (AE007818) Membrane associated histidine kinase with HAMP domain [Clostridium acetobutylicum]

DNA sequence :
ATGAATTGTAGCCTGACATTAAGCCAGAGGTTAAGCCTAGTATTTACAGTCGTTTTGCTGTTTTGCGCAGCAGTGACATG
TGGCGTTCATATTTACAGCAGTAATCTGTATGGCAATGCAATGGTACAGCGTTTATCTGCAGGGCTGGCGCAACAGATTG
TCATCACGGAGCCTCTGCTGGATAATCGTGGGCAGGTGAATCACCGGACATTAAAGAGTCTGTTTGAGCGTCTGATGACG
CTTAATCCCAGTGTGGAGCTGTATATTGTCTCGCCGGAAGGTCGGCTGCTTGTGGAGGCCGCCCCTCTAGGTCATATCAA
ACGTCGGTATATCAATATAGCGCCCTTGAAAAAATTTCTCTCCGGTGCTGTCTGGCCCGTATATGGTGATGATCCCCGAA
GTGTAAATAAGAAAAAAGTTTTCAGTACCGCACCGCTTTACCTGAGGGATGATCTGAAAGGATATCTGTATATTATTTTA
CAGGGAGAGGAACTTAATGCTCTTACTGATGCAGCCTGGACAAAGGCACTATGGAATGCACTGTACTGGTCGCTGTTTCT
GGTAGTGATATGTGGTCTGCTGTCGGGTATGCTGGTCTGGTACTGGGTAACCCGTCCCATACAGCAACTAACTGAAAATG
TCAGCGGGATAGAGCAGGACAGTATTAGTGCCATTAAACAACTGGCAATTCAGCGCCCTGCCACCCCCCCTAGCAACGAG
GTCGAGATATTACACAATGCCTTCATTGAACTGGCCCGTAAAATATCCTGTCAGTGGGATCAACTTTCAGAAAGTGATCA
ACAGCGCCGTGAATTTATTGCCAATATCTCCCATGATTTACGGACGCCATTAACATCACTTCTGGGATATCTGGAAACCC
TGTCAATGAAGTCGGATTCGCTATCATCAGAGGACTGTCATAAATATCTGACAACAGCTCTCCGGCAGGGACACAAGGTG
AGGCATCTATCCTGTCAGCTTTTTGAGCTGGCACGTCTTGAGCATGGTGCTATAAAACCTCAACTGGAGCAATTTTCTGT
CTGTGAACTTATTCAGGATGTAGCTCAAAAATTTGAGCTCAGCATAGAAACCCGTCGATTGCAACTAAGAATTATGATGT
CACATTCCCTGCCTCTTATCAGGGCAGATATTTCAATGATAGAGCGTGTGATAACAAATTTACTGGATAATGCTGTACGC
CACACACCTCCGGAAGGCTCGATCAGGCTGAAAGTCTGGCAGGAAGATAATCGGTTGCACGTCGAAGTGGCTGACAGCGG
CCCTGGACTAACTGAAGATATGCGAACTCATCTTTTCCGGCGGGCATCAGTGTTATGTCATGAACCGTCAGAAGAGCCCC
GGGGAGGACTGGGATTGCTGATTGTACGCAGGATGCTGGTACTACACGGTGGTGATATCAGGTTGACTGATTCAACGACT
GGAGCCTGCTTTCGTTTTTTTCTTCCATTATAA

Protein sequence :
MNCSLTLSQRLSLVFTVVLLFCAAVTCGVHIYSSNLYGNAMVQRLSAGLAQQIVITEPLLDNRGQVNHRTLKSLFERLMT
LNPSVELYIVSPEGRLLVEAAPLGHIKRRYINIAPLKKFLSGAVWPVYGDDPRSVNKKKVFSTAPLYLRDDLKGYLYIIL
QGEELNALTDAAWTKALWNALYWSLFLVVICGLLSGMLVWYWVTRPIQQLTENVSGIEQDSISAIKQLAIQRPATPPSNE
VEILHNAFIELARKISCQWDQLSESDQQRREFIANISHDLRTPLTSLLGYLETLSMKSDSLSSEDCHKYLTTALRQGHKV
RHLSCQLFELARLEHGAIKPQLEQFSVCELIQDVAQKFELSIETRRLQLRIMMSHSLPLIRADISMIERVITNLLDNAVR
HTPPEGSIRLKVWQEDNRLHVEVADSGPGLTEDMRTHLFRRASVLCHEPSEEPRGGLGLLIVRRMLVLHGGDIRLTDSTT
GACFRFFLPL
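
The position, length, and sequence fields above can be cross-checked against each other. The following is a minimal Python sketch of that check, not part of the database record. It assumes inclusive 1-based coordinates, the standard genetic code, and that the listed DNA sequence is already in coding orientation (it begins with ATG even though the gene is on the minus strand). The function names are hypothetical.

```python
# Illustrative consistency check for the record above.
# All function names here are hypothetical helpers, not a PAI DB API.
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard bacterial translation table applies to this gene).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(nt_length: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    return nt_length // 3 - 1


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    length = gene_length(3413218, 3414690)
    print(length)                        # 1473, matching the Length field
    print(protein_length(length))        # 490, matching the Note field
    print(translate("ATGAATTGTAGCCTG"))  # first five codons of the DNA sequence
```

Translating the first five codons gives MNCSL, which agrees with the start of the listed protein sequence. Pasting the full DNA sequence into `translate` would extend the same check to all 490 residues.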

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
c3564 | NP_755439.1 | hypothetical protein | Not tested | PAI I CFT073 | Protein | 0.0 | 100

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
c3564 | NP_755439.1 | hypothetical protein | VFG1701 | Protein | 0.0 | 100