Gene Information

Name : ECUMN_0216 (ECUMN_0216)
Accession : YP_002410987.1
Strain : Escherichia coli UMN026
Genome accession: NC_011751
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3523
EC number : -
Position : 243396 - 246923 bp
Length : 3528 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; Product type pm : putative membrane component

DNA sequence :
GTGTTCAGATTTCCGACATCCCGACTGTTCAGCACGCTGAGATCTGCTCTCAGGCCAGCGATGCCGCGATTCAGGGTATC
CGCCGCCTGGCTGCTGGCGCTGGCATGGATTTTTCTGCTGGTGTGGATCTGGTGGCAAGGTCCGAAATGGACGCTCTATG
AGCAGCACTGGCTGGCTCCTCTAACAAACCGCTGGCTGGCGACCGCCGTCTGGGGGCTTATCGCTCTGATCTGGCTCACC
TGGCGGGTAATGAAGCGTCTGCAAAAGCTGGAAAAACAGCAGAAACAGCAGCGGGAGGAAGAAAAAGATCCGTTGACCGT
GGAACTCCACCGCCAGCAGCAATATCTGGATCACTGGCTGCTGCGCCTGCGCCGCCATCTGGATAACCGCCGTTATCTGT
GGCAGTTGCCGTGGTATATGGTCATTGGTCCTGCGGGTAGCGGTAAAAGCGCTCTGCTGCGCGAGGGCTTTCCATCTGAC
ATTATTTACACGCCGGAAAGCATCCGGGGTACGGAATACCATCCGCTGATCACACCGCGAGTGGGCAACCAAGCGGTGAT
TTTCGATGTTGACGGCGTACTGACCTCGCCCGGCGGGGATGATCTGCTCCACCGCCGCCTGCGCGAACACTGGCTGGGCT
GGCTGATGCAAACGCGCGCGCGCCAGCCGCTCAACGGCCTGATCCTGACGCTCGATCTTCCCGATCTGCTGACGGCGGAT
AAATCCCGCCGTGAGACACTGGTACAAAATTTGCGCCAGCAACTTCAGGAGATCCGCCAGAGTCTGCACTGCCGTCTGCC
CGTTTACGTGGTGCTGACACGGCTGGATCTGCTGACCGGCTTTGCCGCGCTGTTCCATTCACTGGATAAAAAAGACCGCG
ATGCGATCCTCGGCGTCACGTTTACCCGCCGCGCCCATGAAAGTGACGACTGGCGCAGCGAACTGGGGGCTTTCTGGCAG
ACGTGGGTACAACAGGTGAACCTGGCGCTGTCGGATCTGATGCTCGCACAAACCGGTGCTGCTCCCCGCAGCGCCGTGTT
CAGCTTCTCCCGTCAGATGCAGGGAACAGGAGAAATCGTCACCGCACTGCTCGCCGCATTGCTGGACGGTGAGAACATGG
ATGTAATGCTGCGTGGCGTCTGGCTCACATCATCGCTACAGCGTGGCCAGGTGGATGATATTTTCACGCAGTCCGCCGCC
CGCCAGTACGGGCTGGGTAACAGCTCGCTGGCAACCTGGCCTCTGGTGGAGACGACGCCGTATTTTACTCGCCGCCTCTT
CCCTGAAGTCCTGCTGGCTGAGCCGAACCTGGCGGGTGAAAACAGCGTCTGGCTGAACAGCTCCCGGCGCAGGCTGACCG
CCTTTTCCGCCTGTGGCGCGGCGCTGGCGGCATTGCTGGTCGGAAGCTGGCACCATTATTACAATCAGAACTGGCAGTCC
GGCGTTAACGTACTGGCACAGGCTAAAGCCTTTATGGACGTACCACCACCGCAAGGAACGGATGAATTCGGCAATCTGCA
ACTGTCGTTGCTTAATCCGGTACGCGATGCCACCCTGGCCTATGGCGATTACCGCGATCGCGGTTTTCTGGCGGATATGG
GATTGTACCAGGGCGTCCGCGTAGGGCCGTATGTGGAGCAAACCTACATTCAGCTTCTTGAGCAGCGTTATCTCCCCTCG
TTAATGAACGGCCTGATCCGGGATCTAAACAATGCCCCGCCAGAGAGCGAAGAAAAGCTCGCCGTGCTGCGCGTACTGCG
CATGATGGAAGACAAAAGTGGGCGCAACAACGAGGCGGTAAAACAGTACATGGCGCGGCGCTGGAGCAATGAATTTCACG
GCCAGCGCGATATTCAGGCGCAACTGATGGCGCATCTGGACTATGCGCTGGAGCACACCGACTGGCACGCGCAGCGCCAG
AGCGGTGACAGCGATGCTGTCAGCCGCTGGACCCCCTATGATAAACCGGTCATTAATGCGCAGCAGGAACTGAGCAAGCT
GCCCATATACCAGCGTGTCTACCAGACCCTGCGCACCAAAGCATTAAGCGTGTTGCCCGCCGATTTGAATTTGCGCGACC
AGGTTGGTCCCACCTTCGACAACGTGTTCGTCGCCGGTAATGATGAAAAACTGGTGATCCCGCAGTTCCTCACCCGCTAT
GGACTGCAAAGCTATTTTGTCAAACAGCGTGAGGGCCTCGTTGAGCTGACCGCGCTGGATTCGTGGGTACTGAACCTGAC
GCAAAGCGTCGCCTACAGCGAGGCCGACCGTGAAGAGATCCAGCGCCATATCACCGAACAGTACATCAGTGACTATACCG
CCACCTGGCGTGCCGGAATGGATAACCTCAACGTCCGTGACTATGAGGCCATGTCGGCGCTGACCGACGCGCTGGAGCAG
ATTATCAGCGGCGATCAGCCATTCCAGCGTGCGCTGACGGCGCTGCGCGATAATACCCACGCGCTGACGCTCTCCGGCAA
ACTGGATGATAAGGCGAGGGAAGCGGCAATAAATGAGATGGATTACCGCCTGTTATCCCGGCTGGGGCATGAGTTCGCAC
CGGAAAACAGCGCACTGGAGGAGCAAAAGGACAAGGCGAGTACGCTACAGGCCGTGTACCAGCAACTGACCGAGCTGCAC
CGTTACCTGCTGGCGATCCAGAACTCGCCAGTGCCGGGGAAATCGGCGCTGAAAGCAGTACAGCTACGTCTGGATCAATA
CAGCAGCGATCCAATCTTCGCTACCCGCCAGATGGCAAAAACTCTCCCTGCACCGCTTAACCGCTGGGTAGGTAAGCTCG
CGGATCAGGCCTGGCATGTGGTGATGGTGGAAGCCGTTCGTTACATGGAAGTGGACTGGCGCGACAATGTAGTGAAACCC
TTCAACGAGCAGCTTGCCGATAACTATCCGTTTAATCCGCGCGCCACACAGGATGCCTCACTGGATTCGTTTGAACGTTT
CTTTAAACCGGATGGCATTCTGGACAATTTCTACAAGAACAACCTGCGCCTGTTCCTTGAAAACGATCTGACCTTTGGCG
ACGACGGCAGAGTGTTAATCCGTGAAGATATCCGGCAGCAACTGGATACCGCGCAGAAAATCCGCGACATCTTCTTCAGC
CAGCAGAACGGGCTGGGCGCACAGTTTGCCGTGGAAACCGTATCGCTTTCCGGCAATAAGCGGCGCAGCGTACTTAACCT
GGACGGCCAGTTAGTGGACTACAGCCAGGGACGCAACTACACCGCCCATCTGGTCTGGCCGAACAACATGCGTGAAGGCA
ATGAAAGCAAGCTGACGCTGATTGGCACCAGCGGCAGAGCACCGCACAGTATCGCGTTCAGTGGACCGTGGGCGCAGTTC
CGCCTGTTCGGCGCGGGCCAGTTGACCAATGTGACCAGTGACACCTTTAACGTGCGCTTTAACGTGGACGGCGGCGCAAT
GGTTTACCGGGTGCATGTGGATACCGAAGATAACCCGTTCACCGGCGGTCTGTTCAGCCTGTTCCGTTTACCGGATACGT
TGTATTAA

Protein sequence :
MFRFPTSRLFSTLRSALRPAMPRFRVSAAWLLALAWIFLLVWIWWQGPKWTLYEQHWLAPLTNRWLATAVWGLIALIWLT
WRVMKRLQKLEKQQKQQREEEKDPLTVELHRQQQYLDHWLLRLRRHLDNRRYLWQLPWYMVIGPAGSGKSALLREGFPSD
IIYTPESIRGTEYHPLITPRVGNQAVIFDVDGVLTSPGGDDLLHRRLREHWLGWLMQTRARQPLNGLILTLDLPDLLTAD
KSRRETLVQNLRQQLQEIRQSLHCRLPVYVVLTRLDLLTGFAALFHSLDKKDRDAILGVTFTRRAHESDDWRSELGAFWQ
TWVQQVNLALSDLMLAQTGAAPRSAVFSFSRQMQGTGEIVTALLAALLDGENMDVMLRGVWLTSSLQRGQVDDIFTQSAA
RQYGLGNSSLATWPLVETTPYFTRRLFPEVLLAEPNLAGENSVWLNSSRRRLTAFSACGAALAALLVGSWHHYYNQNWQS
GVNVLAQAKAFMDVPPPQGTDEFGNLQLSLLNPVRDATLAYGDYRDRGFLADMGLYQGVRVGPYVEQTYIQLLEQRYLPS
LMNGLIRDLNNAPPESEEKLAVLRVLRMMEDKSGRNNEAVKQYMARRWSNEFHGQRDIQAQLMAHLDYALEHTDWHAQRQ
SGDSDAVSRWTPYDKPVINAQQELSKLPIYQRVYQTLRTKALSVLPADLNLRDQVGPTFDNVFVAGNDEKLVIPQFLTRY
GLQSYFVKQREGLVELTALDSWVLNLTQSVAYSEADREEIQRHITEQYISDYTATWRAGMDNLNVRDYEAMSALTDALEQ
IISGDQPFQRALTALRDNTHALTLSGKLDDKAREAAINEMDYRLLSRLGHEFAPENSALEEQKDKASTLQAVYQQLTELH
RYLLAIQNSPVPGKSALKAVQLRLDQYSSDPIFATRQMAKTLPAPLNRWVGKLADQAWHVVMVEAVRYMEVDWRDNVVKP
FNEQLADNYPFNPRATQDASLDSFERFFKPDGILDNFYKNNLRLFLENDLTFGDDGRVLIREDIRQQLDTAQKIRDIFFS
QQNGLGAQFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGRAPHSIAFSGPWAQF
RLFGAGQLTNVTSDTFNVRFNVDGGAMVYRVHVDTEDNPFTGGLFSLFRLPDTLY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec30 YP_851415.1 hypothetical protein Not tested PAI II APEC-O1 Protein 0.0 98
aec30 AAQ96724.1 Aec30 Not tested AGI-1 Protein 0.0 98
pmt1 AAN64194.1 Pmt1 Not tested macrophage toxin pathogenicity island Protein 0.0 58