Gene Information

Name : SeD_A3072 (SeD_A3072)
Accession : YP_002216747.1
Strain : Salmonella enterica CT_02021853
Genome accession: NC_011205
Putative virulence/resistance : Virulence
Product : ATP binding cassette
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG1132
EC number : -
Position : 2962270 - 2965926 bp
Length : 3657 bp
Strand : +
Note : identified by match to protein family HMM PF00005; match to protein family HMM PF00664
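
The Position and Length fields above are related by simple inclusive-coordinate arithmetic. The following is a minimal sketch of that check, not part of the original record; the variable names are illustrative, and it assumes the coordinates are 1-based and inclusive and that the final codon of the gene is a stop codon.

```python
# Values copied from the Position field of this record.
start, end = 2962270, 2965926   # assumed 1-based, inclusive coordinates

# An inclusive span covers end - start + 1 bases.
length_bp = end - start + 1

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(length_bp, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_len = codons - 1

print(length_bp, remainder, protein_len)
```

Under those assumptions the computed length agrees with the Length field (3657 bp), and the expected protein length is 1218 residues.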

DNA sequence :
ATGCCCGCGACTCATTCCCCCATGCCCGCTCGTGCCTGGATAGTTCGCCTTGCCCGCGTGTGTTGGGAACGTAAAACACT
GAGCATCATTGTCATCGTAGCATCAGTATCGACCATTTTACTGGCTGCGCTGACGCCGCTAATAACGCGTCAGGCCGTCA
ATGACGCGATAGCAGGCGATACGACCCACCTGCCGTTACTTGCCTGCGGCCTGCTGTTAATTGCCCTTTTTGATTTTATC
GGGAATTACGTGCGCCGCGGCTATGCCGGGGAACTCTCGCTGTGGGTTCAGCATACGTTACGTAGCCGTGCGTTCGACAG
TATTCAAAAACTGGACGGCGCAGGCCAGGACGCCTTGCGTACCGGGCAGGTCATTTCGCGTACCAACAGCGATCTTCAAC
AGGTACACACCTTGCTACAGATGTGTCCGGTGCCGCTGGCGGTGCTCACTTACTATGTGGCCGGTATCGCGGTGATGCTA
TGGATGTCGCCATCCATGACGCTTATCGTGATTTGCGTACTGGCCGCCCTTGCTATTACCGCCCTGCGCGCCCGTCGCCG
TGTTTTCGCACAAACGGGGCTTGCCTCAGACCGGCTGGCGCACATGACGGAACATATGCGCGAAGTATTGGAACAGATCT
CAGTGGTGAAGTCCTGCGTGGCGGAGTTACGCGAAACGCGTTGGCTCGATGGTCAGTCGCGGCAGATGGTGCGGGTACGC
ATCGGCGCCGCTATCTCACAGGCGATGCCGGGCGCAACCATGCTGGCGCTACCGGTGATAGGGCAAATCGTCCTGCTGTG
CTATGGCGGCTGGTCGGTAATGAATGGGCGCATCGATCTGGGAACCTTCGTTGCGTTCGCGAGTTTTCTCGCTATGCTGA
CCGGCCCTACCCGCGTACTGGCATCGTTTTTGGTTATCGCACAGCGCACACAGGCGTCGGTGGAGCGCGTCTTCGCGCTT
ATCGACACGCGTTCTCGCATGGAAGACGGTACTGAGTCGGTTGAAGGTCAGATTATCGGGCTGGACGTGGAGAAGATGAG
TTTCCACTACGACAACGGTAACCGTATCCTCAATGAGATCTCGTTTTCCATTCACGCCGGTGAAACCGTGGCAGTGGTTG
GCGCCTCCGGCTCCGGGAAATCAACGTTGCTGATGTTGCTGGCGCGCTTTTACGATCCCACCTCCGGCGGGGTGTGGCTC
AACACCACTACGGGTCAACAGAATATTCGCGACCTGAAACTGACGGCGCTTCGTCGTCGCGTTGGCGTAGTGTTTGAAGA
CGCGTTTCTGTTTGCCGGTACGGTGGCGGAAAACATCGCCTATGGGCACCCGCAAGCGACTCAGGACGACATTCGACGCG
CCGCCGATGCCGCAGGCGCCAGCGGGTTCATCAATGCGCTACCGCAGGGGTTCAACACCCGACTGGCCGAACGCGGAAGC
AACCTCTCCGGCGGCCAACGCCAGCGTATTGCGCTAGCCCGGGCACTGATTACCGCGCCGGAACTGCTGATTCTCGATGA
CACAACCTCGGCGGTCGATGCCGGTACTGAAGCGGAAATTAACACGGCTCTTGGTCGTTATGCCGATAATGAGCATATGC
TGCTGGTGATTGCACGCCGCCGTTCAACGCTGCAGTTAGCCGATCGGATCGTCGTGCTGGATAAAGGCCGCGTCGTGGAT
ATCGGTACCCAGGCGGAGCTGGATGCAAGGTGTCCGACGTTTCGCTCGCTGATGAGCGGCGAGGGTGATTTTCTCGCCCT
TGCCCCGGCAGAACAACGGACGCTATGGCCAACAACGCAGGCGGCAAAATCCGACGACGCGCATGAGCGCCAGACACCCG
CCGGAAAAGGTTTTGTCGACCGGATGACGCGCGTTCCGGAACGCGCCGTACAGATGGCGCTGGCAGGCCACGGGCGTCAA
GTCTCATCGCTGCTGACGCCAGTAGCCTGGATGTTCGTCATCGCCGCCCTGCTTATCGCGCTTGATTCCGCCGCAGGCGT
TGGCGTGCTGGTGCTTTTGCAGCGCGGTATTGACTCAGGCGTTGCCGCAGGGGATATGTCGACTATTGGCATATGCGCTC
TGCTCGCGCTGTGTCTGGTAGCGATCGGTTGGTGCTGCTATGCGCTGCAAACGATCTTCGCCGCCCGCGCGGCAGAGTCG
GTACAGCATACGGTACGCTTACGCAGCTTCAGCCACCTGCTACGCCTGAGTCTTCCCTGGCACGAGAAGCACATCGACTC
GCGTCTTACTCGCATGACCGTCGACGTTGATTCACTCGCCCGATTTCTGCAAAACGGTCTTGCCAGCGCGGCCACCAGCA
TCGTGACGATGGTCGCTATCGCCGCGGCAATGTTCTGGCTTGACCCCATTCTTGCGTTAACGGCATTAAGCGCCGTACCT
GTAGTGATACTGGCGACGTGGATTTACCGCCGTTTGAGCTCGCCTGCCTACGCCCAGGCACGGCTGGAAATTGGTAAGGT
GAACAGTACGCTTCAGGAAAAGGTCTCCGGTATGCGGGTAGTGCAGTCACACGGTCAACAGAAGCAGGAAGCCGCCAGGC
TACGGGCGTTATCAGACAACTTCCGCGCCACCCGCGTGCGGGCGCAAAAATACCTTGCTGTCTATTTTCCGTTCCTGACC
CTCTGCACCGAGGCCGCCTATGCCGCCGTGCTTTTAATCGGGGCTACCCGGGTCGCCGGAGGCGAAATGACGCCCGGGAT
ACTGGCGGCGTTCTTCCTGCTACTGGGACAATTTTATGGCCCGGTACAGCAGTTGTCAGGCATTGTTGATTCCTGGCAGC
AGGCGACCGCCAGCGGCAAACATATCAATGCGCTATTGGCGACTGAGGAAACGGAGAATATTGAACCGTCCTCCATAACG
CCTGGCACTGGCGGCGCGCTACGTCTGGAGGCATTGACATTCCGCTATCCCGAAAAAACGCAACCTGTGCTTGATAATCT
CTCGCTCACCATTCCGCCAGGAACGGTGGTCGCGGTGGTCGGACGTAGCGGCGCCGGCAAATCGACGCTGATCAAGCTGC
TGGCCGGGCTCTACTCTCCCGGCAGCGGGCAAATCCGTGTCGGTGAGCGCCTAATTGATGCCGCGTCGCTTAGTGATTAT
CGTCGCCAGACGGGGCTGGTCACTCAGGATGTCGCATTATTTAGCGGCGATATTGCCGAAAATATCCGCTATTCGCTGCC
AGACAGCAGCGACACGGAGGTGGAGATCGCGGCGCGACAGGCGGGACTCTTTGAAACTGTGCAACATCTGCCGCTGGGGT
TCCGTACTCCGGTCAATAACGGCGGCACGGATCTGTCCGCGGGCCAGCGTCAGTTGATTGCCCTCGCCCGCGCCCACCTG
GCGCAGGCGCATATTCTGCTGCTCGACGAGGCGACAGCGCGTATCGACCGTAGCGCCGAGGAGCGCTTAATGACCTCGCT
TACCAGGGTGACGCATACCGAGAAACGCATCGCGCTTATCGTCGCGCACCGGCTGACCACCGCTCGCCGTTGCGATGTTA
TTGTCGTAATCGATAAAGGATGTATCGCTGAATATGGCAGCCATGAGCAGTTGATAGCGACTCATGGCCTGTATGCTCGT
CTGTGGCGGGACAGCATCGGCCAGACACGCGATACGCAAGGAGAGGTCATAGGATAG

Protein sequence :
MPATHSPMPARAWIVRLARVCWERKTLSIIVIVASVSTILLAALTPLITRQAVNDAIAGDTTHLPLLACGLLLIALFDFI
GNYVRRGYAGELSLWVQHTLRSRAFDSIQKLDGAGQDALRTGQVISRTNSDLQQVHTLLQMCPVPLAVLTYYVAGIAVML
WMSPSMTLIVICVLAALAITALRARRRVFAQTGLASDRLAHMTEHMREVLEQISVVKSCVAELRETRWLDGQSRQMVRVR
IGAAISQAMPGATMLALPVIGQIVLLCYGGWSVMNGRIDLGTFVAFASFLAMLTGPTRVLASFLVIAQRTQASVERVFAL
IDTRSRMEDGTESVEGQIIGLDVEKMSFHYDNGNRILNEISFSIHAGETVAVVGASGSGKSTLLMLLARFYDPTSGGVWL
NTTTGQQNIRDLKLTALRRRVGVVFEDAFLFAGTVAENIAYGHPQATQDDIRRAADAAGASGFINALPQGFNTRLAERGS
NLSGGQRQRIALARALITAPELLILDDTTSAVDAGTEAEINTALGRYADNEHMLLVIARRRSTLQLADRIVVLDKGRVVD
IGTQAELDARCPTFRSLMSGEGDFLALAPAEQRTLWPTTQAAKSDDAHERQTPAGKGFVDRMTRVPERAVQMALAGHGRQ
VSSLLTPVAWMFVIAALLIALDSAAGVGVLVLLQRGIDSGVAAGDMSTIGICALLALCLVAIGWCCYALQTIFAARAAES
VQHTVRLRSFSHLLRLSLPWHEKHIDSRLTRMTVDVDSLARFLQNGLASAATSIVTMVAIAAAMFWLDPILALTALSAVP
VVILATWIYRRLSSPAYAQARLEIGKVNSTLQEKVSGMRVVQSHGQQKQEAARLRALSDNFRATRVRAQKYLAVYFPFLT
LCTEAAYAAVLLIGATRVAGGEMTPGILAAFFLLLGQFYGPVQQLSGIVDSWQQATASGKHINALLATEETENIEPSSIT
PGTGGALRLEALTFRYPEKTQPVLDNLSLTIPPGTVVAVVGRSGAGKSTLIKLLAGLYSPGSGQIRVGERLIDAASLSDY
RRQTGLVTQDVALFSGDIAENIRYSLPDSSDTEVEIAARQAGLFETVQHLPLGFRTPVNNGGTDLSAGQRQLIALARAHL
AQAHILLLDEATARIDRSAEERLMTSLTRVTHTEKRIALIVAHRLTTARRCDVIVVIDKGCIAEYGSHEQLIATHGLYAR
LWRDSIGQTRDTQGEVIG
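
The protein sequence corresponds to a direct translation of the DNA sequence on the + strand. The sketch below illustrates this on the first and last few codons only; it is not the database's own tooling. The `translate` helper and its names are hypothetical, and it assumes the standard genetic code (the bacterial table assigns the same amino acids to each codon).

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 0; '*' marks a stop, 'X' an unknown codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3   # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 24 and last 15 nucleotides of the DNA sequence in this record.
first = translate("ATGCCCGCGACTCATTCCCCCATG")
last = translate("GAGGTCATAGGATAG")
print(first, last)
```

The first fragment should reproduce the start of the protein sequence (MPATHSPM), and the last fragment its final residues followed by the stop codon.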

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
iroC | CAC43427.2 | ABC transport protein | Virulence | PAI III 536 | Protein | 0.0 | 80

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
SeD_A3072 | YP_002216747.1 | ATP binding cassette | VFG1653 | Protein | 0.0 | 80