A Decomposition Approach for Discovering Discriminative Motifs in a Sequence Database

Research output: Chapter in Book/Report/Conference proceedingsChapterpeer-review

Abstract

Considerable effort has been invested over the years in ad-hoc algorithms for item set and pattern mining. Constraint programming has recently been proposed as a means to tackle item set mining tasks within a general modelling framework. We follow this approach to address the discovery of discriminative n-ary motifs in databases of labeled sequences. We define a n-ary motif as a mapping of n patterns to n class-wide embeddings and we restrict the interpretation of constraints on a motif to the sequences embedding all patterns. We formulate core constraints that minimize redundancy between motifs and introduce a general constraint optimization framework to compute common and exclusive motifs. We cast the discovery of closed and replication-free motifs in this framework for which we propose a two-stage approach based on constraint programming. Experimental results on datasets of protein sequences demonstrate the efficiency of the approach.

Original languageEnglish
Title of host publicationProceedings - 2014 IEEE 26th International Conference on Tools with Artificial Intelligence, ICTAI 2014
PublisherIEEE Computer Society
Pages544-551
Number of pages8
ISBN (Electronic)9781479965724
DOIs
Publication statusPublished - 12 Dec 2014
Event26th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2014 - Limassol, Cyprus
Duration: 10 Nov 201412 Nov 2014

Publication series

NameProceedings - International Conference on Tools with Artificial Intelligence, ICTAI
Volume2014-December
ISSN (Print)1082-3409

Conference

Conference26th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2014
Country/TerritoryCyprus
CityLimassol
Period10/11/1412/11/14

Keywords

  • Bioinformatics
  • Constraint Programming
  • Motifs
  • Optimization

Fingerprint

Dive into the research topics of 'A Decomposition Approach for Discovering Discriminative Motifs in a Sequence Database'. Together they form a unique fingerprint.

Cite this