Abstract
Itemset and pattern mining has numerous applications ranging from Marketing to Bioinformatics. We introduce a language, dubbed Maximal Matrix Problem (MMP), to model such problems. An instance of MMP is based on a matrix of finite domain variables and a set of matrix constraints. A solution is a maximal consistent submatrix whose assignment of the variables in its scope satisfies the constraints but cannot be extended over additional lines while preserving consistency. We propose a generic CP model for MMP and present various types of matrix contraints. We then tackle the problem of partially or totally ordering patterns that have been prelocalized over sequences in order to exclude predefined sequences. We present two CP models to solve these MMP together with a genetic algorithm. Experiments on datasets of protein sequences demonstrate the efficiency of the approach.
| Original language | English |
|---|---|
| Pages | 143-152 |
| Number of pages | 10 |
| Publication status | Published - 2016 |
| Event | Douziemes Journees Francophones de Programmation par Contraintes, JFPC 2016 - 12th French-Speaking Conference on Constraint Programming, JFPC 2016 - Montpellier, France Duration: 15 Jun 2016 → 17 Jun 2016 |
Conference
| Conference | Douziemes Journees Francophones de Programmation par Contraintes, JFPC 2016 - 12th French-Speaking Conference on Constraint Programming, JFPC 2016 |
|---|---|
| Country/Territory | France |
| City | Montpellier |
| Period | 15/06/16 → 17/06/16 |