Representative itemset mining

Research output: Chapter in Book/Report/Conference proceedingsConference proceedingpeer-review

Abstract

Frequent itemset mining is one of the most common of data mining tasks. In its simplest form one is given a table of data in which the columns represent attributes and each row specifies a value for each attribute, each attributevalue pair being referred to as an item. The task is to find sets of these items that occur frequently in the data, where frequency is specified as a minimum occurrence threshold. Such frequent sets of items are referred to as "frequent itemsets". Many efficient techniques have been developed for finding all frequent itemsets. However, a practical problem is that the results sets can be exponentially large in the number of items. In this paper we propose representative frequent itemset mining in which the set of itemsets returned provide examples of the space of all possible frequent itemsets. Specifically, every item that appears in a frequent itemset at least once is shown in at least one representative itemset. If there are frequent itemsets without a particular item, one such example will be presented. One can generalise our framework to seek representative sets in which pairs, triples, etc. of frequent itemsets are presented. One can see the representative frequent itemset framework as a generalisation of traditional frequent itemset mining that provides an additional parameter for controlling the size of the result set. Specifically, one has access to the traditional frequency threshold, but also the maximum arity of the tuples of itemsets being exemplified. We propose a dedicated algorithm that significantly outperforms using a state-of-The-Art itemset miner in generating representative itemsets.

Original languageEnglish
Title of host publicationProceedings - 2016 IEEE 28th International Conference on Tools with Artificial Intelligence, ICTAI 2016
EditorsAnna Esposito, Miltos Alamaniotis, Amol Mali, Nikolaos Bourbakis
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages142-148
Number of pages7
ISBN (Electronic)9781509044597
DOIs
Publication statusPublished - 11 Jan 2017
Event28th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2016 - San Jose, United States
Duration: 6 Nov 20168 Nov 2016

Publication series

NameProceedings - 2016 IEEE 28th International Conference on Tools with Artificial Intelligence, ICTAI 2016

Conference

Conference28th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2016
Country/TerritoryUnited States
CitySan Jose
Period6/11/168/11/16

Fingerprint

Dive into the research topics of 'Representative itemset mining'. Together they form a unique fingerprint.

Cite this