Active learning with visualization for text data

Research output: Chapter in Book/Report/Conference proceedingsChapterpeer-review

Abstract

Labeled datasets are always limited, and oftentimes the quantity of labeled data is a bottleneck for data analytics. This especially affects supervised machine learning methods, which require labels for models to learn from the labeled data. Active learning algorithms have been proposed to help achieve good analytic models with limited labeling efforts, by determining which additional instance labels will be most beneficial for learning for a given model. Active learning is consistent with interactive analytics as it proceeds in a cycle in which the unlabeled data is automatically explored. However, in active learning users have no control of the instances to be labeled, and for text data, the annotation interface is usually document only. Both of these constraints seem to affect the performance of an active learning model. We hypothesize that visualization techniques, particularly interactive ones, will help to address these constraints. In this paper, we implement a pilot study of visualization in active learning for text classification, with an interactive labeling interface. We compare the results of three experiments. Early results indicate that visualization improves high-performance machine learning model building with an active learning algorithm. Copyright is held by the owner/author(s). Publication rights licensed to ACM.

Original languageEnglish
Title of host publicationESIDA 2017 - Proceedings of the 2017 ACM Workshop on Exploratory Search and Interactive Data Analytics, co-located with IUI 2017
PublisherAssociation for Computing Machinery, Inc
Pages69-74
Number of pages6
ISBN (Electronic)9781450349031
DOIs
Publication statusPublished - 13 Mar 2017
Externally publishedYes
EventACM Workshop on Exploratory Search and Interactive Data Analytics, ESIDA 2017 - Limassol, Cyprus
Duration: 13 Mar 2017 → …

Publication series

NameESIDA 2017 - Proceedings of the 2017 ACM Workshop on Exploratory Search and Interactive Data Analytics, co-located with IUI 2017

Conference

ConferenceACM Workshop on Exploratory Search and Interactive Data Analytics, ESIDA 2017
Country/TerritoryCyprus
CityLimassol
Period13/03/17 → …

Keywords

  • Active learning
  • Text classification
  • Visualization

Fingerprint

Dive into the research topics of 'Active learning with visualization for text data'. Together they form a unique fingerprint.

Cite this