An approach to supporting incremental visual data classification

  • Jose Gustavo S. Paiva
  • , William Robson Schwartz
  • , Helio Pedrini
  • , Rosane Minghim

Research output: Contribution to journalArticlepeer-review

Abstract

Automatic data classification is a computationally intensive task that presents variable precision and is considerably sensitive to the classifier configuration and to data representation, particularly for evolving data sets. Some of these issues can best be handled by methods that support users' control over the classification steps. In this paper, we propose a visual data classification methodology that supports users in tasks related to categorization such as training set selection; model creation, application and verification; and classifier tuning. The approach is then well suited for incremental classification, present in many applications with evolving data sets. Data set visualization is accomplished by means of point placement strategies, and we exemplify the method through multidimensional projections and Neighbor Joining trees. The same methodology can be employed by a user who wishes to create his or her own ground truth (or perspective) from a previously unlabeled data set. We validate the methodology through its application to categorization scenarios of image and text data sets, involving the creation, application, verification, and adjustment of classification models.

Original languageEnglish
Article number6840370
Pages (from-to)4-17
Number of pages14
JournalIEEE Transactions on Visualization and Computer Graphics
Volume21
Issue number1
DOIs
Publication statusPublished - 1 Jan 2015
Externally publishedYes

Keywords

  • information visualization
  • multidimensional point placement
  • Visual image classification

Fingerprint

Dive into the research topics of 'An approach to supporting incremental visual data classification'. Together they form a unique fingerprint.

Cite this