Interactive document clustering revisited: A visual analytics approach

  • Ehsan Sherkat
  • , Seyednaser Nourashrafeddin
  • , Evangelos E. Milios
  • , Rosane Minghim

Research output: Chapter in Book/Report/Conference proceedingsChapterpeer-review

Abstract

UPDATED-December 29, 2017. Document clustering is an efficient way to get insight into large text collections. Due to the personalized nature of document clustering, even the best fully automatic algorithms cannot create clusters that accurately reflect the user's perspectives. To incorporate the user's perspective in the clustering process and, at the same time, effectively visualize document collections to enhance user's sense-making of data, we propose a novel visual analytics system for interactive document clustering. We built our system on top of clustering algorithms that can adapt to user's feedback. First, the initial clustering is created based on the user-defined number of clusters and the selected clustering algorithm. Second, the clustering result is visualized to the user. A collection of coordinated visualization modules and document projection is designed to guide the user towards a better insight into the document collection and clusters. The user changes clusters and key-terms iteratively as a feedback to the clustering algorithm until the result is satisfactory. In key-term based interaction, the user assigns a set of key-terms to each target cluster to guide the clustering algorithm. A set of quantitative experiments, a use case, and a user study have been conducted to show the advantages of the approach for document analytics based on clustering.

Original languageEnglish
Title of host publicationIUI 2018 - Proceedings of the 23rd International Conference on Intelligent User Interfaces
PublisherAssociation for Computing Machinery
Pages281-292
Number of pages12
ISBN (Electronic)9781450349451
DOIs
Publication statusPublished - 5 Mar 2018
Externally publishedYes
Event23rd ACM International Conference on Intelligent User Interfaces, IUI 2018 - Tokyo, Japan
Duration: 7 Mar 201811 Mar 2018

Publication series

NameInternational Conference on Intelligent User Interfaces, Proceedings IUI

Conference

Conference23rd ACM International Conference on Intelligent User Interfaces, IUI 2018
Country/TerritoryJapan
CityTokyo
Period7/03/1811/03/18

Keywords

  • Document projection
  • Interactive document clustering
  • Key-term
  • Text
  • User study
  • Visualization

Fingerprint

Dive into the research topics of 'Interactive document clustering revisited: A visual analytics approach'. Together they form a unique fingerprint.

Cite this