TY - GEN
T1 - Content-based text mapping using multi-dimensional projections for exploration of document collections
AU - Minghim, Rosane
AU - Paulovich, Fernando Vieira
AU - De Lopes, Alneu Andrade
PY - 2006
Y1 - 2006
N2 - This paper presents a technique for generation of maps of documents targeted at placing similar documents in the same neighborhood. As a result, besides being able to group (and separate) documents by their contents, it runs at very manageable computational costs. Based on multi-dimensional projection techniques and an algorithm for projection improvement, it results in a surface map that allows the user to identify a number of important relationships between documents and sub-groups of documents via visualization and interaction. Visual attributes such as height, color, isolines and glyphs, as well as aural attributes (such as pitch), help add dimensions for integrated visual analysis. Exploration and narrowing of focus can be performed using a set of tools provided. This novel text mapping technique, named IDMAP (Interactive Document Map), is fully described in this paper. Results are compared with dimensionality reduction and cluster techniques for the same purposes. The maps are bound to support a large number of applications that rely on retrieval and examination of document collections and to complement the type of information offered by current knowledge domain visualizations.
KW - Document mapping
KW - Domain knowledge visualization
KW - IDMAP
KW - Multi-dimensional projection
KW - Text visualization
UR - https://www.scopus.com/pages/publications/33645666704
U2 - 10.1117/12.650880
DO - 10.1117/12.650880
M3 - Conference proceeding
AN - SCOPUS:33645666704
SN - 0819461008
SN - 9780819461001
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - Visualization and Data Analysis 2006 - Proceedings of SPIE-IS&T Electronic Imaging
T2 - Visualization and Data Analysis 2006
Y2 - 16 January 2006 through 17 January 2006
ER -