TY - GEN
T1 - Visual mapping of text collections through a fast high precision projection technique
AU - Paulovich, Fernando Vieira
AU - Nonato, Luis Gustavo
AU - Minghim, Rosane
AU - Levkowitz, Haim
PY - 2006
Y1 - 2006
N2 - This paper introduces Least Square Projection (LSP), a fast technique for projection of multi-dimensional data onto lower dimensions developed and tested successfully in the context of creation of text maps based on their content. Current solutions are either based on computationally expensive dimension reduction with no proper guarantee of the outcome or on faster techniques that need some sort of post-processing for recovering information lost during the process. LSP is based on least square approximation, a technique originally employed for surface modeling and reconstruction. Least square approximations are capable of computing the coordinates of a set of projected points based on a reduced number of control points with defined geometry. We extend the concept for general data sets. In order to perform the projection, a small number of distance calculations is necessary and no repositioning of the final points is required to obtain a satisfactory precision of the final solution. Textual information is a typically difficult data type to handle, due to its intrinsic dimensionality. We employ document corpora as a benchmark to demonstrate the capabilities of the LSP to group and separate documents by their content with high precision.
AB - This paper introduces Least Square Projection (LSP), a fast technique for projection of multi-dimensional data onto lower dimensions developed and tested successfully in the context of creation of text maps based on their content. Current solutions are either based on computationally expensive dimension reduction with no proper guarantee of the outcome or on faster techniques that need some sort of post-processing for recovering information lost during the process. LSP is based on least square approximation, a technique originally employed for surface modeling and reconstruction. Least square approximations are capable of computing the coordinates of a set of projected points based on a reduced number of control points with defined geometry. We extend the concept for general data sets. In order to perform the projection, a small number of distance calculations is necessary and no repositioning of the final points is required to obtain a satisfactory precision of the final solution. Textual information is a typically difficult data type to handle, due to its intrinsic dimensionality. We employ document corpora as a benchmark to demonstrate the capabilities of the LSP to group and separate documents by their content with high precision.
UR - https://www.scopus.com/pages/publications/34248169972
U2 - 10.1109/IV.2006.122
DO - 10.1109/IV.2006.122
M3 - Conference proceeding
AN - SCOPUS:34248169972
SN - 0769526020
SN - 9780769526027
T3 - Proceedings of the International Conference on Information Visualisation
SP - 282
EP - 288
BT - Information Visualization 2006, IV06
T2 - Information Visualization 2006, IV06
Y2 - 5 July 2006 through 7 July 2006
ER -