Abstract
TextVis is a visual data mining system for document collections. Such a collection represents an application domain, and the primary goal of the system is to derive patterns that provide knowledge about this domain. Additionally, the derived patterns can be used to browse the collection. TextVis takes a multi-strategy approach to text mining and lets users define complex analysis schemas from basic components provided by the system. An analysis schema is constructed by dragging functional icons from a tool palette onto the workspace and connecting them according to the desired flow of information. The system provides a large collection of basic analysis tools, including frequent sets, associations, concept distributions, and concept correlations. The discovered patterns are presented in a visual interface that allows the user to operate on the results and to access the associated documents. TextVis is a complete text mining system that uses agent technology to access various online information sources, text preprocessing tools to extract relevant information from the documents, a variety of data mining algorithms, and a set of visual browsers to view the results. This paper provides an overview of the TextVis system. We describe the system's architecture and its various tools, and discuss the advantages of our visual environment for mining large document collections.
Original language | English |
---|---|
Title of host publication | Principles of Data Mining and Knowledge Discovery - 2nd European Symposium, PKDD 1998, Proceedings |
Editors | Jan M. Zytkow, Mohamed Quafafou |
Publisher | Springer Verlag |
Pages | 56-64 |
Number of pages | 9 |
ISBN (Print) | 3540650687, 9783540650683 |
State | Published - 1998 |
Externally published | Yes |
Event | 2nd European Symposium on Principles of Data Mining and Knowledge Discovery in Databases, PKDD 1998 - Nantes, France. Duration: 23 Sep 1998 → 26 Sep 1998 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 1510 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 2nd European Symposium on Principles of Data Mining and Knowledge Discovery in Databases, PKDD 1998 |
---|---|
Country/Territory | France |
City | Nantes |
Period | 23/09/98 → 26/09/98 |
Bibliographical note
Publisher Copyright: © Springer-Verlag Berlin Heidelberg 1998.