Document explorer: Discovering knowledge in document collections

Ronen Feldman, Willi Klösgen, Amir Zilberstein

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

3 Scopus citations

Abstract

Document Explorer is a data mining system for document collections. Such a collection represents an application domain, and the primary goal of the system is to derive patterns that provide knowledge about this domain. Additionally, the derived patterns can be used to browse the collection. Document Explorer searches for patterns that capture relations between concepts of the domain. The patterns that have been verified as interesting are structured and presented in a visual user interface, allowing the user to operate on the results to refine and redirect mining queries or to access the associated documents. The system offers preprocessing tools to construct or refine a knowledge base of domain concepts and to create an intermediate representation of the document collection that will be used by all subsequent data mining operations. The main pattern types the system can search for are frequent sets, associations, concept distributions, and keyword graphs. To enable the user to provide some explicit bias, the system provides a dedicated query language for searching the vast implicit spaces of pattern instances that exist in the collection.
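As a rough illustration of two of the pattern types named in the abstract (frequent sets and associations), the following Python sketch counts co-occurring concepts across documents and derives simple rules from them. It is not the Document Explorer implementation: the function names `frequent_sets` and `associations`, the thresholds, and the sample concept labels are all assumptions made for this example, and each document is assumed to be already reduced to a set of concept labels by some preprocessing step.

```python
from collections import Counter
from itertools import combinations


def frequent_sets(docs, min_support, max_size=2):
    """Count concept sets (up to max_size concepts) and keep those that
    occur in at least min_support documents. Hypothetical helper."""
    counts = Counter()
    for concepts in docs:
        unique = sorted(set(concepts))
        for size in range(1, max_size + 1):
            for combo in combinations(unique, size):
                counts[frozenset(combo)] += 1
    return {s: c for s, c in counts.items() if c >= min_support}


def associations(freq, min_confidence):
    """Derive rules lhs -> rhs from frequent pairs, where
    confidence = support(pair) / support(lhs). Hypothetical helper."""
    rules = []
    for itemset, support in freq.items():
        if len(itemset) != 2:
            continue
        a, b = sorted(itemset)
        for lhs, rhs in ((a, b), (b, a)):
            # Every subset of a frequent pair is itself frequent,
            # so the single-concept count is always available here.
            confidence = support / freq[frozenset([lhs])]
            if confidence >= min_confidence:
                rules.append((lhs, rhs, support, confidence))
    return sorted(rules)


# Invented sample data: each document is a set of concept labels.
docs = [
    {"iran", "oil", "opec"},
    {"iran", "oil"},
    {"oil", "opec"},
    {"iran", "usa"},
]

freq = frequent_sets(docs, min_support=2)
rules = associations(freq, min_confidence=0.9)
```

With this sample data, the only rule that reaches 0.9 confidence is `opec -> oil`, because every document mentioning `opec` also mentions `oil`. A brute-force count like this is only practical for small concept sets; a real system would prune candidates level by level.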

Original language: English
Title of host publication: Foundations of Intelligent Systems - 10th International Symposium, ISMIS 1997, Proceedings
Editors: Zbigniew W. Ras, Andrzej Skowron
Publisher: Springer Verlag
Pages: 137-146
Number of pages: 10
ISBN (Print): 3540636145, 9783540636144
State: Published - 1997
Externally published: Yes
Event: 10th International Symposium on Methodologies for Intelligent Systems, ISMIS 1997 - Charlotte, United States
Duration: 15 Oct 1997 to 18 Oct 1997

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 1325
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 10th International Symposium on Methodologies for Intelligent Systems, ISMIS 1997
Country/Territory: United States
City: Charlotte
Period: 15/10/97 to 18/10/97

Bibliographical note

Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 1997.
