Abstract
As the cost of collecting and storing large amounts of data continues to drop, we see a constant rise in the amount of telemetry data collected by software applications and services. With the data mounding up, there is an increasing need for algorithms to automatically and efficiently mine insights from the collected data. One interesting case is the description of large tables using frequently occurring patterns, with implications for failure analysis and customer engagement. Finding frequently occurring patterns has applications both in an interactive usage where an analyst repeatedly query the data and in a completely automated process queries the data periodically and generate alerts and or reports based on the mining. Here we propose two novel mining algorithms for the purpose of computing such predominant patterns in relational data. The first method is a fast heuristic search, and the second is based on an adaptation of the apriori algorithm. Our methods are demonstrated on real-world datasets, and extensions to some additional fundamental mining tasks are discussed.
Original language | English |
---|---|
Title of host publication | ICPRAM 2017 - Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods |
Editors | Maria De De Marsico, Gabriella Sanniti di Baja, Ana Fred |
Publisher | SciTePress |
Pages | 309-317 |
Number of pages | 9 |
ISBN (Electronic) | 9789897582226 |
DOIs | |
State | Published - 2017 |
Event | 6th International Conference on Pattern Recognition Applications and Methods, ICPRAM 2017 - Porto, Portugal Duration: 24 Feb 2017 → 26 Feb 2017 |
Publication series
Name | ICPRAM 2017 - Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods |
---|---|
Volume | 2017-January |
Conference
Conference | 6th International Conference on Pattern Recognition Applications and Methods, ICPRAM 2017 |
---|---|
Country/Territory | Portugal |
City | Porto |
Period | 24/02/17 → 26/02/17 |
Bibliographical note
Publisher Copyright:Copyright © 2017 by SCITEPRESS - Science and Technology Publications, Lda. All rights reserved.
Keywords
- Data mining
- Failure analysis
- Pattern mining
- Software telemetry
- Subspace clustering