Interactive visualization of financial data : Development of a visual data mining tool

Bibliographic Details
Main Author:	Saltin, Joakim
Format:	Others
Language:	English
Published:	Uppsala universitet, Institutionen för informationsteknologi 2012
Subjects:	visual data mining; visualization; data warehousing; software engineering; materialized views; OLAP; OLAP cubes; greedy algorithm; high-performance query
Online Access:	http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-181225

Description
Summary:	In this project, a prototype visual data mining tool was developed that lets users interactively explore large multi-dimensional datasets through 2D visualizations, using so-called drill-down, roll-up and slicing operations. The project covered all steps of development, from writing specifications and designing the program to implementing and evaluating it. Drawing on ideas from data warehousing, custom methods for storing pre-computed aggregations of data (commonly referred to as materialized views), and for retrieving data from them, were developed and implemented to achieve higher performance on large datasets. View materialization enables the program to fetch or compute a view from other views, which can yield significant performance gains when the views are much smaller than the underlying raw dataset. The views to materialize were chosen automatically using a well-known algorithm, the greedy algorithm for view materialization, which selects the subset of all possible views that is likely (but not guaranteed) to yield the best performance gain. Materialized views showed good potential to improve performance on large datasets, with an average speedup (compared to on-the-fly queries) of between 20 and 70 times for a test dataset containing 500,000 rows. The end result was a program combining flexibility with good performance, which was also reflected in good scores in a user-acceptance test with participants from the company where the project was carried out.
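
The greedy selection mentioned in the summary can be illustrated with a minimal sketch. It follows the commonly described formulation of greedy view selection on a data-cube lattice and is not taken from the thesis itself. The function names (`query_cost`, `greedy_select`), the three dimensions and all view sizes are hypothetical. A view is modeled as the set of dimensions it groups by, and a view can be computed from any materialized view that contains all of its dimensions.

```python
def query_cost(view, materialized, sizes):
    """Cost of answering `view`: rows in the smallest materialized view
    whose dimensions include all dimensions of `view`."""
    return min(sizes[m] for m in materialized if view <= m)


def greedy_select(sizes, k):
    """Pick up to k views to materialize in addition to the top view.

    sizes maps each view (a frozenset of dimension names) to its
    estimated row count. The top view (all dimensions) is assumed to
    be always available, since every other view can be derived from it.
    """
    top = max(sizes, key=len)
    materialized = [top]
    for _ in range(k):
        best_view, best_benefit = None, 0
        for candidate in sizes:
            if candidate in materialized:
                continue
            # Benefit: total rows saved over every view that could be
            # answered from the candidate instead of its current source.
            benefit = sum(
                max(0, query_cost(w, materialized, sizes) - sizes[candidate])
                for w in sizes
                if w <= candidate
            )
            if benefit > best_benefit:
                best_view, best_benefit = candidate, benefit
        if best_view is None:  # no remaining view improves any query
            break
        materialized.append(best_view)
    return materialized


# Hypothetical row counts for a cube with dimensions a, b and c.
example_sizes = {
    frozenset("abc"): 100,
    frozenset("ab"): 50,
    frozenset("ac"): 75,
    frozenset("bc"): 100,
    frozenset("a"): 20,
    frozenset("b"): 30,
    frozenset("c"): 40,
    frozenset(): 1,
}
selected = greedy_select(example_sizes, k=2)
```

With these hypothetical sizes, the first round picks the view over `a` and `b`, because it halves the cost of four views at once. The selection is made one step at a time, so the result is a heuristic rather than a guaranteed optimum, as the summary notes.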

Similar Items