Methodologies in Predictive Visual Analytics

abstract: Predictive analytics embraces an extensive area of techniques from statistical modeling to machine learning to data mining and is applied in business intelligence, public health, disaster management and response, and many other fields. To date, visualization has been broadly used to suppor...

Full description

Bibliographic Details
Other Authors: Lu, Yafeng (Author)
Format: Doctoral Thesis
Language:English
Published: 2017
Subjects:
Online Access:http://hdl.handle.net/2286/R.I.43997
id ndltd-asu.edu-item-43997
record_format oai_dc
spelling ndltd-asu.edu-item-439972018-06-22T03:08:13Z Methodologies in Predictive Visual Analytics abstract: Predictive analytics embraces an extensive area of techniques from statistical modeling to machine learning to data mining and is applied in business intelligence, public health, disaster management and response, and many other fields. To date, visualization has been broadly used to support tasks in the predictive analytics pipeline under the underlying assumption that a human-in-the-loop can aid the analysis by integrating domain knowledge that might not be broadly captured by the system. Primary uses of visualization in the predictive analytics pipeline have focused on data cleaning, exploratory analysis, and diagnostics. More recently, numerous visual analytics systems for feature selection, incremental learning, and various prediction tasks have been proposed to support the growing use of complex models, agent-specific optimization, and comprehensive model comparison and result exploration. Such work is being driven by advances in interactive machine learning and the desire of end-users to understand and engage with the modeling process. However, despite the numerous and promising applications of visual analytics to predictive analytics tasks, work to assess the effectiveness of predictive visual analytics is lacking. This thesis studies the current methodologies in predictive visual analytics. It first defines the scope of predictive analytics and presents a predictive visual analytics (PVA) pipeline. Following the proposed pipeline, a predictive visual analytics framework is developed to be used to explore under what circumstances a human-in-the-loop prediction process is most effective. This framework combines sentiment analysis, feature selection mechanisms, similarity comparisons and model cross-validation through a variety of interactive visualizations to support analysts in model building and prediction. To test the proposed framework, an instantiation for movie box-office prediction is developed and evaluated. Results from small-scale user studies are presented and discussed, and a generalized user study is carried out to assess the role of predictive visual analytics under a movie box-office prediction scenario. Dissertation/Thesis Lu, Yafeng (Author) Maciejewski, Ross (Advisor) Cooke, Nancy (Committee member) Liu, Huan (Committee member) He, Jingrui (Committee member) Arizona State University (Publisher) Computer science Movie Box Office Predictive Analytics Social Media User-in-the-Loop Visual Analytics Visualization eng 169 pages Doctoral Dissertation Engineering 2017 Doctoral Dissertation http://hdl.handle.net/2286/R.I.43997 http://rightsstatements.org/vocab/InC/1.0/ All Rights Reserved 2017
collection NDLTD
language English
format Doctoral Thesis
sources NDLTD
topic Computer science
Movie Box Office
Predictive Analytics
Social Media
User-in-the-Loop
Visual Analytics
Visualization
spellingShingle Computer science
Movie Box Office
Predictive Analytics
Social Media
User-in-the-Loop
Visual Analytics
Visualization
Methodologies in Predictive Visual Analytics
description abstract: Predictive analytics embraces an extensive area of techniques from statistical modeling to machine learning to data mining and is applied in business intelligence, public health, disaster management and response, and many other fields. To date, visualization has been broadly used to support tasks in the predictive analytics pipeline under the underlying assumption that a human-in-the-loop can aid the analysis by integrating domain knowledge that might not be broadly captured by the system. Primary uses of visualization in the predictive analytics pipeline have focused on data cleaning, exploratory analysis, and diagnostics. More recently, numerous visual analytics systems for feature selection, incremental learning, and various prediction tasks have been proposed to support the growing use of complex models, agent-specific optimization, and comprehensive model comparison and result exploration. Such work is being driven by advances in interactive machine learning and the desire of end-users to understand and engage with the modeling process. However, despite the numerous and promising applications of visual analytics to predictive analytics tasks, work to assess the effectiveness of predictive visual analytics is lacking. This thesis studies the current methodologies in predictive visual analytics. It first defines the scope of predictive analytics and presents a predictive visual analytics (PVA) pipeline. Following the proposed pipeline, a predictive visual analytics framework is developed to be used to explore under what circumstances a human-in-the-loop prediction process is most effective. This framework combines sentiment analysis, feature selection mechanisms, similarity comparisons and model cross-validation through a variety of interactive visualizations to support analysts in model building and prediction. To test the proposed framework, an instantiation for movie box-office prediction is developed and evaluated. Results from small-scale user studies are presented and discussed, and a generalized user study is carried out to assess the role of predictive visual analytics under a movie box-office prediction scenario. === Dissertation/Thesis === Doctoral Dissertation Engineering 2017
author2 Lu, Yafeng (Author)
author_facet Lu, Yafeng (Author)
title Methodologies in Predictive Visual Analytics
title_short Methodologies in Predictive Visual Analytics
title_full Methodologies in Predictive Visual Analytics
title_fullStr Methodologies in Predictive Visual Analytics
title_full_unstemmed Methodologies in Predictive Visual Analytics
title_sort methodologies in predictive visual analytics
publishDate 2017
url http://hdl.handle.net/2286/R.I.43997
_version_ 1718701394869878784