QuestVis and MDSteer : the visualization of high-dimensional environmental sustainability data
The visualization of large high-dimensional datasets is an active topic within the research area of information visualization (infovis), a research area that studies the visual representations of complex abstract datasets. My thesis presents two infovis systems that were motivated by the desire t...
Main Author: | |
---|---|
Format: | Others |
Language: | English |
Published: |
2009
|
Online Access: | http://hdl.handle.net/2429/15921 |
id |
ndltd-UBC-oai-circle.library.ubc.ca-2429-15921 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UBC-oai-circle.library.ubc.ca-2429-159212018-01-05T17:38:02Z QuestVis and MDSteer : the visualization of high-dimensional environmental sustainability data Williams, Matt The visualization of large high-dimensional datasets is an active topic within the research area of information visualization (infovis), a research area that studies the visual representations of complex abstract datasets. My thesis presents two infovis systems that were motivated by the desire to explore a 294-dimensional environmental sustainability dataset. Our collaborators developed the environmental dataset from expert knowledge on ecological, economical, and social systems which were used to model future scenarios consisting of 294 measures of environmental sustainability such as urban population, water supply levels, or tonnes of waste. Since these complex systems and large datasets are difficult for a non-expert user to comprehend, we developed QuestVis, a tool that applies infovis theories and techniques to improve the comprehensibility during exploration of the environmental dataset. The tool consists of three components: the input panel, the Multiscale Dimension Visualizer (MDV), and the Scenario Space Explorer (SSE). The MDV presents up to ten 294-dimensional future scenarios simultaneously on the screen to enable users to get a quick overview of the data. The simultaneous presentation also enables users to compare multiple future scenarios side-by-side. The SSE presents the space of all 120 000 future scenarios in an interactive two-dimensional layout which provides the user an overview of the possibilities. The SSE is tightly coupled with the MDV to provide context to the specific future scenarios that are presented in the MDV. These tightly linked components together provide an overview-hdetails framework within which users can effectively explore the dataset and immediately see the consequences of their choices. The creation of the dimensionality reduced overview in QuestVis led to a second research direction. We realized that current implementations of Multidimensional Scaling (MDS), a technique that attempts to best represent data point similarity in a low-dimensional embedding, are not suited for many of today's largescale datasets. This realization motivated us to develop MDSteer, a steerable MDS computation engine and visualization tool that progressively computes an MDS layout and handles datasets of over one million points. Our technique employs hierarchical data structures and progressive layouts that allow the user to steer the computation of the algorithm to the interesting areas of the dataset. The algorithm iteratively alternates between a layout stage in which a sub-selection of points are added to the set of active points affected by the MDS iteration, and a binning stage which increases the depth of the bin hierarchy and organizes the currently unplaced points into separate spatial regions. This binning strategy allows the user to select onscreen regions of the layout to focus the MDS computation into the areas of the dataset that are assigned to the selected bins. We show both real and common synthetic benchmark datasets with dimensionalities ranging from 3 to 300 and cardinalities of over one million points. Science, Faculty of Computer Science, Department of Graduate 2009-11-27T23:30:15Z 2009-11-27T23:30:15Z 2004 2004-05 Text Thesis/Dissertation http://hdl.handle.net/2429/15921 eng For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use. 14228958 bytes application/pdf |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
description |
The visualization of large high-dimensional datasets is an active topic within the
research area of information visualization (infovis), a research area that studies the
visual representations of complex abstract datasets. My thesis presents two infovis
systems that were motivated by the desire to explore a 294-dimensional environmental
sustainability dataset. Our collaborators developed the environmental dataset
from expert knowledge on ecological, economical, and social systems which were
used to model future scenarios consisting of 294 measures of environmental sustainability
such as urban population, water supply levels, or tonnes of waste. Since
these complex systems and large datasets are difficult for a non-expert user to comprehend,
we developed QuestVis, a tool that applies infovis theories and techniques
to improve the comprehensibility during exploration of the environmental dataset.
The tool consists of three components: the input panel, the Multiscale Dimension
Visualizer (MDV), and the Scenario Space Explorer (SSE). The MDV presents up to
ten 294-dimensional future scenarios simultaneously on the screen to enable users
to get a quick overview of the data. The simultaneous presentation also enables
users to compare multiple future scenarios side-by-side. The SSE presents the space
of all 120 000 future scenarios in an interactive two-dimensional layout which provides
the user an overview of the possibilities. The SSE is tightly coupled with
the MDV to provide context to the specific future scenarios that are presented in
the MDV. These tightly linked components together provide an overview-hdetails
framework within which users can effectively explore the dataset and immediately
see the consequences of their choices.
The creation of the dimensionality reduced overview in QuestVis led to a
second research direction. We realized that current implementations of Multidimensional
Scaling (MDS), a technique that attempts to best represent data point
similarity in a low-dimensional embedding, are not suited for many of today's largescale
datasets. This realization motivated us to develop MDSteer, a steerable MDS
computation engine and visualization tool that progressively computes an MDS
layout and handles datasets of over one million points. Our technique employs hierarchical
data structures and progressive layouts that allow the user to steer the
computation of the algorithm to the interesting areas of the dataset. The algorithm iteratively alternates between a layout stage in which a sub-selection of points are
added to the set of active points affected by the MDS iteration, and a binning stage
which increases the depth of the bin hierarchy and organizes the currently unplaced
points into separate spatial regions. This binning strategy allows the user to select
onscreen regions of the layout to focus the MDS computation into the areas
of the dataset that are assigned to the selected bins. We show both real and common
synthetic benchmark datasets with dimensionalities ranging from 3 to 300 and
cardinalities of over one million points. === Science, Faculty of === Computer Science, Department of === Graduate |
author |
Williams, Matt |
spellingShingle |
Williams, Matt QuestVis and MDSteer : the visualization of high-dimensional environmental sustainability data |
author_facet |
Williams, Matt |
author_sort |
Williams, Matt |
title |
QuestVis and MDSteer : the visualization of high-dimensional environmental sustainability data |
title_short |
QuestVis and MDSteer : the visualization of high-dimensional environmental sustainability data |
title_full |
QuestVis and MDSteer : the visualization of high-dimensional environmental sustainability data |
title_fullStr |
QuestVis and MDSteer : the visualization of high-dimensional environmental sustainability data |
title_full_unstemmed |
QuestVis and MDSteer : the visualization of high-dimensional environmental sustainability data |
title_sort |
questvis and mdsteer : the visualization of high-dimensional environmental sustainability data |
publishDate |
2009 |
url |
http://hdl.handle.net/2429/15921 |
work_keys_str_mv |
AT williamsmatt questvisandmdsteerthevisualizationofhighdimensionalenvironmentalsustainabilitydata |
_version_ |
1718590050753576960 |