On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown

We discuss the learning goals, content, and delivery of a University of Plymouth intensive module delivered over four weeks entitled MATH1608PP Understanding Big Data from Social Networks, aimed at introducing students to a broad range of techniques used in modern Data Science. This module made use...

Full description

Bibliographic Details
Main Authors: Julian Stander, Luciana Dalla Valle
Format: Article
Language:English
Published: Taylor & Francis Group 2017-05-01
Series:Journal of Statistics Education
Subjects:
Online Access:http://dx.doi.org/10.1080/10691898.2017.1322474
id doaj-6823f743765c460689ab9368406a1838
record_format Article
spelling doaj-6823f743765c460689ab9368406a18382020-11-25T00:45:01ZengTaylor & Francis GroupJournal of Statistics Education1069-18982017-05-01252606710.1080/10691898.2017.13224741322474On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdownJulian Stander0Luciana Dalla Valle1University of Plymouth,University of Plymouth,We discuss the learning goals, content, and delivery of a University of Plymouth intensive module delivered over four weeks entitled MATH1608PP Understanding Big Data from Social Networks, aimed at introducing students to a broad range of techniques used in modern Data Science. This module made use of R, accessed through RStudio, and some popular R packages. After describing initial examples used to fire student enthusiasm, we explain our approach to teaching data visualization using the ggplot2 package. We discuss other module topics, including basic statistical inference, data manipulation with dplyr and tidyr, data bases and SQL, social media sentiment analysis, Likert-type data, reproducible research using RMarkdown, dimension reduction and clustering, and parallel R. We present four lesson outlines and describe the module assessment. We mention some of the problems encountered when teaching the module, and present student feedback and our plans for next year.http://dx.doi.org/10.1080/10691898.2017.1322474Data visualizationData scienceR softwareSocial media
collection DOAJ
language English
format Article
sources DOAJ
author Julian Stander
Luciana Dalla Valle
spellingShingle Julian Stander
Luciana Dalla Valle
On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown
Journal of Statistics Education
Data visualization
Data science
R software
Social media
author_facet Julian Stander
Luciana Dalla Valle
author_sort Julian Stander
title On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown
title_short On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown
title_full On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown
title_fullStr On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown
title_full_unstemmed On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown
title_sort on enthusing students about big data and social media visualization and analysis using r, rstudio, and rmarkdown
publisher Taylor & Francis Group
series Journal of Statistics Education
issn 1069-1898
publishDate 2017-05-01
description We discuss the learning goals, content, and delivery of a University of Plymouth intensive module delivered over four weeks entitled MATH1608PP Understanding Big Data from Social Networks, aimed at introducing students to a broad range of techniques used in modern Data Science. This module made use of R, accessed through RStudio, and some popular R packages. After describing initial examples used to fire student enthusiasm, we explain our approach to teaching data visualization using the ggplot2 package. We discuss other module topics, including basic statistical inference, data manipulation with dplyr and tidyr, data bases and SQL, social media sentiment analysis, Likert-type data, reproducible research using RMarkdown, dimension reduction and clustering, and parallel R. We present four lesson outlines and describe the module assessment. We mention some of the problems encountered when teaching the module, and present student feedback and our plans for next year.
topic Data visualization
Data science
R software
Social media
url http://dx.doi.org/10.1080/10691898.2017.1322474
work_keys_str_mv AT julianstander onenthusingstudentsaboutbigdataandsocialmediavisualizationandanalysisusingrrstudioandrmarkdown
AT lucianadallavalle onenthusingstudentsaboutbigdataandsocialmediavisualizationandanalysisusingrrstudioandrmarkdown
_version_ 1725271795743850496