On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown
We discuss the learning goals, content, and delivery of a University of Plymouth intensive module delivered over four weeks entitled MATH1608PP Understanding Big Data from Social Networks, aimed at introducing students to a broad range of techniques used in modern Data Science. This module made use...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Taylor & Francis Group
2017-05-01
|
Series: | Journal of Statistics Education |
Subjects: | |
Online Access: | http://dx.doi.org/10.1080/10691898.2017.1322474 |
id |
doaj-6823f743765c460689ab9368406a1838 |
---|---|
record_format |
Article |
spelling |
doaj-6823f743765c460689ab9368406a18382020-11-25T00:45:01ZengTaylor & Francis GroupJournal of Statistics Education1069-18982017-05-01252606710.1080/10691898.2017.13224741322474On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdownJulian Stander0Luciana Dalla Valle1University of Plymouth,University of Plymouth,We discuss the learning goals, content, and delivery of a University of Plymouth intensive module delivered over four weeks entitled MATH1608PP Understanding Big Data from Social Networks, aimed at introducing students to a broad range of techniques used in modern Data Science. This module made use of R, accessed through RStudio, and some popular R packages. After describing initial examples used to fire student enthusiasm, we explain our approach to teaching data visualization using the ggplot2 package. We discuss other module topics, including basic statistical inference, data manipulation with dplyr and tidyr, data bases and SQL, social media sentiment analysis, Likert-type data, reproducible research using RMarkdown, dimension reduction and clustering, and parallel R. We present four lesson outlines and describe the module assessment. We mention some of the problems encountered when teaching the module, and present student feedback and our plans for next year.http://dx.doi.org/10.1080/10691898.2017.1322474Data visualizationData scienceR softwareSocial media |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Julian Stander Luciana Dalla Valle |
spellingShingle |
Julian Stander Luciana Dalla Valle On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown Journal of Statistics Education Data visualization Data science R software Social media |
author_facet |
Julian Stander Luciana Dalla Valle |
author_sort |
Julian Stander |
title |
On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown |
title_short |
On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown |
title_full |
On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown |
title_fullStr |
On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown |
title_full_unstemmed |
On Enthusing Students About Big Data and Social Media Visualization and Analysis Using R, RStudio, and RMarkdown |
title_sort |
on enthusing students about big data and social media visualization and analysis using r, rstudio, and rmarkdown |
publisher |
Taylor & Francis Group |
series |
Journal of Statistics Education |
issn |
1069-1898 |
publishDate |
2017-05-01 |
description |
We discuss the learning goals, content, and delivery of a University of Plymouth intensive module delivered over four weeks entitled MATH1608PP Understanding Big Data from Social Networks, aimed at introducing students to a broad range of techniques used in modern Data Science. This module made use of R, accessed through RStudio, and some popular R packages. After describing initial examples used to fire student enthusiasm, we explain our approach to teaching data visualization using the ggplot2 package. We discuss other module topics, including basic statistical inference, data manipulation with dplyr and tidyr, data bases and SQL, social media sentiment analysis, Likert-type data, reproducible research using RMarkdown, dimension reduction and clustering, and parallel R. We present four lesson outlines and describe the module assessment. We mention some of the problems encountered when teaching the module, and present student feedback and our plans for next year. |
topic |
Data visualization Data science R software Social media |
url |
http://dx.doi.org/10.1080/10691898.2017.1322474 |
work_keys_str_mv |
AT julianstander onenthusingstudentsaboutbigdataandsocialmediavisualizationandanalysisusingrrstudioandrmarkdown AT lucianadallavalle onenthusingstudentsaboutbigdataandsocialmediavisualizationandanalysisusingrrstudioandrmarkdown |
_version_ |
1725271795743850496 |