Caveat emptor, computational social science: Large-scale missing data in a widely-published Reddit corpus.

As researchers use computational methods to study complex social behaviors at scale, the validity of this computational social science depends on the integrity of the data. On July 2, 2015, Jason Baumgartner published a dataset advertised to include "every publicly available Reddit comment"...

Bibliographic Details
Main Authors: Devin Gaffney, J Nathan Matias
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2018-01-01
Series: PLoS ONE
Online Access: http://europepmc.org/articles/PMC6034852?pdf=render