Wikipedia and Medicine: Quantifying Readership, Editors, and the Significance of Natural Language

BackgroundWikipedia is a collaboratively edited encyclopedia. One of the most popular websites on the Internet, it is known to be a frequently used source of health care information by both professionals and the lay public. ObjectiveThis paper quantifies the produ...

Full description

Bibliographic Details
Main Authors: Heilman, James M, West, Andrew G
Format: Article
Language:English
Published: JMIR Publications 2015-03-01
Series:Journal of Medical Internet Research
Online Access:http://www.jmir.org/2015/3/e62/
id doaj-9d8e8dcaf7914cd396466fb99867d82f
record_format Article
spelling doaj-9d8e8dcaf7914cd396466fb99867d82f2021-04-02T18:40:23ZengJMIR PublicationsJournal of Medical Internet Research1438-88712015-03-01173e6210.2196/jmir.4069Wikipedia and Medicine: Quantifying Readership, Editors, and the Significance of Natural LanguageHeilman, James MWest, Andrew G BackgroundWikipedia is a collaboratively edited encyclopedia. One of the most popular websites on the Internet, it is known to be a frequently used source of health care information by both professionals and the lay public. ObjectiveThis paper quantifies the production and consumption of Wikipedia’s medical content along 4 dimensions. First, we measured the amount of medical content in both articles and bytes and, second, the citations that supported that content. Third, we analyzed the medical readership against that of other health care websites between Wikipedia’s natural language editions and its relationship with disease prevalence. Fourth, we surveyed the quantity/characteristics of Wikipedia’s medical contributors, including year-over-year participation trends and editor demographics. MethodsUsing a well-defined categorization infrastructure, we identified medically pertinent English-language Wikipedia articles and links to their foreign language equivalents. With these, Wikipedia can be queried to produce metadata and full texts for entire article histories. Wikipedia also makes available hourly reports that aggregate reader traffic at per-article granularity. An online survey was used to determine the background of contributors. Standard mining and visualization techniques (eg, aggregation queries, cumulative distribution functions, and/or correlation metrics) were applied to each of these datasets. Analysis focused on year-end 2013, but historical data permitted some longitudinal analysis. ResultsWikipedia’s medical content (at the end of 2013) was made up of more than 155,000 articles and 1 billion bytes of text across more than 255 languages. This content was supported by more than 950,000 references. Content was viewed more than 4.88 billion times in 2013. This makes it one of if not the most viewed medical resource(s) globally. The core editor community numbered less than 300 and declined over the past 5 years. The members of this community were half health care providers and 85.5% (100/117) had a university education. ConclusionsAlthough Wikipedia has a considerable volume of multilingual medical content that is extensively read and well-referenced, the core group of editors that contribute and maintain that content is small and shrinking in size.http://www.jmir.org/2015/3/e62/
collection DOAJ
language English
format Article
sources DOAJ
author Heilman, James M
West, Andrew G
spellingShingle Heilman, James M
West, Andrew G
Wikipedia and Medicine: Quantifying Readership, Editors, and the Significance of Natural Language
Journal of Medical Internet Research
author_facet Heilman, James M
West, Andrew G
author_sort Heilman, James M
title Wikipedia and Medicine: Quantifying Readership, Editors, and the Significance of Natural Language
title_short Wikipedia and Medicine: Quantifying Readership, Editors, and the Significance of Natural Language
title_full Wikipedia and Medicine: Quantifying Readership, Editors, and the Significance of Natural Language
title_fullStr Wikipedia and Medicine: Quantifying Readership, Editors, and the Significance of Natural Language
title_full_unstemmed Wikipedia and Medicine: Quantifying Readership, Editors, and the Significance of Natural Language
title_sort wikipedia and medicine: quantifying readership, editors, and the significance of natural language
publisher JMIR Publications
series Journal of Medical Internet Research
issn 1438-8871
publishDate 2015-03-01
description BackgroundWikipedia is a collaboratively edited encyclopedia. One of the most popular websites on the Internet, it is known to be a frequently used source of health care information by both professionals and the lay public. ObjectiveThis paper quantifies the production and consumption of Wikipedia’s medical content along 4 dimensions. First, we measured the amount of medical content in both articles and bytes and, second, the citations that supported that content. Third, we analyzed the medical readership against that of other health care websites between Wikipedia’s natural language editions and its relationship with disease prevalence. Fourth, we surveyed the quantity/characteristics of Wikipedia’s medical contributors, including year-over-year participation trends and editor demographics. MethodsUsing a well-defined categorization infrastructure, we identified medically pertinent English-language Wikipedia articles and links to their foreign language equivalents. With these, Wikipedia can be queried to produce metadata and full texts for entire article histories. Wikipedia also makes available hourly reports that aggregate reader traffic at per-article granularity. An online survey was used to determine the background of contributors. Standard mining and visualization techniques (eg, aggregation queries, cumulative distribution functions, and/or correlation metrics) were applied to each of these datasets. Analysis focused on year-end 2013, but historical data permitted some longitudinal analysis. ResultsWikipedia’s medical content (at the end of 2013) was made up of more than 155,000 articles and 1 billion bytes of text across more than 255 languages. This content was supported by more than 950,000 references. Content was viewed more than 4.88 billion times in 2013. This makes it one of if not the most viewed medical resource(s) globally. The core editor community numbered less than 300 and declined over the past 5 years. The members of this community were half health care providers and 85.5% (100/117) had a university education. ConclusionsAlthough Wikipedia has a considerable volume of multilingual medical content that is extensively read and well-referenced, the core group of editors that contribute and maintain that content is small and shrinking in size.
url http://www.jmir.org/2015/3/e62/
work_keys_str_mv AT heilmanjamesm wikipediaandmedicinequantifyingreadershipeditorsandthesignificanceofnaturallanguage
AT westandrewg wikipediaandmedicinequantifyingreadershipeditorsandthesignificanceofnaturallanguage
_version_ 1721551299271458816