Plain Text & Character Encoding: A Primer for Data Curators

Plain text data consists of a sequence of encoded characters or “code points” from a given standard such as the Unicode Standard. Some of the most common file formats for digital data used in eScience (CSV, XML, and JSON, for example) are built atop plain text standards. Plain text representations o...
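As a minimal illustrative sketch (not taken from the article), the Python snippet below shows what "a sequence of code points" looks like in practice. The sample string and the variable names are assumptions chosen for illustration. It lists the Unicode code points of a short string, encodes them as UTF-8 bytes, and then decodes those bytes with the wrong encoding to show how the text gets garbled.

```python
# Sample string written with escapes so the code points are unambiguous:
# U+00EF is "ï" and U+00E9 is "é" (both precomposed characters).
text = "na\u00efve caf\u00e9"

# Each character corresponds to one Unicode code point (an integer).
code_points = [ord(ch) for ch in text]

# An encoding such as UTF-8 maps code points to bytes for storage.
# ASCII-range characters take one byte; "ï" and "é" take two bytes each.
utf8_bytes = text.encode("utf-8")

# Decoding with the same encoding recovers the original text.
round_trip = utf8_bytes.decode("utf-8")

# Decoding with a different encoding (here Latin-1) does not fail,
# but it silently produces the wrong characters.
misread = utf8_bytes.decode("latin-1")

print([f"U+{cp:04X}" for cp in code_points])
print(len(text), "characters,", len(utf8_bytes), "bytes")
print(round_trip == text, misread == text)
```

The same bytes yield different text depending on the encoding assumed by the reader, which is one reason a curator might record the encoding alongside a CSV, XML, or JSON file.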

Bibliographic Details
Main Author: Seth Erickson
Format: Article
Language: English
Published: University of Massachusetts Medical School, Lamar Soutter Library 2021-08-01
Series: Journal of eScience Librarianship
Online Access: https://escholarship.umassmed.edu/jeslib/vol10/iss3/12