Surnames and ancestry in Brazil.

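This paper presents a method for classifying the ancestry of Brazilian surnames based on historical sources. The information obtained forms the basis for applying fuzzy matching and machine learning classification algorithms to more than 46 million workers in 5 categories: Iberian, Italian, Japanese, German and East European. The vast majority (96.7%) of the single surnames were identified using fuzzy matching and the rest using a method proposed by Cavnar and Trenkle (1994). A comparison of the results of the procedures with data on foreigners in the 1920 Census and with the geographic distribution of non-Iberian surnames underscores the accuracy of the procedure. The study shows that surname ancestry is associated with significant differences in wages and schooling.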
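
The two-stage idea in the description (fuzzy matching first, then the Cavnar and Trenkle (1994) method for the remainder) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration, not the paper's implementation: the tiny `REFERENCE` lists, the 0.8 similarity cutoff, the use of `difflib` for fuzzy matching, and character n-grams of length 1 to 3 are all hypothetical choices, and the East European category is omitted for brevity.

```python
from collections import Counter
from difflib import get_close_matches

# Hypothetical toy reference lists; the paper derives its lists from historical sources.
REFERENCE = {
    "Iberian": ["silva", "santos", "oliveira", "souza", "pereira"],
    "Italian": ["rossi", "bianchi", "ferrari", "romano", "colombo"],
    "Japanese": ["tanaka", "suzuki", "watanabe", "yamamoto", "nakamura"],
    "German": ["schmidt", "schneider", "fischer", "weber", "muller"],
}
NAME_TO_ANCESTRY = {name: anc for anc, names in REFERENCE.items() for name in names}


def fuzzy_lookup(surname, cutoff=0.8):
    """Return the ancestry of the closest reference surname, or None if none is close."""
    match = get_close_matches(surname.lower(), list(NAME_TO_ANCESTRY), n=1, cutoff=cutoff)
    return NAME_TO_ANCESTRY[match[0]] if match else None


def ngram_profile(words, n_max=3, top=300):
    """Map each of the most frequent character n-grams to its frequency rank."""
    counts = Counter()
    for word in words:
        padded = f"_{word.lower()}_"  # mark word boundaries
        for n in range(1, n_max + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    return {gram: rank for rank, (gram, _) in enumerate(counts.most_common(top))}


def out_of_place(name_profile, category_profile):
    """Sum of rank differences; n-grams missing from the category get a fixed penalty."""
    penalty = len(category_profile)
    return sum(
        abs(rank - category_profile[gram]) if gram in category_profile else penalty
        for gram, rank in name_profile.items()
    )


# One n-gram profile per ancestry category, built from the toy lists.
PROFILES = {anc: ngram_profile(names) for anc, names in REFERENCE.items()}


def classify(surname):
    """Try fuzzy matching first; fall back to the n-gram rank-order distance."""
    ancestry = fuzzy_lookup(surname)
    if ancestry is not None:
        return ancestry, "fuzzy"
    profile = ngram_profile([surname])
    best = min(PROFILES, key=lambda anc: out_of_place(profile, PROFILES[anc]))
    return best, "ngram"
```

For example, `classify("Schmitt")` is resolved by the fuzzy stage because it is close to the reference name "schmidt", while a name with no close reference match, such as "Yamashita", falls through to the n-gram stage. With lists this small the n-gram result is only indicative.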

Bibliographic Details
Main Author: Leonardo Monasterio
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2017-01-01
Series: PLoS ONE
Online Access: http://europepmc.org/articles/PMC5421764?pdf=render
id doaj-2756fcb69a9346d695a9c9b517f4c92d
record_format Article
spelling doaj-2756fcb69a9346d695a9c9b517f4c92d; 2020-11-24T21:52:04Z; eng; Public Library of Science (PLoS); PLoS ONE; 1932-6203; 2017-01-01; 12; 5; e0176890; 10.1371/journal.pone.0176890; Surnames and ancestry in Brazil.; Leonardo Monasterio; http://europepmc.org/articles/PMC5421764?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Leonardo Monasterio
title Surnames and ancestry in Brazil.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2017-01-01
description This paper presents a method for classifying the ancestry of Brazilian surnames based on historical sources. The information obtained forms the basis for applying fuzzy matching and machine learning classification algorithms to more than 46 million workers in 5 categories: Iberian, Italian, Japanese, German and East European. The vast majority (96.7%) of the single surnames were identified using fuzzy matching and the rest using a method proposed by Cavnar and Trenkle (1994). A comparison of the results of the procedures with data on foreigners in the 1920 Census and with the geographic distribution of non-Iberian surnames underscores the accuracy of the procedure. The study shows that surname ancestry is associated with significant differences in wages and schooling.
url http://europepmc.org/articles/PMC5421764?pdf=render