Biases in segmenting non-concatenative morphology
Thesis: Ph. D. in Linguistics, Massachusetts Institute of Technology, Department of Linguistics and Philosophy, 2018. === Cataloged from PDF version of thesis. === Includes bibliographical references (pages 131-140). === Segmentation of words containing non-concatenative morphology into their compon...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | English |
Published: |
Massachusetts Institute of Technology
2019
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/120676 |
id |
ndltd-MIT-oai-dspace.mit.edu-1721.1-120676 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-MIT-oai-dspace.mit.edu-1721.1-1206762019-05-02T16:17:56Z Biases in segmenting non-concatenative morphology Fullwood, Michelle Alison Adam Albright. Massachusetts Institute of Technology. Department of Linguistics and Philosophy. Massachusetts Institute of Technology. Department of Linguistics and Philosophy. Linguistics and Philosophy. Thesis: Ph. D. in Linguistics, Massachusetts Institute of Technology, Department of Linguistics and Philosophy, 2018. Cataloged from PDF version of thesis. Includes bibliographical references (pages 131-140). Segmentation of words containing non-concatenative morphology into their component morphemes, such as Arabic /kita:b/ 'book' into root [check symbol]ktb and vocalism /i-a:/ (McCarthy, 1979, 1981), is a difficult task due to the size of its search space of possibilities, which grows exponentially as word length increases, versus the linear growth that accompanies concatenative morphology. In this dissertation, I investigate via computational and typological simulations, as well as an artificial grammar experiment, the task of morphological segmentation in root-and-pattern languages, as well as the consequences for majority-concatenative languages such as English when we do not presuppose concatenative segmentation and its smaller hypothesis space. In particular, I examine the necessity and sufficiency conditions of three biases that may be hypothesised to govern the learning of such a segmentation: a bias towards a parsimonious morpheme lexicon with a power-law (Zipfian) distribution over tokens drawn from this lexicon, as has successfully been used in Bayesian models of word segmentation and morphological segmentation of concatenative languages (Goldwater et al., 2009; Poon et al., 2009, et seq.); a bias towards concatenativity; and a bias against interleaving morphemes that are mixtures of consonants and vowels. I demonstrate that while computationally, the parsimony bias is sufficient to segment Arabic verbal stems into roots and residues, typological considerations argue for the existence of biases towards concatenativity and towards separating consonants and vowels in root-and-pattern-style morphology. Further evidence for these as synchronic biases comes from the artificial grammar experiment, which demonstrates that languages respecting these biases have a small but significant learnability advantage. by Michelle Alison Fullwood. Ph. D. in Linguistics 2019-03-01T19:57:20Z 2019-03-01T19:57:20Z 2018 2018 Thesis http://hdl.handle.net/1721.1/120676 1088554863 eng MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582 140 pages application/pdf Massachusetts Institute of Technology |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
Linguistics and Philosophy. |
spellingShingle |
Linguistics and Philosophy. Fullwood, Michelle Alison Biases in segmenting non-concatenative morphology |
description |
Thesis: Ph. D. in Linguistics, Massachusetts Institute of Technology, Department of Linguistics and Philosophy, 2018. === Cataloged from PDF version of thesis. === Includes bibliographical references (pages 131-140). === Segmentation of words containing non-concatenative morphology into their component morphemes, such as Arabic /kita:b/ 'book' into root [check symbol]ktb and vocalism /i-a:/ (McCarthy, 1979, 1981), is a difficult task due to the size of its search space of possibilities, which grows exponentially as word length increases, versus the linear growth that accompanies concatenative morphology. In this dissertation, I investigate via computational and typological simulations, as well as an artificial grammar experiment, the task of morphological segmentation in root-and-pattern languages, as well as the consequences for majority-concatenative languages such as English when we do not presuppose concatenative segmentation and its smaller hypothesis space. In particular, I examine the necessity and sufficiency conditions of three biases that may be hypothesised to govern the learning of such a segmentation: a bias towards a parsimonious morpheme lexicon with a power-law (Zipfian) distribution over tokens drawn from this lexicon, as has successfully been used in Bayesian models of word segmentation and morphological segmentation of concatenative languages (Goldwater et al., 2009; Poon et al., 2009, et seq.); a bias towards concatenativity; and a bias against interleaving morphemes that are mixtures of consonants and vowels. I demonstrate that while computationally, the parsimony bias is sufficient to segment Arabic verbal stems into roots and residues, typological considerations argue for the existence of biases towards concatenativity and towards separating consonants and vowels in root-and-pattern-style morphology. Further evidence for these as synchronic biases comes from the artificial grammar experiment, which demonstrates that languages respecting these biases have a small but significant learnability advantage. === by Michelle Alison Fullwood. === Ph. D. in Linguistics |
author2 |
Adam Albright. |
author_facet |
Adam Albright. Fullwood, Michelle Alison |
author |
Fullwood, Michelle Alison |
author_sort |
Fullwood, Michelle Alison |
title |
Biases in segmenting non-concatenative morphology |
title_short |
Biases in segmenting non-concatenative morphology |
title_full |
Biases in segmenting non-concatenative morphology |
title_fullStr |
Biases in segmenting non-concatenative morphology |
title_full_unstemmed |
Biases in segmenting non-concatenative morphology |
title_sort |
biases in segmenting non-concatenative morphology |
publisher |
Massachusetts Institute of Technology |
publishDate |
2019 |
url |
http://hdl.handle.net/1721.1/120676 |
work_keys_str_mv |
AT fullwoodmichellealison biasesinsegmentingnonconcatenativemorphology |
_version_ |
1719037748144242688 |