Rhyme, Rhythm, and Rhubarb: Using Probabilistic Methods to Analyze Hip Hop, Poetry, and Misheard Lyrics

While text Information Retrieval applications often focus on extracting semantic features to identify the topic of a document, and Music Information Research tends to deal with melodic, timbral or meta-tagged data of songs, useful information can be gained from surface-level features of musical text...

Full description

Bibliographic Details
Main Author:	Hirjee, Hussein
Language:	en
Published:	2010
Subjects:	information retrieval music lyrics hip hop rap rhyme misheard mondegreen phonetic similarity scansion poetry meter Computer Science
Online Access:	http://hdl.handle.net/10012/5419

id	ndltd-WATERLOO-oai-uwspace.uwaterloo.ca-10012-5419
record_format	oai_dc
spelling	ndltd-WATERLOO-oai-uwspace.uwaterloo.ca-10012-54192013-01-08T18:53:49ZHirjee, Hussein2010-08-30T20:37:51Z2010-08-30T20:37:51Z2010-08-30T20:37:51Z2010http://hdl.handle.net/10012/5419While text Information Retrieval applications often focus on extracting semantic features to identify the topic of a document, and Music Information Research tends to deal with melodic, timbral or meta-tagged data of songs, useful information can be gained from surface-level features of musical texts as well. This is especially true for texts such as song lyrics and poetry, in which the sound and structure of the words is important. These types of lyrical verse usually contain regular and repetitive patterns, like the rhymes in rap lyrics or the meter in metrical poetry. The existence of such patterns is not always categorical, as there may be a degree to which they appear or apply in any sample of text. For example, rhymes in hip hop are often imperfect and vary in the degree to which their constituent parts differ. Although a definitive decision as to the existence of any such feature cannot always be made, large corpora of known examples can be used to train probabilistic models enumerating the likelihood of their appearance. In this thesis, we apply likelihood-based methods to identify and characterize patterns in lyrical verse. We use a probabilistic model of mishearing in music to resolve misheard lyric search queries. We then apply a probabilistic model of rhyme to detect imperfect and internal rhymes in rap lyrics and quantitatively characterize rappers' styles in their use. Finally, we compute likelihoods of prosodic stress in words to perform automated scansion of poetry and compare poets' usage of and adherence to meter. In these applications, we find that likelihood-based methods outperform simpler, rule-based models at finding and quantifying lyrical features in text.eninformation retrievalmusiclyricship hopraprhymemisheardmondegreenphonetic similarityscansionpoetrymeterRhyme, Rhythm, and Rhubarb: Using Probabilistic Methods to Analyze Hip Hop, Poetry, and Misheard LyricsThesis or DissertationSchool of Computer ScienceMaster of MathematicsComputer Science
collection	NDLTD
language	en
sources	NDLTD
topic	information retrieval music lyrics hip hop rap rhyme misheard mondegreen phonetic similarity scansion poetry meter Computer Science
spellingShingle	information retrieval music lyrics hip hop rap rhyme misheard mondegreen phonetic similarity scansion poetry meter Computer Science Hirjee, Hussein Rhyme, Rhythm, and Rhubarb: Using Probabilistic Methods to Analyze Hip Hop, Poetry, and Misheard Lyrics
description	While text Information Retrieval applications often focus on extracting semantic features to identify the topic of a document, and Music Information Research tends to deal with melodic, timbral or meta-tagged data of songs, useful information can be gained from surface-level features of musical texts as well. This is especially true for texts such as song lyrics and poetry, in which the sound and structure of the words is important. These types of lyrical verse usually contain regular and repetitive patterns, like the rhymes in rap lyrics or the meter in metrical poetry. The existence of such patterns is not always categorical, as there may be a degree to which they appear or apply in any sample of text. For example, rhymes in hip hop are often imperfect and vary in the degree to which their constituent parts differ. Although a definitive decision as to the existence of any such feature cannot always be made, large corpora of known examples can be used to train probabilistic models enumerating the likelihood of their appearance. In this thesis, we apply likelihood-based methods to identify and characterize patterns in lyrical verse. We use a probabilistic model of mishearing in music to resolve misheard lyric search queries. We then apply a probabilistic model of rhyme to detect imperfect and internal rhymes in rap lyrics and quantitatively characterize rappers' styles in their use. Finally, we compute likelihoods of prosodic stress in words to perform automated scansion of poetry and compare poets' usage of and adherence to meter. In these applications, we find that likelihood-based methods outperform simpler, rule-based models at finding and quantifying lyrical features in text.
author	Hirjee, Hussein
author_facet	Hirjee, Hussein
author_sort	Hirjee, Hussein
title	Rhyme, Rhythm, and Rhubarb: Using Probabilistic Methods to Analyze Hip Hop, Poetry, and Misheard Lyrics
title_short	Rhyme, Rhythm, and Rhubarb: Using Probabilistic Methods to Analyze Hip Hop, Poetry, and Misheard Lyrics
title_full	Rhyme, Rhythm, and Rhubarb: Using Probabilistic Methods to Analyze Hip Hop, Poetry, and Misheard Lyrics
title_fullStr	Rhyme, Rhythm, and Rhubarb: Using Probabilistic Methods to Analyze Hip Hop, Poetry, and Misheard Lyrics
title_full_unstemmed	Rhyme, Rhythm, and Rhubarb: Using Probabilistic Methods to Analyze Hip Hop, Poetry, and Misheard Lyrics
title_sort	rhyme, rhythm, and rhubarb: using probabilistic methods to analyze hip hop, poetry, and misheard lyrics
publishDate	2010
url	http://hdl.handle.net/10012/5419
work_keys_str_mv	AT hirjeehussein rhymerhythmandrhubarbusingprobabilisticmethodstoanalyzehiphoppoetryandmisheardlyrics
_version_	1716573618333286400

Rhyme, Rhythm, and Rhubarb: Using Probabilistic Methods to Analyze Hip Hop, Poetry, and Misheard Lyrics

Similar Items