Story retrieval and comparison using concept patterns

Traditional story comparison uses key words to determine similarity. However, the use of key words misses much of what makes two stories alike. The method we have developed use high level concept patterns, which are comprised of multiple events, and compares them across stories. Comparison based on...

Full description

Bibliographic Details
Main Authors: Krakauer, Caryn E. (Author), Winston, Patrick Henry (Author)
Format: Article
Language:English
Published: © The Association for Computational Linguistics, 2022-04-06T17:38:11Z.
Subjects:
Online Access:Get fulltext
LEADER 02269 am a22002053u 4500
001 141723
042 |a dc 
100 1 0 |a Krakauer, Caryn E.  |e author 
700 1 0 |a Winston, Patrick Henry  |e author 
245 0 0 |a Story retrieval and comparison using concept patterns 
260 |b © The Association for Computational Linguistics,   |c 2022-04-06T17:38:11Z. 
856 |z Get fulltext  |u https://hdl.handle.net/1721.1/141723 
520 |a Traditional story comparison uses key words to determine similarity. However, the use of key words misses much of what makes two stories alike. The method we have developed use high level concept patterns, which are comprised of multiple events, and compares them across stories. Comparison based on concept patterns can note that two stories are similar because both contain, for example, revenge and betrayal concept patterns, even though the words revenge and betrayal do not appear in either story, and one may be about kings and kingdoms while the other is about presidents and countries. Using a small corpus of 15 conflict stories, we have shown that similarity measurement using concept patterns does, in fact, differ substantially from similarity measurement using key words. The Goldilocks principle states that features should be of intermediate size; they should be not too big, and they should not too small. Our work can be viewed as adhering to the Goldilocks principle because concept patterns are features of intermediate size, hence not so large as an entire story, because no story will be exactly like another story, and not so small as individual words, because individual words tend to be common in all stories taken from the same domain. While our goal is to develop a human competence model, we note application potential in retrieval, prediction, explanation, and grouping. 
520 |a This material is based on work supported by the U.S. Office of Naval Research, Grant No. N00014-09-1-0597. Any opinions, findings, conclusions or recommendations therein are those of the author(s) and do not necessarily reflect the views of the Office of Naval Research. 
546 |a en_US 
690 |a Goldilocks principle 
690 |a story retrieval 
690 |a intermediate features 
690 |a concept patterns 
655 7 |a Article