Compounds in dictionary-based Cross-language information retrieval_revised

Compound words form an important part of natural language. From the cross-lingual information retrieval (CLIR) point of view it is important that many natural languages are highly productive with compounds, and translation resources cannot include entries for all compounds. Also, compounds are often...

Full description

Bibliographic Details
Format: Article
Language:English
Published: University of Borås 2002-01-01
Series:Information Research: An International Electronic Journal
Online Access:http://informationr.net/ir/7-2/paper128.html
id doaj-621db8721c584a3aa89fda162cd82c0b
record_format Article
spelling doaj-621db8721c584a3aa89fda162cd82c0b2020-11-25T01:37:22ZengUniversity of BoråsInformation Research: An International Electronic Journal1368-16132002-01-0172128Compounds in dictionary-based Cross-language information retrieval_revisedCompound words form an important part of natural language. From the cross-lingual information retrieval (CLIR) point of view it is important that many natural languages are highly productive with compounds, and translation resources cannot include entries for all compounds. Also, compounds are often content bearing words in a sentence. In Swedish, German and Finnish roughly one tenth of the words in a text prepared for information retrieval purposes are compounds. Important research questions concerning compound handling in dictionary-based cross-language information retrieval are 1) compound splitting into components, 2) normalisation of components, 3) translation of components and 4) query structuring for compounds and their components in the target language. The impact of compound processing on the performance of the cross-language information retrieval process is evaluated in this study and the results indicate that the effect is clearly positive.http://informationr.net/ir/7-2/paper128.html
collection DOAJ
language English
format Article
sources DOAJ
title Compounds in dictionary-based Cross-language information retrieval_revised
spellingShingle Compounds in dictionary-based Cross-language information retrieval_revised
Information Research: An International Electronic Journal
title_short Compounds in dictionary-based Cross-language information retrieval_revised
title_full Compounds in dictionary-based Cross-language information retrieval_revised
title_fullStr Compounds in dictionary-based Cross-language information retrieval_revised
title_full_unstemmed Compounds in dictionary-based Cross-language information retrieval_revised
title_sort compounds in dictionary-based cross-language information retrieval_revised
publisher University of Borås
series Information Research: An International Electronic Journal
issn 1368-1613
publishDate 2002-01-01
description Compound words form an important part of natural language. From the cross-lingual information retrieval (CLIR) point of view it is important that many natural languages are highly productive with compounds, and translation resources cannot include entries for all compounds. Also, compounds are often content bearing words in a sentence. In Swedish, German and Finnish roughly one tenth of the words in a text prepared for information retrieval purposes are compounds. Important research questions concerning compound handling in dictionary-based cross-language information retrieval are 1) compound splitting into components, 2) normalisation of components, 3) translation of components and 4) query structuring for compounds and their components in the target language. The impact of compound processing on the performance of the cross-language information retrieval process is evaluated in this study and the results indicate that the effect is clearly positive.
url http://informationr.net/ir/7-2/paper128.html
_version_ 1725058028320849920