Targeted s-gram matching: a novel n-gram matching technique for cross- and monolingual word form variants

We present a novel n-gram based string matching technique, which we call the targeted s-gram matching technique. In the technique, n-grams are classified into categories on the basis of character contiguity in words. The categories are then utilized in matching. The technique was compared with the c...

Full description

Bibliographic Details
Main Authors: Ari Pirkola, Heikki Keskustalo, Erkka Leppänen, Antti-Pekka Känsälä, Kalervo Järvelin
Format: Article
Language:English
Published: University of Borås 2002-01-01
Series:Information Research: An International Electronic Journal
Online Access:http://informationr.net/ir/7-2/paper126.html