Targeted s-gram matching: a novel n-gram matching technique for cross- and monolingual word form variants
We present a novel n-gram based string matching technique, which we call the targeted s-gram matching technique. In the technique, n-grams are classified into categories on the basis of character contiguity in words. The categories are then utilized in matching. The technique was compared with the c...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
University of Borås
2002-01-01
|
Series: | Information Research: An International Electronic Journal |
Online Access: | http://informationr.net/ir/7-2/paper126.html |