A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection

This article presents and evaluates a method for the detection of DBpedia types and entities that can be used for knowledge base completion and maintenance. This method compares entity embeddings with traditional N-gram models coupled with clustering and classification. We tackle two challenges: (a)...

Full description

Bibliographic Details
Main Authors: Hanqing Zhou, Amal Zouaq, Diana Inkpen
Format: Article
Language:English
Published: MDPI AG 2018-12-01
Series:Information
Subjects:
Online Access:http://www.mdpi.com/2078-2489/10/1/6
id doaj-69900d043e6042e0974c5f83dc58d52e
record_format Article
spelling doaj-69900d043e6042e0974c5f83dc58d52e2020-11-24T22:49:17ZengMDPI AGInformation2078-24892018-12-01101610.3390/info10010006info10010006A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity DetectionHanqing Zhou0Amal Zouaq1Diana Inkpen2School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa ON K1N 6N5, CanadaSchool of Electrical Engineering and Computer Science, University of Ottawa, Ottawa ON K1N 6N5, CanadaSchool of Electrical Engineering and Computer Science, University of Ottawa, Ottawa ON K1N 6N5, CanadaThis article presents and evaluates a method for the detection of DBpedia types and entities that can be used for knowledge base completion and maintenance. This method compares entity embeddings with traditional N-gram models coupled with clustering and classification. We tackle two challenges: (a) the detection of entity types, which can be used to detect invalid DBpedia types and assign DBpedia types for type-less entities; and (b) the detection of invalid entities in the resource description of a DBpedia entity. Our results show that entity embeddings outperform n-gram models for type and entity detection and can contribute to the improvement of DBpedia’s quality, maintenance, and evolution.http://www.mdpi.com/2078-2489/10/1/6semantic webDBpediaentity embeddingn-gramstype identificationentity identificationdata miningmachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Hanqing Zhou
Amal Zouaq
Diana Inkpen
spellingShingle Hanqing Zhou
Amal Zouaq
Diana Inkpen
A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection
Information
semantic web
DBpedia
entity embedding
n-grams
type identification
entity identification
data mining
machine learning
author_facet Hanqing Zhou
Amal Zouaq
Diana Inkpen
author_sort Hanqing Zhou
title A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection
title_short A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection
title_full A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection
title_fullStr A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection
title_full_unstemmed A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection
title_sort comparison of word embeddings and n-gram models for dbpedia type and invalid entity detection
publisher MDPI AG
series Information
issn 2078-2489
publishDate 2018-12-01
description This article presents and evaluates a method for the detection of DBpedia types and entities that can be used for knowledge base completion and maintenance. This method compares entity embeddings with traditional N-gram models coupled with clustering and classification. We tackle two challenges: (a) the detection of entity types, which can be used to detect invalid DBpedia types and assign DBpedia types for type-less entities; and (b) the detection of invalid entities in the resource description of a DBpedia entity. Our results show that entity embeddings outperform n-gram models for type and entity detection and can contribute to the improvement of DBpedia’s quality, maintenance, and evolution.
topic semantic web
DBpedia
entity embedding
n-grams
type identification
entity identification
data mining
machine learning
url http://www.mdpi.com/2078-2489/10/1/6
work_keys_str_mv AT hanqingzhou acomparisonofwordembeddingsandngrammodelsfordbpediatypeandinvalidentitydetection
AT amalzouaq acomparisonofwordembeddingsandngrammodelsfordbpediatypeandinvalidentitydetection
AT dianainkpen acomparisonofwordembeddingsandngrammodelsfordbpediatypeandinvalidentitydetection
AT hanqingzhou comparisonofwordembeddingsandngrammodelsfordbpediatypeandinvalidentitydetection
AT amalzouaq comparisonofwordembeddingsandngrammodelsfordbpediatypeandinvalidentitydetection
AT dianainkpen comparisonofwordembeddingsandngrammodelsfordbpediatypeandinvalidentitydetection
_version_ 1725676527844065280