A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection
This article presents and evaluates a method for the detection of DBpedia types and entities that can be used for knowledge base completion and maintenance. This method compares entity embeddings with traditional N-gram models coupled with clustering and classification. We tackle two challenges: (a)...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2018-12-01
|
Series: | Information |
Subjects: | |
Online Access: | http://www.mdpi.com/2078-2489/10/1/6 |
id |
doaj-69900d043e6042e0974c5f83dc58d52e |
---|---|
record_format |
Article |
spelling |
doaj-69900d043e6042e0974c5f83dc58d52e2020-11-24T22:49:17ZengMDPI AGInformation2078-24892018-12-01101610.3390/info10010006info10010006A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity DetectionHanqing Zhou0Amal Zouaq1Diana Inkpen2School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa ON K1N 6N5, CanadaSchool of Electrical Engineering and Computer Science, University of Ottawa, Ottawa ON K1N 6N5, CanadaSchool of Electrical Engineering and Computer Science, University of Ottawa, Ottawa ON K1N 6N5, CanadaThis article presents and evaluates a method for the detection of DBpedia types and entities that can be used for knowledge base completion and maintenance. This method compares entity embeddings with traditional N-gram models coupled with clustering and classification. We tackle two challenges: (a) the detection of entity types, which can be used to detect invalid DBpedia types and assign DBpedia types for type-less entities; and (b) the detection of invalid entities in the resource description of a DBpedia entity. Our results show that entity embeddings outperform n-gram models for type and entity detection and can contribute to the improvement of DBpedia’s quality, maintenance, and evolution.http://www.mdpi.com/2078-2489/10/1/6semantic webDBpediaentity embeddingn-gramstype identificationentity identificationdata miningmachine learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Hanqing Zhou Amal Zouaq Diana Inkpen |
spellingShingle |
Hanqing Zhou Amal Zouaq Diana Inkpen A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection Information semantic web DBpedia entity embedding n-grams type identification entity identification data mining machine learning |
author_facet |
Hanqing Zhou Amal Zouaq Diana Inkpen |
author_sort |
Hanqing Zhou |
title |
A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection |
title_short |
A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection |
title_full |
A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection |
title_fullStr |
A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection |
title_full_unstemmed |
A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection |
title_sort |
comparison of word embeddings and n-gram models for dbpedia type and invalid entity detection |
publisher |
MDPI AG |
series |
Information |
issn |
2078-2489 |
publishDate |
2018-12-01 |
description |
This article presents and evaluates a method for the detection of DBpedia types and entities that can be used for knowledge base completion and maintenance. This method compares entity embeddings with traditional N-gram models coupled with clustering and classification. We tackle two challenges: (a) the detection of entity types, which can be used to detect invalid DBpedia types and assign DBpedia types for type-less entities; and (b) the detection of invalid entities in the resource description of a DBpedia entity. Our results show that entity embeddings outperform n-gram models for type and entity detection and can contribute to the improvement of DBpedia’s quality, maintenance, and evolution. |
topic |
semantic web DBpedia entity embedding n-grams type identification entity identification data mining machine learning |
url |
http://www.mdpi.com/2078-2489/10/1/6 |
work_keys_str_mv |
AT hanqingzhou acomparisonofwordembeddingsandngrammodelsfordbpediatypeandinvalidentitydetection AT amalzouaq acomparisonofwordembeddingsandngrammodelsfordbpediatypeandinvalidentitydetection AT dianainkpen acomparisonofwordembeddingsandngrammodelsfordbpediatypeandinvalidentitydetection AT hanqingzhou comparisonofwordembeddingsandngrammodelsfordbpediatypeandinvalidentitydetection AT amalzouaq comparisonofwordembeddingsandngrammodelsfordbpediatypeandinvalidentitydetection AT dianainkpen comparisonofwordembeddingsandngrammodelsfordbpediatypeandinvalidentitydetection |
_version_ |
1725676527844065280 |