Thematic harvesting of agricultural resources from generic repositories
Metadata aggregators and service providers harvest entire collections or they restrict harvesting by date or sets. However most often user approach to collections is not by dates or set names but by domain based keywords. Harvesting by domains is an issue when service providers attempt to collect da...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
KeAi Communications Co., Ltd.
2015-09-01
|
Series: | Information Processing in Agriculture |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2214317315000293 |
id |
doaj-baf481794d8f414587684fdbbc8585c6 |
---|---|
record_format |
Article |
spelling |
doaj-baf481794d8f414587684fdbbc8585c62021-04-02T07:47:07ZengKeAi Communications Co., Ltd.Information Processing in Agriculture2214-31732015-09-01229310010.1016/j.inpa.2015.05.002Thematic harvesting of agricultural resources from generic repositoriesDevika P. MadalliMetadata aggregators and service providers harvest entire collections or they restrict harvesting by date or sets. However most often user approach to collections is not by dates or set names but by domain based keywords. Harvesting by domains is an issue when service providers attempt to collect data from multiple sources. The main problem is that harvesters, at present, do not have the facility to distinguish themes such as domains. In the present work, an attempt has been through Tharvest, a thematic harvester model using the proposed methodology harvesting agricultural resources from generic repositories. Tharvest encompasses a process where technical terms of the domain of agriculture are taken from AGROVOC, a multilingual, structured controlled vocabulary designed to cover concepts and terminologies in the agriculture domain. AGROVOC is deployed to provide the basis for selective harvesting. The system components and workflows are presented and described. Metadata aggregators provide end-users a single platform discovery facility to resources collected from various data providers. It is observed that aggregators such as INDUS [www.drtc.isibang/ac.in/indus] dealing with agriculture and related domains facilitate aggregating metadata from not only repositories but also other sources such as journals and enable a centralized access to full text and objects. While harvesting can be fairly simple and straight forward, it is not without its challenges. This paper intends to highlight some of the issues in harvesting metadata in agricultural domain. The particular focus is to identify agriculture related metadata from generic sets.http://www.sciencedirect.com/science/article/pii/S2214317315000293Thematic harvestingAgricultural dataIssues in harvesting |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Devika P. Madalli |
spellingShingle |
Devika P. Madalli Thematic harvesting of agricultural resources from generic repositories Information Processing in Agriculture Thematic harvesting Agricultural data Issues in harvesting |
author_facet |
Devika P. Madalli |
author_sort |
Devika P. Madalli |
title |
Thematic harvesting of agricultural resources from generic repositories |
title_short |
Thematic harvesting of agricultural resources from generic repositories |
title_full |
Thematic harvesting of agricultural resources from generic repositories |
title_fullStr |
Thematic harvesting of agricultural resources from generic repositories |
title_full_unstemmed |
Thematic harvesting of agricultural resources from generic repositories |
title_sort |
thematic harvesting of agricultural resources from generic repositories |
publisher |
KeAi Communications Co., Ltd. |
series |
Information Processing in Agriculture |
issn |
2214-3173 |
publishDate |
2015-09-01 |
description |
Metadata aggregators and service providers harvest entire collections or they restrict harvesting by date or sets. However most often user approach to collections is not by dates or set names but by domain based keywords. Harvesting by domains is an issue when service providers attempt to collect data from multiple sources. The main problem is that harvesters, at present, do not have the facility to distinguish themes such as domains. In the present work, an attempt has been through Tharvest, a thematic harvester model using the proposed methodology harvesting agricultural resources from generic repositories. Tharvest encompasses a process where technical terms of the domain of agriculture are taken from AGROVOC, a multilingual, structured controlled vocabulary designed to cover concepts and terminologies in the agriculture domain. AGROVOC is deployed to provide the basis for selective harvesting. The system components and workflows are presented and described.
Metadata aggregators provide end-users a single platform discovery facility to resources collected from various data providers. It is observed that aggregators such as INDUS [www.drtc.isibang/ac.in/indus] dealing with agriculture and related domains facilitate aggregating metadata from not only repositories but also other sources such as journals and enable a centralized access to full text and objects. While harvesting can be fairly simple and straight forward, it is not without its challenges. This paper intends to highlight some of the issues in harvesting metadata in agricultural domain. The particular focus is to identify agriculture related metadata from generic sets. |
topic |
Thematic harvesting Agricultural data Issues in harvesting |
url |
http://www.sciencedirect.com/science/article/pii/S2214317315000293 |
work_keys_str_mv |
AT devikapmadalli thematicharvestingofagriculturalresourcesfromgenericrepositories |
_version_ |
1724170933676015616 |