Thematic harvesting of agricultural resources from generic repositories

Metadata aggregators and service providers harvest entire collections or they restrict harvesting by date or sets. However most often user approach to collections is not by dates or set names but by domain based keywords. Harvesting by domains is an issue when service providers attempt to collect da...

Full description

Bibliographic Details
Main Author: Devika P. Madalli
Format: Article
Language:English
Published: KeAi Communications Co., Ltd. 2015-09-01
Series:Information Processing in Agriculture
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2214317315000293
id doaj-baf481794d8f414587684fdbbc8585c6
record_format Article
spelling doaj-baf481794d8f414587684fdbbc8585c62021-04-02T07:47:07ZengKeAi Communications Co., Ltd.Information Processing in Agriculture2214-31732015-09-01229310010.1016/j.inpa.2015.05.002Thematic harvesting of agricultural resources from generic repositoriesDevika P. MadalliMetadata aggregators and service providers harvest entire collections or they restrict harvesting by date or sets. However most often user approach to collections is not by dates or set names but by domain based keywords. Harvesting by domains is an issue when service providers attempt to collect data from multiple sources. The main problem is that harvesters, at present, do not have the facility to distinguish themes such as domains. In the present work, an attempt has been through Tharvest, a thematic harvester model using the proposed methodology harvesting agricultural resources from generic repositories. Tharvest encompasses a process where technical terms of the domain of agriculture are taken from AGROVOC, a multilingual, structured controlled vocabulary designed to cover concepts and terminologies in the agriculture domain. AGROVOC is deployed to provide the basis for selective harvesting. The system components and workflows are presented and described. Metadata aggregators provide end-users a single platform discovery facility to resources collected from various data providers. It is observed that aggregators such as INDUS [www.drtc.isibang/ac.in/indus] dealing with agriculture and related domains facilitate aggregating metadata from not only repositories but also other sources such as journals and enable a centralized access to full text and objects. While harvesting can be fairly simple and straight forward, it is not without its challenges. This paper intends to highlight some of the issues in harvesting metadata in agricultural domain. The particular focus is to identify agriculture related metadata from generic sets.http://www.sciencedirect.com/science/article/pii/S2214317315000293Thematic harvestingAgricultural dataIssues in harvesting
collection DOAJ
language English
format Article
sources DOAJ
author Devika P. Madalli
spellingShingle Devika P. Madalli
Thematic harvesting of agricultural resources from generic repositories
Information Processing in Agriculture
Thematic harvesting
Agricultural data
Issues in harvesting
author_facet Devika P. Madalli
author_sort Devika P. Madalli
title Thematic harvesting of agricultural resources from generic repositories
title_short Thematic harvesting of agricultural resources from generic repositories
title_full Thematic harvesting of agricultural resources from generic repositories
title_fullStr Thematic harvesting of agricultural resources from generic repositories
title_full_unstemmed Thematic harvesting of agricultural resources from generic repositories
title_sort thematic harvesting of agricultural resources from generic repositories
publisher KeAi Communications Co., Ltd.
series Information Processing in Agriculture
issn 2214-3173
publishDate 2015-09-01
description Metadata aggregators and service providers harvest entire collections or they restrict harvesting by date or sets. However most often user approach to collections is not by dates or set names but by domain based keywords. Harvesting by domains is an issue when service providers attempt to collect data from multiple sources. The main problem is that harvesters, at present, do not have the facility to distinguish themes such as domains. In the present work, an attempt has been through Tharvest, a thematic harvester model using the proposed methodology harvesting agricultural resources from generic repositories. Tharvest encompasses a process where technical terms of the domain of agriculture are taken from AGROVOC, a multilingual, structured controlled vocabulary designed to cover concepts and terminologies in the agriculture domain. AGROVOC is deployed to provide the basis for selective harvesting. The system components and workflows are presented and described. Metadata aggregators provide end-users a single platform discovery facility to resources collected from various data providers. It is observed that aggregators such as INDUS [www.drtc.isibang/ac.in/indus] dealing with agriculture and related domains facilitate aggregating metadata from not only repositories but also other sources such as journals and enable a centralized access to full text and objects. While harvesting can be fairly simple and straight forward, it is not without its challenges. This paper intends to highlight some of the issues in harvesting metadata in agricultural domain. The particular focus is to identify agriculture related metadata from generic sets.
topic Thematic harvesting
Agricultural data
Issues in harvesting
url http://www.sciencedirect.com/science/article/pii/S2214317315000293
work_keys_str_mv AT devikapmadalli thematicharvestingofagriculturalresourcesfromgenericrepositories
_version_ 1724170933676015616