Managing Index Repartitioning

Careful architectural decisions are required in order to create a highly available and scalable search system. This requires an in-depth analysis and understanding of the architecture and context of each deployment. Different requirements placed upon the system by different deployments mean differen...

Full description

Bibliographic Details
Main Author: Karevoll, Njål
Format: Others
Language:English
Published: Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap 2011
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-13491
id ndltd-UPSALLA1-oai-DiVA.org-ntnu-13491
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-ntnu-134912013-01-08T13:32:50ZManaging Index RepartitioningengKarevoll, NjålNorges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskapInstitutt for datateknikk og informasjonsvitenskap2011ntnudaim:5864MTDT datateknikkData- og informasjonsforvaltningCareful architectural decisions are required in order to create a highly available and scalable search system. This requires an in-depth analysis and understanding of the architecture and context of each deployment. Different requirements placed upon the system by different deployments mean different solutions provide the best case by case result, thus benchmarks provide an invaluable source of information.This thesis provides an overview of common components and important aspects of a distributed search system. It then gives an overview of different partitioning techniques before going into the details of repartitioning and rebalancing in a document-partitioned full-text search system.A processing framework that draws inspiration from flow-based programming literature is introduced, which is shown a valuable tool in creating custom tailored search solutions. The implementation is used to benchmark different repartitioning and rebalancing strategies.In conclusion, the techniques mentioned in the thesis show great promise in creating custom, maintainable and flexible partitions. The processing framework enables each specific deployment to easily compare different partitioning schemes and associated manageability and maintenance costs to determine the best fit for any given situation. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-13491Local ntnudaim:5864application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic ntnudaim:5864
MTDT datateknikk
Data- og informasjonsforvaltning
spellingShingle ntnudaim:5864
MTDT datateknikk
Data- og informasjonsforvaltning
Karevoll, Njål
Managing Index Repartitioning
description Careful architectural decisions are required in order to create a highly available and scalable search system. This requires an in-depth analysis and understanding of the architecture and context of each deployment. Different requirements placed upon the system by different deployments mean different solutions provide the best case by case result, thus benchmarks provide an invaluable source of information.This thesis provides an overview of common components and important aspects of a distributed search system. It then gives an overview of different partitioning techniques before going into the details of repartitioning and rebalancing in a document-partitioned full-text search system.A processing framework that draws inspiration from flow-based programming literature is introduced, which is shown a valuable tool in creating custom tailored search solutions. The implementation is used to benchmark different repartitioning and rebalancing strategies.In conclusion, the techniques mentioned in the thesis show great promise in creating custom, maintainable and flexible partitions. The processing framework enables each specific deployment to easily compare different partitioning schemes and associated manageability and maintenance costs to determine the best fit for any given situation.
author Karevoll, Njål
author_facet Karevoll, Njål
author_sort Karevoll, Njål
title Managing Index Repartitioning
title_short Managing Index Repartitioning
title_full Managing Index Repartitioning
title_fullStr Managing Index Repartitioning
title_full_unstemmed Managing Index Repartitioning
title_sort managing index repartitioning
publisher Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap
publishDate 2011
url http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-13491
work_keys_str_mv AT karevollnjal managingindexrepartitioning
_version_ 1716523516428288000