Optimization for search engines based on external revision database

The amount of data is continually growing and the ability to efficiently search through vast amounts of data is almost always sought after. To efficiently find data in a set there exist many technologies and methods but all of them cost in the form of resources like cpu-cycles, memory and storage. I...

Full description

Bibliographic Details
Main Authors: Westerdahl, Simon, Lemón Larsson, Fredrik
Format: Others
Language:English
Published: Högskolan Kristianstad, Fakulteten för naturvetenskap 2020
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:hkr:diva-21000
id ndltd-UPSALLA1-oai-DiVA.org-hkr-21000
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-hkr-210002020-08-25T05:41:47ZOptimization for search engines based on external revision databaseengWesterdahl, SimonLemón Larsson, FredrikHögskolan Kristianstad, Fakulteten för naturvetenskapHögskolan Kristianstad, Fakulteten för naturvetenskap2020Search engineoptimizationrevision databasemachine learningretrieval modelscomplexityComputer SciencesDatavetenskap (datalogi)The amount of data is continually growing and the ability to efficiently search through vast amounts of data is almost always sought after. To efficiently find data in a set there exist many technologies and methods but all of them cost in the form of resources like cpu-cycles, memory and storage. In this study a search engine (SE) is optimized using several methods and techniques. Thesis looks into how to optimize a SE that is based on an external revision database.The optimized implementation is compared to a non-optimized implementation when executing a query. An artificial neural network (ANN) trained on a dataset containing 3 years normal usage at a company is used to prioritize within the resultset before returning the result to the caller. The new indexing algorithms have improved the document space complexity by removing all duplicate documents that add no value. Machine learning (ML) has been used to analyze the user behaviour to reduce the necessary amount of documents that gets retrieved by a query. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:hkr:diva-21000application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Search engine
optimization
revision database
machine learning
retrieval models
complexity
Computer Sciences
Datavetenskap (datalogi)
spellingShingle Search engine
optimization
revision database
machine learning
retrieval models
complexity
Computer Sciences
Datavetenskap (datalogi)
Westerdahl, Simon
Lemón Larsson, Fredrik
Optimization for search engines based on external revision database
description The amount of data is continually growing and the ability to efficiently search through vast amounts of data is almost always sought after. To efficiently find data in a set there exist many technologies and methods but all of them cost in the form of resources like cpu-cycles, memory and storage. In this study a search engine (SE) is optimized using several methods and techniques. Thesis looks into how to optimize a SE that is based on an external revision database.The optimized implementation is compared to a non-optimized implementation when executing a query. An artificial neural network (ANN) trained on a dataset containing 3 years normal usage at a company is used to prioritize within the resultset before returning the result to the caller. The new indexing algorithms have improved the document space complexity by removing all duplicate documents that add no value. Machine learning (ML) has been used to analyze the user behaviour to reduce the necessary amount of documents that gets retrieved by a query.
author Westerdahl, Simon
Lemón Larsson, Fredrik
author_facet Westerdahl, Simon
Lemón Larsson, Fredrik
author_sort Westerdahl, Simon
title Optimization for search engines based on external revision database
title_short Optimization for search engines based on external revision database
title_full Optimization for search engines based on external revision database
title_fullStr Optimization for search engines based on external revision database
title_full_unstemmed Optimization for search engines based on external revision database
title_sort optimization for search engines based on external revision database
publisher Högskolan Kristianstad, Fakulteten för naturvetenskap
publishDate 2020
url http://urn.kb.se/resolve?urn=urn:nbn:se:hkr:diva-21000
work_keys_str_mv AT westerdahlsimon optimizationforsearchenginesbasedonexternalrevisiondatabase
AT lemonlarssonfredrik optimizationforsearchenginesbasedonexternalrevisiondatabase
_version_ 1719338708732215296