Optimization for search engines based on external revision database
The amount of data is continually growing and the ability to efficiently search through vast amounts of data is almost always sought after. To efficiently find data in a set there exist many technologies and methods but all of them cost in the form of resources like cpu-cycles, memory and storage. I...
Main Authors: | , |
---|---|
Format: | Others |
Language: | English |
Published: |
Högskolan Kristianstad, Fakulteten för naturvetenskap
2020
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:se:hkr:diva-21000 |
id |
ndltd-UPSALLA1-oai-DiVA.org-hkr-21000 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-hkr-210002020-08-25T05:41:47ZOptimization for search engines based on external revision databaseengWesterdahl, SimonLemón Larsson, FredrikHögskolan Kristianstad, Fakulteten för naturvetenskapHögskolan Kristianstad, Fakulteten för naturvetenskap2020Search engineoptimizationrevision databasemachine learningretrieval modelscomplexityComputer SciencesDatavetenskap (datalogi)The amount of data is continually growing and the ability to efficiently search through vast amounts of data is almost always sought after. To efficiently find data in a set there exist many technologies and methods but all of them cost in the form of resources like cpu-cycles, memory and storage. In this study a search engine (SE) is optimized using several methods and techniques. Thesis looks into how to optimize a SE that is based on an external revision database.The optimized implementation is compared to a non-optimized implementation when executing a query. An artificial neural network (ANN) trained on a dataset containing 3 years normal usage at a company is used to prioritize within the resultset before returning the result to the caller. The new indexing algorithms have improved the document space complexity by removing all duplicate documents that add no value. Machine learning (ML) has been used to analyze the user behaviour to reduce the necessary amount of documents that gets retrieved by a query. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:hkr:diva-21000application/pdfinfo:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
Search engine optimization revision database machine learning retrieval models complexity Computer Sciences Datavetenskap (datalogi) |
spellingShingle |
Search engine optimization revision database machine learning retrieval models complexity Computer Sciences Datavetenskap (datalogi) Westerdahl, Simon Lemón Larsson, Fredrik Optimization for search engines based on external revision database |
description |
The amount of data is continually growing and the ability to efficiently search through vast amounts of data is almost always sought after. To efficiently find data in a set there exist many technologies and methods but all of them cost in the form of resources like cpu-cycles, memory and storage. In this study a search engine (SE) is optimized using several methods and techniques. Thesis looks into how to optimize a SE that is based on an external revision database.The optimized implementation is compared to a non-optimized implementation when executing a query. An artificial neural network (ANN) trained on a dataset containing 3 years normal usage at a company is used to prioritize within the resultset before returning the result to the caller. The new indexing algorithms have improved the document space complexity by removing all duplicate documents that add no value. Machine learning (ML) has been used to analyze the user behaviour to reduce the necessary amount of documents that gets retrieved by a query. |
author |
Westerdahl, Simon Lemón Larsson, Fredrik |
author_facet |
Westerdahl, Simon Lemón Larsson, Fredrik |
author_sort |
Westerdahl, Simon |
title |
Optimization for search engines based on external revision database |
title_short |
Optimization for search engines based on external revision database |
title_full |
Optimization for search engines based on external revision database |
title_fullStr |
Optimization for search engines based on external revision database |
title_full_unstemmed |
Optimization for search engines based on external revision database |
title_sort |
optimization for search engines based on external revision database |
publisher |
Högskolan Kristianstad, Fakulteten för naturvetenskap |
publishDate |
2020 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:se:hkr:diva-21000 |
work_keys_str_mv |
AT westerdahlsimon optimizationforsearchenginesbasedonexternalrevisiondatabase AT lemonlarssonfredrik optimizationforsearchenginesbasedonexternalrevisiondatabase |
_version_ |
1719338708732215296 |