Detection of website replicas in search engines using machine learning (original title: Detecção de réplicas de sítios Web em máquinas de busca usando aprendizado de máquina)

Earlier work estimates that at least 30% of all content available on the Web is replicated, posing serious challenges to search engines, such as wasted computational resources and reduced search effectiveness. Thus, detection of replicated websites is currently a prerequisite f...

Bibliographic Details
Main Author: Rickson Guidolini
Other Authors: Nivio Ziviani
Format: Others
Language: English
Published: Universidade Federal de Minas Gerais 2011
Online Access: http://hdl.handle.net/1843/SLSS-8GQLET