Summary: | 碩士 === 國立中央大學 === 資訊管理研究所 === 94 === As the fast dissemination of research results on the worldwide web, a user’s task of finding useful information becomes more challenging. Usage of scholarly material is growing rapidly and there is a growing demand for high-quality scholarly information. Since a scientific document is a structural text, there would have some useful features that can be used to improve retrieval performance. Here, we investigate three features, fonts, positions and cited references. Although in the past these three individual features have been used in document search, no existing research discusses how to integrate these three together to improve retrieval performance. Therefore, we will first investigate the relationships among them, and then study how to combine them to design a novel retrieval method based on their relationships. Finally, extensive experiments have been carried out through real scientific documents to show its usefulness and performance.
|