Research on the Extraction Technology of Hot-words in Tibetan WebPages
Constructing a Tibetan corpus is basic work in the field of Tibetan information processing. This paper uses web crawler technology, preprocessing, and real-time acquisition of web sites to obtain a large Tibetan corpus in a short time. The hot words reflected the hotspot of Tibe...
Main Authors: | Wang Chang-Zhi, Xu Gui-Xian, Wang Hui |
---|---|
Format: | Article |
Language: | English |
Published: | EDP Sciences, 2016-01-01 |
Series: | ITM Web of Conferences |
Online Access: | http://dx.doi.org/10.1051/itmconf/20160701005 |
Similar Items
- An Establishment of WebPage by the Internet Technology
  by: Preeyanat Vongchan
  Published: (2015-02-01)
- Webpage Component Detection and Dynamic Webpage Testing Methods
  by: Wei-Yen Wang, et al.
  Published: (2011)
- The Research of Applying Data Mining Technique on Adaptive WebPages Structure ----- An Example of a Nationwide Large Scaled Furniture Website
  by: Te-Hung Kuo, et al.
  Published: (2004)
- The influence of web design elements on webpage usage
  by: Kang-jung Wong, et al.
  Published: (2005)
- PageRank's ability to track webpage quality: reconciling Google's wisdom-of-crowds justification with the scale-free structure of the web
  by: George Masterton, et al.
  Published: (2018-11-01)