(第 10 期)   第五卷第二期   2011 年 6 月 1 日出刊

維基百科瀏覽輔助介面─整合連結探勘與語意關聯分析

本文關鍵字:連結探勘;正規化Google距離;語意關聯分析;主題導向維基百地圖 Link mining; Normalized Google distance; Semantic relatedness analysis; Topic-based WikiMap

本文摘要

隨著網際網路與Web 2.0技術的推陳出新,以使用者貢獻為本之新型態的社會媒體服務(social media service)網站紛紛崛起。由於網站易於開發與網頁易於存取的特性,造成網路資訊快速的成長,網路世界逐漸成為使用者獲取資訊的來源,其中維基百科(Wikipedia)更為使用者快速獲取定義、解釋……等資訊的重要網路服務。由於網路資訊不斷倍增,故其延伸之主要問題為資訊超載,因此使用者經常花費許多時間尋找與過濾所需資訊。本研究即以Wikipedia為研究對象,以連結探勘與語意關聯分析技術為理論基礎,試圖建構特定主題之知識網路圖。本研究首先提出藉由Wikipedia頁面連結型態(type)與連結頻率(frequency)之連結關聯強度法(link strength measure)以建構初始網路,再進一步採用以搜尋結果為依據之Normalized Google Distance(NGD)演算法計算節點間的語意關係以建構主題網路。本研究最後採用社會網路分析指標來分析主題間之關係,並以視覺化的方式呈現研究結果。本研究透過不同使用者搜尋任務設計以評估所提出方法與建構之主題導向維基百科地圖介面之有效性,研究結果顯示該發展介面有助於協助使用者快速瀏覽Wikipedia資訊,且能協助使用者完成較複雜的任務搜尋。
With the ubiquity of the Internet and the emergence of Web 2.0 technologies, social web sites (i.e., social networking websites and, micro-blogging services) are providing unprecedented opportunities for creating user-generated content, as well as for promoting communication, collaboration and information-sharing among users. Wikipedia, one of the most famous collaborative projects on the Web, has become an extremely popular reference database for people seeking information or knowledge. However, since the number of articles and the wide variety of topics in Wikipedia is constantly expanding, it is difficult for users to find information efficiently via the hypertext links, i.e., the network of linked documents. To address the problem, we propose a hybrid approach that is based on the theories and techniques of link-based analysis and semantic relatedness analysis. Specifically, we employ a link strength measure to establish a preliminary topic network by analyzing the relationships between articles. We also refine the “Normalized Google Distance” to quantify the strength of the relationship between two articles via key terms. Then, we apply social network analysis indicators to determine the relationships between topics and visualize the analysis results in order to help users browse Wikipedia efficiently. Finally, a topic-based WikiMap is generated based on the proposed hybrid approach. We conducted a user-task oriented evaluation study to confirm that the derived topic-based WikiMap can help users browse topics and execute complicated tasks easily and efficiently.
全文下載網址:http://lac3.glis.ntnu.edu.tw/vj-attachment/2011/07/attach72.pdf

本文附件:

本刊著作權屬於「中華民國圖書館學會」所有。
Powered By Vanilla Journal - 香草期刊系統 0.256 / 2006 - 2007 © Weizhong Yang