A Study on the Cited Sources of Answers on Web Question Answering Service

By Yi-Rong He (何怡融)

Abstract

The widespread use of the Internet has made sharing information and communicating more convenient, and it is now common for users to search the web for information. Keyword search engines cover a broad range of information, whereas web question answering services let users look for information by posting a question. This type of service saves users the time of filtering information, so they can collect information more efficiently. The purpose of this study was to analyze the sources cited in answers on the web question answering service Taiwan's Yahoo! Knowledge+, including the cited source channel, cited source type, and cited source theme category, and then to judge the quality of the cited sources by their accessibility, repeatability, and trustworthiness. This study uses content analysis. We collected data with a crawler that browsed Taiwan's Yahoo! Knowledge+, which hosts knowledge-based questions. The sampling period ran from March 1, 2010 to March 20, 2010. From this pool, after excluding the "Anxieties and feelings" category, we took the first 1,000 questions from each of ten categories, for a total of 10,000 questions. The total number of answers is 23,874 and the total number of cited sources is 16,239. Thus, the average number of answers per question is 2.39, and the average number of cited sources per answer is 0.68. Lastly, we randomly selected a total of 5,391 sources for further analysis. Cited source theme categories, based on the Yahoo! Knowledge+ category scheme, were divided into 17 major theme categories. Our results show that the most frequently cited source channels are human cited sources (49.56%) and Internet cited sources (49.05%). By genre, “personal experience” was the main genre among human cited sources, and “blog” was the main genre among Internet cited sources. The majority of the Internet cited sources come from the ".com" domain.
“Life information” is the most cited subject category. “Business & Finance” has the highest rate of repeatability in the cited source classification. Because some “Business & Finance” web materials may be unreliable, identifying the cited sources helps in analyzing the quality of the answers. In addition, cited sources show the following characteristics: (1) Questions that are rich in content draw answers citing multiple source channels. (2) In most cases, novice users ask the questions and senior users answer them. Cited sources in the response content help make an answer more complete and help in determining the best answer. Finally, we recommend that users of web question answering services try to assess information objectively and confirm it. Providers of web question answering services could offer guidelines on citing sources, which would help enhance the quality of Web Social Q&A.
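As a quick check on the averages reported in the abstract, the arithmetic can be reproduced from the stated totals. This is only a minimal sketch: the variable names are illustrative and do not come from the study, and the figures are the ones quoted above.

```python
# Totals as stated in the abstract (variable names are illustrative).
total_questions = 10_000       # 1,000 questions from each of ten categories
total_answers = 23_874         # answers collected for those questions
total_cited_sources = 16_239   # cited sources found in those answers

# Average number of answers per question, rounded to two decimals.
answers_per_question = round(total_answers / total_questions, 2)

# Average number of cited sources per answer, rounded to two decimals.
sources_per_answer = round(total_cited_sources / total_answers, 2)

print(answers_per_question)  # 2.39
print(sources_per_answer)    # 0.68
```

Both values agree with the averages given in the abstract.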

Topics: 網路問答; 網路問答服務; 引用來源; Web Social Q&A; Web question answering services; Cited source
Year: 2011
OAI identifier: oai:ir.lib.ntnu.edu.tw:309250000Q/75711