
2,720 research outputs found

A User-Centered Concept Mining System for Query and Document Understanding at Tencent

Author: Guo Weidong
Lai Kunfeng
Lin Jinghong
Liu Bang
Niu Di
Wang Chaoyue
Xu Shunnan
Xu Yu
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 21/05/2019

Concepts embody the knowledge of the world and facilitate the cognitive processes of human beings. Mining concepts from web documents and constructing the corresponding taxonomy are core research problems in text understanding and support many downstream tasks such as query analysis, knowledge base construction, recommendation, and search. However, we argue that most prior studies extract formal and overly general concepts from Wikipedia or static web pages, which do not represent the user perspective. In this paper, we describe our experience of implementing and deploying ConcepT in Tencent QQ Browser. It discovers user-centered concepts at the right granularity, conforming to user interests, by mining a large amount of user queries and interactive search click logs. The extracted concepts have the proper granularity, are consistent with user language styles, and are dynamically updated. We further present our techniques to tag documents with user-centered concepts and to construct a topic-concept-instance taxonomy, which has helped to improve search as well as news feed recommendation in Tencent QQ Browser. We performed extensive offline evaluation to demonstrate that our approach could extract concepts of higher quality compared to several other existing methods. Our system has been deployed in Tencent QQ Browser. Results from online A/B testing involving a large number of real users suggest that the Impression Efficiency of feeds users increased by 6.01% after incorporating the user-centered concepts into the recommendation framework of Tencent QQ Browser.

Comment: Accepted by KDD 201

arXiv.org e-Print Archive

Crossref

Knowledge Graph based Question and Answer System for Cosmetic Domain

Author: Ren Fuji
Xue Siyuan
Publication venue: AIA International Advanced Information Institute
Publication date: 10/08/2021

With the development of e-commerce, customers' requirements for products have become more detailed, and the workload of customer service consultants will increase massively. However, the manufacturer is not obliged to provide specific product ingredients on the website. Therefore, it is necessary to construct a KBQA system to relieve the pressure on online customer service and effectively help customers find suitable skincare products. In the cosmetic field, different basic cosmetics may have varied effects depending on their ingredients. In this paper, we utilize the CosDNA website and online cosmetic websites to construct a cosmetic product knowledge graph that broadens the relationships between cosmetics, ingredients, skin types, and effects. In addition, we build a question answering system based on the cosmetic knowledge graph to allow users to understand product details directly and make decisions quickly.
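The abstract does not describe the graph schema, so the following is only a minimal sketch of how relationships between cosmetics, ingredients, and effects could be held as triples and traversed. The class `TripleGraph`, the relation names `has_ingredient` and `has_effect`, and all product and ingredient entries are hypothetical placeholders, not data from the paper.

```python
from collections import defaultdict


class TripleGraph:
    """Tiny in-memory store of (head, relation, tail) triples (illustrative only)."""

    def __init__(self):
        # Maps (head, relation) to the set of tails linked by that relation.
        self._out = defaultdict(set)

    def add(self, head, relation, tail):
        self._out[(head, relation)].add(tail)

    def tails(self, head, relation):
        # .get avoids creating empty entries for unknown keys.
        return set(self._out.get((head, relation), set()))


def product_effects(graph, product):
    """Collect effects reachable via product -> ingredient -> effect."""
    effects = set()
    for ingredient in graph.tails(product, "has_ingredient"):
        effects |= graph.tails(ingredient, "has_effect")
    return effects


# Hypothetical placeholder data, not taken from the paper or from CosDNA.
kg = TripleGraph()
kg.add("Example Toner", "has_ingredient", "glycerin")
kg.add("Example Toner", "has_ingredient", "niacinamide")
kg.add("glycerin", "has_effect", "moisturizing")
kg.add("niacinamide", "has_effect", "brightening")

print(sorted(product_effects(kg, "Example Toner")))
```

A two-hop lookup like this is the kind of question ("what does this product do?") that a product page without ingredient details cannot answer directly.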

Tokushima University Institutional Repository

Keyword Search on RDF Graphs - A Query Graph Assembly Approach

Author: Han Shuo
Yu Jeffrey Xu
Zhao Dongyan
Zou Lei
Publication date: 25/08/2017

Keyword search provides ordinary users with an easy-to-use interface for querying RDF data. Given the input keywords, in this paper we study how to assemble a query graph that represents the user's query intention accurately and efficiently. Based on the input keywords, we first obtain the elementary query graph building blocks, such as entity/class vertices and predicate edges. Then, we formally define the query graph assembly (QGA) problem. Unfortunately, we prove theoretically that QGA is an NP-complete problem. To solve it, we design some heuristic lower bounds and propose a bipartite graph matching-based best-first search algorithm. The algorithm's time complexity is $O(k^{2l} \cdot l^{3l})$, where $l$ is the number of keywords and $k$ is a tunable parameter, i.e., the maximum number of candidate entity/class vertices and predicate edges allowed to match each keyword. Although QGA is intractable, both $l$ and $k$ are small in practice. Furthermore, the algorithm's time complexity does not depend on the RDF graph size, which guarantees the good scalability of our system on large RDF graphs. Experiments on DBpedia and Freebase confirm the superiority of our system in both effectiveness and efficiency.
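To make the stated bound concrete, the sketch below evaluates the expression k^(2l) * l^(3l) for a few small values. It is not the authors' algorithm. The function name `qga_bound` is hypothetical, and the constant factors hidden by the O-notation are ignored. The function takes only `k` and `l`, which mirrors the abstract's point that the bound does not depend on the RDF graph size.

```python
def qga_bound(k: int, l: int) -> int:
    """Evaluate k^(2l) * l^(3l), the expression inside the stated O-bound.

    k: maximum number of candidates matched to each keyword.
    l: number of keywords.
    Constant factors hidden by the O-notation are ignored.
    """
    return k ** (2 * l) * l ** (3 * l)


# Tabulate the expression for a few small (k, l) pairs.
for l in (2, 3, 4):
    for k in (2, 5):
        print(f"k={k}, l={l}: {qga_bound(k, l):,}")
```

For k = 2 and l = 2 the expression evaluates to 2^4 * 2^6 = 1024. It grows quickly with l, so the bound is only useful when queries are short, as the abstract says they are in practice.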

arXiv.org e-Print Archive

Crossref

OPUS - University of Technology Sydney

Intention to Use Abstract Sentence Classification Technology

Author: Busch Peter
Smith Stephen
Stead Connor
Vatanasakdakul Savanid
Publication venue: AIS Electronic Library (AISeL)
Publication date: 01/01/2021

This paper introduces research in progress to study the intention of researchers to use academic abstract sentence classification technology when undertaking literature acquisition activities. We introduce an enhanced prototypical academic abstract sentence classification system capable of performing on-demand sentence classification for metadata results from several academic literature indices. We also outline a preliminary theoretical information systems model developed to explore the intention of researchers to use the system when searching for literature via digital means. Additionally, we provide the survey instrument to be used for review. The overarching body of work this paper introduces will benefit the research community, as it is the first time primary research has been conducted to examine the utility of this technology in helping researchers interact more efficiently with the large body of literature digitally available.

AIS Electronic Library (AISeL)

Research on Medical Question Answering System Based on Knowledge Graph

Author: Chi Chengying
Jiang Zhixue
Zhan Yun Yun
Publication venue: Technological University Dublin
Publication date: 01/01/2021

To meet the high-efficiency question answering needs of existing patients and doctors, this system integrates medical professional knowledge, knowledge graphs, and question answering systems that conduct man-machine dialogue through natural language. The system focuses on the medical field, uses crawler technology to collect data from vertical medical websites, and takes diseases as the core entity to construct a knowledge graph containing 44,000 knowledge entities of 7 types and 300,000 entities of 11 kinds. The graph is stored in the Neo4j graph database, and rule-based matching methods and string-matching algorithms are used to construct a domain lexicon for classifying and querying questions. This system has practical value for knowledge graphs and question answering systems in the medical field.
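The rule-based matching step can be sketched as follows, under stated assumptions: substring matching against a small disease lexicon, keyword rules for question intent, and a parameterized Cypher template per intent. The lexicon entries, intent names, node labels (`Disease`, `Symptom`, `Drug`), and relationship types (`HAS_SYMPTOM`, `TREATED_BY`) are all hypothetical. The abstract does not give the system's actual schema or rules.

```python
from typing import Optional, Tuple

# Hypothetical domain lexicon; the real system builds its lexicon from crawled data.
DISEASE_LEXICON = {"influenza", "asthma"}

# Hypothetical keyword rules mapping surface words to a question intent.
INTENT_KEYWORDS = {
    "symptom": ("symptom", "sign"),
    "treatment": ("treat", "cure", "medicine"),
}

# Parameterized Cypher templates; labels and relationship types are assumptions.
CYPHER_TEMPLATES = {
    "symptom": "MATCH (d:Disease {name: $name})-[:HAS_SYMPTOM]->(s:Symptom) RETURN s.name",
    "treatment": "MATCH (d:Disease {name: $name})-[:TREATED_BY]->(t:Drug) RETURN t.name",
}


def find_entity(question: str) -> Optional[str]:
    """Return the longest lexicon entry that occurs in the question, if any."""
    q = question.lower()
    matches = [entry for entry in DISEASE_LEXICON if entry in q]
    return max(matches, key=len) if matches else None


def classify(question: str) -> Optional[str]:
    """Return the first intent whose keyword appears in the question, if any."""
    q = question.lower()
    for intent, keywords in INTENT_KEYWORDS.items():
        if any(keyword in q for keyword in keywords):
            return intent
    return None


def build_query(question: str) -> Optional[Tuple[str, dict]]:
    """Map a question to a (Cypher text, parameters) pair, or None if unmatched."""
    entity = find_entity(question)
    intent = classify(question)
    if entity is None or intent is None:
        return None
    return CYPHER_TEMPLATES[intent], {"name": entity}


print(build_query("What are the symptoms of asthma?"))
```

Passing the disease name as a query parameter rather than formatting it into the Cypher string keeps the templates fixed and avoids injection through user input.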

Arrow@TUDublin