2 research outputs found

    Searching for Authoritative Documents in Knowledge-Base Communities

    Get PDF
    Knowledge-based communities are popular Web-based tools that allow members to share and seek knowledge globally. However, research on how to search effectively within such knowledge repositories is scant. In this paper we study the problem of finding authoritative documents for user queries within a knowledge-based community. Unlike prior research on the ranking function design which considers only content or hyperlink information, we leverage the social network information embedded in the rich social media, in addition to content, to design novel ranking strategies. Using the Knowledge Adoption Model as the guiding theoretical framework, we design features that gauge the two major factors affecting users’ knowledge adoption decisions: argument quality (AQ) and source credibility (SC). We design two ranking strategies that blend these two sources of evidence with the content-based relevance judgment. A preliminary study using a real world knowledge-based community showed that both AQ and SC features improved search effectiveness

    A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences

    Get PDF
    This paper presents a grammar and semantic corpus based similarity algorithm for natural language sentences. Natural language, in opposition to "artificial language", such as computer programming languages, is the language used by the general public for daily communication. Traditional information retrieval approaches, such as vector models, LSA, HAL, or even the ontologybased approaches that extend to include concept similarity comparison instead of cooccurrence terms/words, may not always determine the perfect matching while there is no obvious relation or concept overlap between two natural language sentences. This paper proposes a sentence similarity algorithm that takes advantage of corpus-based ontology and grammatical rules to overcome the addressed problems. Experiments on two famous benchmarks demonstrate that the proposed algorithm has a significant performance improvement in sentences/short-texts with arbitrary syntax and structure
    corecore