Search CORE

494 research outputs found

Domain specific syntax based approach for text classification in machine learning context

Author: Bader-El-Den Mohamed
Cocea Mihaela
Liu Han
Mohasseb Alaa
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 16/11/2017
Field of study

Crossref

Portsmouth University Research Portal (Pure)

Survey and evaluation of query intent detection methods

Author: Brenes Martínez David José
Gayo Avello Daniel
Pérez González Kilian
Publication venue: ACM
Publication date: 01/01/2009
Field of study

Second ACM International Conference on Web Search and Data Mining, Barcelona (Spain)User interactions with search engines reveal three main underlying intents, namely navigational, informational, and transactional. By providing more accurate results depending on such query intents the performance of search engines can be greatly improved. Therefore, query classification has been an active research topic for the last years. However, while query topic classification has deserved a specific bakeoff, no evaluation campaign has been devoted to the study of automatic query intent detection. In this paper some of the available query intent detection techniques are reviewed, an evaluation framework is proposed, and it is used to compare those methods in order to shed light on their relative performance and drawbacks. As it will be shown, manually prepared gold-standard files are much needed, and traditional pooling is not the most feasible evaluation method. In addition to this, future lines of work in both query intent detection and its evaluation are propose

Crossref

Repositorio Institucional de la Universidad de Oviedo

A customised grammar framework for query classification

Author: Bader-El-Den Mohamed
Cocea Mihaela
Mohasseb Alaa
Publication venue: 'Elsevier BV'
Publication date: 30/11/2019
Field of study

Portsmouth University Research Portal (Pure)

Identifying The Relationship Between Page Content and Title

Author: Detchou Yabo Ornella
Publication venue: Bard Digital Commons
Publication date: 01/01/2022
Field of study

This project seeks to find the similarity score between content on the page and title using cosine similarity from a word2vec model. Frequent words and randomly chosen words from each article were analyzed and compared against the title using three samples. Frequent words were found to have a higher similarity score with the title than random words. Word frequency helps you identify the most relevant keyword on the page. The bigger goal of the project is to develop a keyword suggestion tool. Identifying which keywords are most relevant in writing content is the first step

Bard College

Intent-aware search result diversification

Author: Macdonald C.
Ounis I.
Santos R.L.T.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2011
Field of study

Search result diversification has gained momentum as a way to tackle ambiguous queries. An effective approach to this problem is to explicitly model the possible aspects underlying a query, in order to maximise the estimated relevance of the retrieved documents with respect to the different aspects. However, such aspects themselves may represent information needs with rather distinct intents (e.g., informational or navigational). Hence, a diverse ranking could benefit from applying intent-aware retrieval models when estimating the relevance of documents to different aspects. In this paper, we propose to diversify the results retrieved for a given query, by learning the appropriateness of different retrieval models for each of the aspects underlying this query. Thorough experiments within the evaluation framework provided by the diversity task of the TREC 2009 and 2010 Web tracks show that the proposed approach can significantly improve state-of-the-art diversification approaches

CiteSeerX

Crossref

Enlighten