Search CORE

48 research outputs found

Evaluation of Automatic Video Captioning Using Direct Assessment

Author: Awad George
Graham Yvette
Smeaton Alan
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 29/10/2017
Field of study

We present Direct Assessment, a method for manually assessing the quality of automatically-generated captions for video. Evaluating the accuracy of video captions is particularly difficult because for any given video clip there is no definitive ground truth or correct answer against which to measure. Automatic metrics for comparing automatic video captions against a manual caption such as BLEU and METEOR, drawn from techniques used in evaluating machine translation, were used in the TRECVid video captioning task in 2016 but these are shown to have weaknesses. The work presented here brings human assessment into the evaluation by crowdsourcing how well a caption describes a video. We automatically degrade the quality of some sample captions which are assessed manually and from this we are able to rate the quality of the human assessors, a factor we take into account in the evaluation. Using data from the TRECVid video-to-text task in 2016, we show how our direct assessment method is replicable and robust and should scale to where there many caption-generation techniques to be evaluated.Comment: 26 pages, 8 figure

arXiv.org e-Print Archive

Directory of Open Access Journals

Machine Learning and Text Segmentation in Novelty Detection

Author: McKeown Kathleen
Schiffman Barry
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2004
Field of study

This paper explores a combination of machine learning, approximate text segmentation and a vector-space model to distinguish novel information from repeated information. In experiments with the data from the Novelty Track at the Text Retrieval Conference, we show improvements over a variety of approaches, in particular in raising precision scores on this data, while maintaining a reasonable amount of recall

CiteSeerX

Columbia University Academic Commons

Context and Learning in Novelty Detection

Author: McKeown Kathleen
Schiffman Barry
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2005
Field of study

We demonstrate the value of using context in a new-information detection system that achieved the highest precision scores at the Text Retrieval Conference's Novelty Track in 2004. In order to determine whether information within a sentence has been seen in material read previously, our system integrates information about the context of the sentence with novel words and named entities within the sentence, and uses a specialized learning algorithm to tune the system parameters

Crossref

Columbia University Academic Commons

A Decision-theoretic Multi-dimensional Model for Meeting Users\u27 Information Filtering Needs

Author: Forgionne Guisseppi
Wang Ye
Publication venue: AIS Electronic Library (AISeL)
Publication date: 31/12/2004
Field of study

AIS Electronic Library (AISeL)

Promoting Insight-Based Evaluation of Visualizations: From Contest to Benchmark Repository

Author: C. Plaisant
G. Grinstein
J.-D. Fekete
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Word Embedding, Neural Networks and Text Classification: What is the State-of-the-Art?

Author: Vilar Estevan
Publication venue: Junior Management Science
Publication date: 20/03/2019
Field of study

In this bachelor thesis, I first introduce the machine learning methodology of text classification with the goal to describe the functioning of neural networks. Then, I identify and discuss the current development of Convolutional Neural Networks and Recurrent Neural Networks from a text classification perspective and compare both models. Furthermore, I introduce different techniques used to translate textual information in a language comprehensible by the computer, which ultimately serve as inputs for the models previously discussed. From there, I propose a method for the models to cope with words absent from a training corpus. This first part has also the goal to facilitate the access to the machine learning world to a broader audience than computer science students and experts. To test the proposal, I implement and compare two state-of-the-art models and eight different word representations using pre-trained vectors on a dataset given by LogMeIn and on a common benchmark. I find that, with my configuration, Convolutional Neural Networks are easier to train and are also yielding better results. Nevertheless, I highlight that models that combine both architectures can potentially have a better performance, but need more work on identifying appropriate hyperparameters for training. Finally, I find that the efficacy of word embedding methods depends not only on the dataset but also on the model used to tackle the subsequent task. In my context, they can boost performance by up to 10.2% compared to a random initialization. However, further investigations are necessary to evaluate the value of my proposal with a corpus that contains a greater ratio of unknown relevant words. Keywords: neural networks; machine learning; word embedding; text classification; business analytic

Junior Management Science (E-Journal - LMÜ München)