The Monkeytyping Solution to the YouTube-8M Video Understanding
  Challenge

Wang, He-Da; Zhang, Teng; Wu, Ji

research

The Monkeytyping Solution to the YouTube-8M Video Understanding Challenge

Authors: He-Da Wang
Teng Zhang
Ji Wu
Publication date: 1 January 1880
Publisher

Abstract

This article describes the final solution of team monkeytyping, who finished in second place in the YouTube-8M video understanding challenge. The dataset used in this challenge is a large-scale benchmark for multi-label video classification. We extend the work in [1] and propose several improvements for frame sequence modeling. We propose a network structure called Chaining that can better capture the interactions between labels. Also, we report our approaches in dealing with multi-scale information and attention pooling. In addition, We find that using the output of model ensemble as a side target in training can boost single model performance. We report our experiments in bagging, boosting, cascade, and stacking, and propose a stacking algorithm called attention weighted stacking. Our final submission is an ensemble that consists of 74 sub models, all of which are listed in the appendix.Comment: Submitted to the CVPR 2017 Workshop on YouTube-8M Large-Scale Video Understandin

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Biblioteca Virtual del Patrimonio Bibliográfico (Virtual Library of Bibliographical Heritage)

oai:bvpb.mcu.es:502288

Last time updated on 12/01/2018

Biblioteca Virtual del Patrimonio Bibliográfico (Virtual Library of Bibliographical Heritage)

oai:bvpb.mcu.es:502286

Last time updated on 12/01/2018