31 research outputs found
Improved data visualisation through nonlinear dissimilarity modelling
Inherent to state-of-the-art dimension reduction algorithms is the assumption that global distances between observations are Euclidean, despite the potential for altogether non-Euclidean data manifolds. We demonstrate that a non-Euclidean manifold chart can be approximated by applying a universal approximator over a dictionary of dissimilarity measures, building on recent developments in the field. The approach transfers across domains: observations may be vectors, distributions, graphs, or time series, for instance. Our novel dissimilarity learning method is illustrated on four standard visualisation datasets, showing its benefits over the linear dissimilarity learning approach.
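The idea of embedding from a combined dissimilarity rather than raw Euclidean distance can be sketched as follows. This is a minimal illustration, not the paper's method: the learned nonlinear combination is replaced by a fixed average of two dissimilarity measures, and the data are synthetic.

```python
import numpy as np
from sklearn.metrics import pairwise_distances
from sklearn.manifold import MDS

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 10))  # illustrative data, 50 observations

# A small "dictionary" of dissimilarity measures; the paper learns a
# nonlinear combination, here we simply average two normalised measures.
D_euc = pairwise_distances(X, metric="euclidean")
D_cos = pairwise_distances(X, metric="cosine")
D = 0.5 * (D_euc / D_euc.max()) + 0.5 * (D_cos / D_cos.max())

# Embed the combined dissimilarity matrix in 2-D for visualisation.
emb = MDS(n_components=2, dissimilarity="precomputed",
          random_state=0).fit_transform(D)
print(emb.shape)  # (50, 2)
```

Because MDS accepts a precomputed dissimilarity matrix, any measure in the dictionary (graph, time-series, or distribution dissimilarities alike) can be plugged in the same way.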
The Missing Mass Problem
We give tight lower and upper bounds on the expected missing mass for distributions over finite and countably infinite spaces. An essential characterization of the extremal distributions is given. We also provide an extension to totally bounded metric spaces that may be of independent interest.
Comment: 15 pages
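The missing mass is the total probability of symbols that do not appear in an i.i.d. sample; for a sample of size n from a distribution p, its expectation is the standard quantity E[M_n] = sum_x p_x (1 - p_x)^n. A quick numerical check (the uniform distribution and the sizes below are purely illustrative):

```python
import numpy as np

def expected_missing_mass(p, n):
    """E[M_n] = sum_x p_x * (1 - p_x)^n for an i.i.d. sample of size n."""
    p = np.asarray(p, dtype=float)
    return float(np.sum(p * (1.0 - p) ** n))

# Uniform distribution over k symbols: E[M_n] simplifies to (1 - 1/k)^n.
k, n = 10, 20
val = expected_missing_mass(np.full(k, 1.0 / k), n)
print(round(val, 6))  # (1 - 1/10)**20 ~ 0.121577
```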
Greedy MAXCUT Algorithms and their Information Content
MAXCUT defines a classical NP-hard problem for graph partitioning and serves as a typical case of the symmetric non-monotone Unconstrained Submodular Maximization (USM) problem. Applications of MAXCUT are abundant in machine learning, computer vision and statistical physics. Greedy algorithms that approximately solve MAXCUT rely on greedy vertex labelling or on an edge contraction strategy. These algorithms have been studied through their worst-case approximation ratios, but little is known about their robustness to noise contamination of the input data in the average case. Adapting the framework of Approximation Set Coding, we present a method to exactly measure the cardinality of the algorithmic approximation sets of five greedy MAXCUT algorithms. Their information content is explored for graph instances generated by two different noise models: the edge reversal model and the Gaussian edge weights model. The results provide insights into the robustness of different greedy heuristics and techniques for MAXCUT, which can inform algorithm design for general USM problems.
Comment: This is a longer version of the paper published in the 2015 IEEE Information Theory Workshop (ITW).
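One of the two greedy strategies mentioned, greedy vertex labelling, can be sketched as follows. This is a generic version for illustration, not one of the paper's five specific algorithms: each vertex is placed, in turn, on the side that maximises the weight of cut edges to already-placed vertices.

```python
def greedy_maxcut(n, weights):
    """Greedy vertex labelling for MAXCUT.

    Vertices 0..n-1 are placed one at a time on the side that maximises
    the weight of cut edges to already-placed vertices.
    weights: dict mapping frozenset({u, v}) -> edge weight.
    """
    def w(u, v):
        return weights.get(frozenset((u, v)), 0.0)

    side = {}
    for v in range(n):
        # Weight gained by putting v on side 0 (cut edges to side 1)
        # versus side 1 (cut edges to side 0).
        gain0 = sum(w(v, u) for u, s in side.items() if s == 1)
        gain1 = sum(w(v, u) for u, s in side.items() if s == 0)
        side[v] = 0 if gain0 >= gain1 else 1
    cut = sum(wt for e, wt in weights.items()
              if len({side[u] for u in e}) == 2)
    return side, cut

# Triangle with unit weights: the greedy labelling cuts 2 of the 3 edges.
edges = {frozenset((0, 1)): 1.0, frozenset((1, 2)): 1.0, frozenset((0, 2)): 1.0}
labels, cut_value = greedy_maxcut(3, edges)
print(cut_value)  # 2.0
```

Different tie-breaking and vertex-ordering rules yield different members of the greedy family, which is exactly the kind of variation whose information content the abstract's framework measures.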
Multiple Kernel-Based Multimedia Fusion for Automated Event Detection from Tweets
A method for detecting hot events such as wildfires is proposed. It uses visual and textual information jointly to improve detection. Starting from tweets that contain both text and images, it preprocesses the data to eliminate unwanted items, transforms the unstructured data into structured form, and then extracts features. Text features include term frequency-inverse document frequency (TF-IDF); image features include the histogram of oriented gradients (HOG), the gray-level co-occurrence matrix (GLCM), the color histogram, and the scale-invariant feature transform (SIFT). The features are then fed to multiple kernel learning (MKL), which automatically fuses both feature types to achieve the best performance, before the final event detection step. The method was tested on the 2014 Brisbane hailstorm and the 2017 California wildfires and compared against methods using text only or images only. On the Brisbane hailstorm data, the proposed method achieved the best performance, with a fusion accuracy of 0.93, compared with 0.89 for text only and 0.85 for images only. A similar pattern was observed on the California wildfires data. The results demonstrate that combining multiple feature types enhances event detection on Twitter, yielding an accurate and effective method for spreading awareness and organizing responses, and ultimately better disaster management.
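The fusion step can be sketched with a fixed combination of kernels over the two feature types. This is a simplified stand-in, not the paper's pipeline: MKL learns the kernel weights from data, whereas below a uniform combination is hard-coded, and the "text" and "image" features are random placeholders for TF-IDF and HOG/GLCM/SIFT descriptors.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.metrics.pairwise import rbf_kernel, linear_kernel

rng = np.random.default_rng(0)
# Hypothetical stand-ins for text (TF-IDF) and image (HOG/SIFT) features.
X_text = rng.normal(size=(100, 20))
X_img = rng.normal(size=(100, 30))
y = (X_text[:, 0] + X_img[:, 0] > 0).astype(int)  # synthetic labels

# MKL would learn the kernel weights; a fixed 0.5/0.5 fusion stands in.
K = 0.5 * linear_kernel(X_text) + 0.5 * rbf_kernel(X_img, gamma=0.05)

# An SVM accepts the fused Gram matrix directly as a precomputed kernel.
clf = SVC(kernel="precomputed").fit(K, y)
acc = clf.score(K, y)  # training accuracy on the fused kernel
print(acc)
```

A convex combination of valid kernels is itself a valid kernel, which is what lets MKL search over fusion weights while keeping the downstream classifier unchanged.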