Search CORE

26,273 research outputs found

An ontology enhanced parallel SVM for scalable spam filter training

Author: Bauer
Blanco
Blanzieri
Blei
Breiman
Cao
Caruana
Chawla
Colas
Cristianini
Dean
Do
Gansterer
Godwin Caruana
Graf
Hall
Huang
Kearns
Kim
Maozhen Li
Mei
Platt
Suykens
Taura
Vapnik
Wang
Woodsend
Yang Liu
Zanghirati
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/05/2013
Field of study

This is the post-print version of the final paper published in Neurocomputing. The published article is available from the link below. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. Copyright @ 2013 Elsevier B.V.Spam, under a variety of shapes and forms, continues to inflict increased damage. Varying approaches including Support Vector Machine (SVM) techniques have been proposed for spam filter training and classification. However, SVM training is a computationally intensive process. This paper presents a MapReduce based parallel SVM algorithm for scalable spam filter training. By distributing, processing and optimizing the subsets of the training data across multiple participating computer nodes, the parallel SVM reduces the training time significantly. Ontology semantics are employed to minimize the impact of accuracy degradation when distributing the training data among a number of SVM classifiers. Experimental results show that ontology based augmentation improves the accuracy level of the parallel SVM beyond the original sequential counterpart

Crossref

Brunel University Research Archive

How proofs are prepared at Camelot

Author: Freivalds R.
Gao S.
Nešetřil J.
Publication venue
Publication date: 01/01/2016
Field of study

We study a design framework for robust, independently verifiable, and workload-balanced distributed algorithms working on a common input. An algorithm based on the framework is essentially a distributed encoding procedure for a Reed--Solomon code, which enables (a) robustness against byzantine failures with intrinsic error-correction and identification of failed nodes, and (b) independent randomized verification to check the entire computation for correctness, which takes essentially no more resources than each node individually contributes to the computation. The framework builds on recent Merlin--Arthur proofs of batch evaluation of Williams~[{\em Electron.\ Colloq.\ Comput.\ Complexity}, Report TR16-002, January 2016] with the observation that {\em Merlin's magic is not needed} for batch evaluation---mere Knights can prepare the proof, in parallel, and with intrinsic error-correction. The contribution of this paper is to show that in many cases the verifiable batch evaluation framework admits algorithms that match in total resource consumption the best known sequential algorithm for solving the problem. As our main result, we show that the

k

-cliques in an

n

-vertex graph can be counted {\em and} verified in per-node

O(n^{(\omega+\epsilon)k/6})

time and space on

O(n^{(\omega+\epsilon)k/6})

compute nodes, for any constant

\epsilon>0

and positive integer

k

divisible by

6

, where

2\leq\omega<2.3728639

is the exponent of matrix multiplication. This matches in total running time the best known sequential algorithm, due to Ne{\v{s}}et{\v{r}}il and Poljak [{\em Comment.~Math.~Univ.~Carolin.}~26 (1985) 415--419], and considerably improves its space usage and parallelizability. Further results include novel algorithms for counting triangles in sparse graphs, computing the chromatic polynomial of a graph, and computing the Tutte polynomial of a graph.Comment: 42 p

arXiv.org e-Print Archive

Lund University Publications

Crossref

Domain decomposition methods for compressed sensing

Author: Fornasier Massimo
Langer Andreas
Schönlieb Carola-Bibiane
Publication venue
Publication date: 01/01/2009
Field of study

We present several domain decomposition algorithms for sequential and parallel minimization of functionals formed by a discrepancy term with respect to data and total variation constraints. The convergence properties of the algorithms are analyzed. We provide several numerical experiments, showing the successful application of the algorithms for the restoration 1D and 2D signals in interpolation/inpainting problems respectively, and in a compressed sensing problem, for recovering piecewise constant medical-type images from partial Fourier ensembles.Comment: 4 page

arXiv.org e-Print Archive

CiteSeerX

Coventry University Pure Portal

An efficient steady-state analysis of the eddy current problem using a parallel-in-time algorithm

Author: De Gersem Herbert
Kulchytska-Ruchka Iryna
Schöps Sebastian
Publication venue: 'Institution of Engineering and Technology (IET)'
Publication date: 01/01/2019
Field of study

This paper introduces a parallel-in-time algorithm for efficient steady-state solution of the eddy current problem. Its main idea is based on the application of the well-known multi-harmonic (or harmonic balance) approach as the coarse solver within the periodic parallel-in-time framework. A frequency domain representation allows for the separate calculation of each harmonic component in parallel and therefore accelerates the solution of the time-periodic system. The presented approach is verified for a nonlinear coaxial cable model

arXiv.org e-Print Archive

TUbiblio

Crossref

Objective multiscale analysis of random heterogeneous materials

Author: Everdij F. P. X.
Lloberas Valls Oriol
Rixen D. J.
Simone A.
Sluys L. J.
Publication venue: CIMNE
Publication date: 01/01/2013
Field of study

The multiscale framework presented in [1, 2] is assessed in this contribution for a study of random heterogeneous materials. Results are compared to direct numerical simulations (DNS) and the sensitivity to user-deﬁned parameters such as the domain decomposition type and initial coarse scale resolution is reported. The parallel performance of the implementation is studied for diﬀerent domain decompositions

UPCommons. Portal del coneixement obert de la UPC