1,842 research outputs found

Decoding billions of integers per second through vectorization

Author: Aksyonoff A
Büttcher S
Jones DM
Witten IH
Publication venue: 'Wiley'
Publication date: 01/01/2015

In many important applications -- such as search engines and relational database systems -- data is stored in the form of arrays of integers. Encoding and, most importantly, decoding of these arrays consumes considerable CPU time. Therefore, substantial effort has been made to reduce costs associated with compression and decompression. In particular, researchers have exploited the superscalar nature of modern processors and SIMD instructions. Nevertheless, we introduce a novel vectorized scheme called SIMD-BP128 that improves over previously proposed vectorized approaches. It is nearly twice as fast as the previously fastest schemes on desktop processors (varint-G8IU and PFOR). At the same time, SIMD-BP128 saves up to 2 bits per integer. For even better compression, we propose another new vectorized scheme (SIMD-FastPFOR) that has a compression ratio within 10% of a state-of-the-art scheme (Simple-8b) while being two times faster during decoding.

Comment: For software, see https://github.com/lemire/FastPFor. For data, see http://boytsov.info/datasets/clueweb09gap
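The abstract above does not spell out how the schemes work. As a rough illustration of the general idea behind binary packing of integer arrays, the following is a minimal scalar sketch: it assumes a sorted list of non-negative integers, stores the gaps between successive values, and packs every gap in a block with one shared bit width. The function names are hypothetical, and this is not the authors' SIMD-BP128 code, which relies on SIMD instructions.

```python
def delta_encode(values):
    """Replace a sorted list by the gaps between successive values.

    The first gap is measured from 0 (an assumption of this sketch).
    """
    prev = 0
    gaps = []
    for v in values:
        gaps.append(v - prev)
        prev = v
    return gaps


def pack_block(gaps):
    """Pack non-negative integers using a single shared bit width.

    Returns (width, count, packed), where packed is one Python int
    holding all gaps, the i-th gap at bit offset i * width.
    """
    width = max(1, max(gaps).bit_length()) if gaps else 1
    packed = 0
    for i, g in enumerate(gaps):
        packed |= g << (i * width)
    return width, len(gaps), packed


def unpack_block(width, count, packed):
    """Recover the gaps by shifting and masking."""
    mask = (1 << width) - 1
    return [(packed >> (i * width)) & mask for i in range(count)]


def delta_decode(gaps):
    """Rebuild the original values as a running sum of the gaps."""
    out, total = [], 0
    for g in gaps:
        total += g
        out.append(total)
    return out
```

Decoding is `delta_decode(unpack_block(width, count, packed))`. The shared width is why small gaps compress well: each integer costs only as many bits as the largest gap in its block needs.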

arXiv.org e-Print Archive

R-libre

Crossref

Automated forensic extraction of encryption keys using behavioural analysis

Author: Owen Gareth
Publication date: 01/06/2012

Portsmouth University Research Portal (Pure)

The Inflation Technique Completely Solves the Causal Compatibility Problem

Author: Navascues Miguel
Wolfe Elie
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 04/08/2020

The causal compatibility question asks whether a given causal structure graph -- possibly involving latent variables -- constitutes a genuinely plausible causal explanation for a given probability distribution over the graph's observed variables. Algorithms predicated on merely necessary constraints for causal compatibility typically suffer from false negatives, i.e. they admit incompatible distributions as apparently compatible with the given graph. In [arXiv:1609.00672], one of us introduced the inflation technique for formulating useful relaxations of the causal compatibility problem in terms of linear programming. In this work, we develop a formal hierarchy of such causal compatibility relaxations. We prove that inflation is asymptotically tight, i.e., that the hierarchy converges to a zero-error test for causal compatibility. In this sense, the inflation technique fulfills a longstanding desideratum in the field of causal inference. We quantify the rate of convergence by showing that any distribution which passes the $n^{\text{th}}$-order inflation test must be $O\left(n^{-1/2}\right)$-close in Euclidean norm to some distribution genuinely compatible with the given causal structure. Furthermore, we show that for many causal structures, the (unrelaxed) causal compatibility problem is faithfully formulated already by either the first or second order inflation test.

Comment: Updated to match forthcoming journal publication as closely as possible. Some content removed for brevity. Expanded citations. Most footnotes moved into the main text. Significant changes to subsection 4.1, where we corrected an error in the example of second order inflation not converging, and added a converse example where second order inflation outperforms other techniques.
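The convergence rate stated in the abstract can be written out as follows. This is only a restatement under assumed notation: $P$ is the tested distribution, $Q$ a compatible one, and $C$ stands for the constant hidden by the $O$ notation, which the abstract does not specify and which may depend on the causal structure.

```latex
% If P passes the n-th order inflation test, then some compatible Q satisfies
\[
  \lVert P - Q \rVert_2 \;\le\; \frac{C}{\sqrt{n}} .
\]
% Hence, to certify distance at most \varepsilon it suffices to pass order
\[
  n \;\ge\; \left(\frac{C}{\varepsilon}\right)^{2} .
\]
```

Under this reading, halving the target distance $\varepsilon$ requires roughly four times the inflation order.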

arXiv.org e-Print Archive

Instructions-Based Detection of Sophisticated Obfuscation and Packing

Author: Ratazzi Edward Paul
Saleh Moustafa
Xu Shouhuai
Publication venue: SURFACE at Syracuse University
Publication date: 01/10/2014

Every day, thousands of malware samples are released online. The vast majority of them employ some kind of obfuscation, ranging from simple XOR encryption to more sophisticated anti-analysis, packing, and encryption techniques. Dynamic analysis methods can unpack the file and reveal its hidden code. However, these methods are very time-consuming compared to static analysis. Moreover, considering the large amount of new malware produced daily, it is not practical to depend solely on dynamic analysis methods. Therefore, finding an effective way to filter the samples and delegate only obfuscated and suspicious ones to more rigorous tests would significantly improve the overall scanning process. Current techniques for identifying obfuscation rely mainly on signatures of known packers, file entropy scores, or anomalies in the file header. However, these features are not only easily bypassed, but also do not cover all types of obfuscation. In this paper, we introduce a novel approach to identifying obfuscated files based on anomalies in their instruction-based characteristics. We detect the presence of interleaving instructions, which result from the opaque predicate anti-disassembly trick, and present distinguishing statistical properties based on the opcodes and control flow graphs of obfuscated files. Our detection system combines these features with other structural features of the file and achieves very good results in detecting obfuscated malware.
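Of the existing techniques the abstract mentions, the file entropy score is the simplest to illustrate. The sketch below computes the Shannon entropy of a byte string in bits per byte. It shows that baseline feature only, not the authors' instruction-based method, and the function name is hypothetical.

```python
import math
from collections import Counter


def byte_entropy(data: bytes) -> float:
    """Shannon entropy of a byte string, in bits per byte (0.0 to 8.0)."""
    if not data:
        return 0.0
    n = len(data)
    counts = Counter(data)  # occurrences of each byte value
    return -sum((c / n) * math.log2(c / n) for c in counts.values())
```

Packed or encrypted sections tend to score close to 8 bits per byte, while plain code and text score lower. Any cut-off between the two would be an assumption to tune on real samples, and, as the abstract notes, the feature is easy to bypass.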

Crossref

Syracuse University Research Facility and Collaborative Environment

Role of Secondary Motifs in Fast Folding Polymers: A Dynamical Variational Principle

Author: A. P. Capaldi
Amos Maritan
B. H. Zimm
C. Levinthal
C. Micheletti
Cristian Micheletti
D. G. Covell
D. P. Yee
H. Li
I. M. Lifshitz
J. C. Nelson
J. D. Bryngelson
J. H. Holland
J. N. Onuchic
Jayanth R. Banavar
K. A. Dill
K. M. Plaxco
L. Pauling
N. D. Socci
N. E. G. Buchler
N. G. Hunt
N. Go
O. B. Ptitsyn
P. E. Leopold
P. G. Wolynes
R. Aurora
V. Muñoz
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2000

A fascinating and open question challenging biochemistry, physics and even geometry is the presence of highly regular motifs such as alpha-helices in the folded state of biopolymers and proteins. Stimulating explanations ranging from chemical propensity to simple geometrical reasoning have been invoked to rationalize the existence of such secondary structures. We formulate a dynamical variational principle for selection in conformation space based on the requirement that the backbone of the native state of biologically viable polymers be rapidly accessible from the denatured state. The variational principle is shown to result in the emergence of helical order in compact structures.

Comment: 4 pages, RevTex, 4 eps figures

arXiv.org e-Print Archive

Crossref

Sissa Digital Library

Archivio istituzionale della ricerca - Università di Padova