198 research outputs found
An Evaluation of Potential Compute Platforms for Picosatellites
What compute platform should picosatellites use? CubeSats, classified as nanosatellites, are transitioning from microcontrollers that cannot run modern operating systems and modern programming environments to Linux-capable compute platforms. As electronics continue to shrink, picosatellite missions are likely to become more common, perhaps using the PocketQube standard. This paper characterizes the requirements that compute platforms for picosatellites should satisfy and analyzes in detail 4 potential platforms. We show that suitable hardware does exist, but that it is not yet supported well enough to allow small teams to use it in satellites or other specialized sensor nodes
Unsupervised Acoustic Scene Mapping Based on Acoustic Features and Dimensionality Reduction
Classical methods for acoustic scene mapping require the estimation of time
difference of arrival (TDOA) between microphones. Unfortunately, TDOA
estimation is very sensitive to reverberation and additive noise. We introduce
an unsupervised data-driven approach that exploits the natural structure of the
data. Our method builds upon local conformal autoencoders (LOCA) - an offline
deep learning scheme for learning standardized data coordinates from
measurements. Our experimental setup includes a microphone array that measures
the transmitted sound source at multiple locations across the acoustic
enclosure. We demonstrate that LOCA learns a representation that is isometric
to the spatial locations of the microphones. The performance of our method is
evaluated using a series of realistic simulations and compared with other
dimensionality-reduction schemes. We further assess the influence of
reverberation on the results of LOCA and show that it demonstrates considerable
robustness
Composable Sketches for Functions of Frequencies: Beyond the Worst Case
Recently there has been increased interest in using machine learning
techniques to improve classical algorithms. In this paper we study when it is
possible to construct compact, composable sketches for weighted sampling and
statistics estimation according to functions of data frequencies. Such
structures are now central components of large-scale data analytics and machine
learning pipelines. However, many common functions, such as thresholds and p-th
frequency moments with p > 2, are known to require polynomial-size sketches in
the worst case. We explore performance beyond the worst case under two
different types of assumptions. The first is having access to noisy advice on
item frequencies. This continues the line of work of Hsu et al. (ICLR 2019),
who assume predictions are provided by a machine learning model. The second is
providing guaranteed performance on a restricted class of input frequency
distributions that are better aligned with what is observed in practice. This
extends the work on heavy hitters under Zipfian distributions in a seminal
paper of Charikar et al. (ICALP 2002). Surprisingly, we show analytically and
empirically that "in practice" small polylogarithmic-size sketches provide
accuracy for "hard" functions.Comment: Full version of a paper from ICML 2020. Python implementation
available as part of the supplemental material accompanying the ICML
publicatio
Graded Embeddings of Finite Dimensional Simple Graded Algebras
Let A,B be finite dimensional G-graded algebras over an algebraically closed
field K with char(K)=0, where G is an abelian group, and let Id_G(A) be the set
of graded identities of A (res. Id_G(B)). We show that if A,B are G-simple then
there is a graded embedding of A in B iff Id_G(B) is contained in Id_G(A). We
also give a weaker generalization for the case where A is G-semisimple and B is
arbitrary.Comment: 25 page
FastML: a web server for probabilistic reconstruction of ancestral sequences
Ancestral sequence reconstruction is essential to a variety of evolutionary studies. Here, we present the FastML web server, a user-friendly tool for the reconstruction of ancestral sequences. FastML implements various novel features that differentiate it from existing tools: (i) FastML uses an indel-coding method, in which each gap, possibly spanning multiples sites, is coded as binary data. FastML then reconstructs ancestral indel states assuming a continuous time Markov process. FastML provides the most likely ancestral sequences, integrating both indels and characters; (ii) FastML accounts for uncertainty in ancestral states: it provides not only the posterior probabilities for each character and indel at each sequence position, but also a sample of ancestral sequences from this posterior distribution, and a list of the k-most likely ancestral sequences; (iii) FastML implements a large array of evolutionary models, which makes it generic and applicable for nucleotide, protein and codon sequences; and (iv) a graphical representation of the results is provided, including, for example, a graphical logo of the inferred ancestral sequences. The utility of FastML is demonstrated by reconstructing ancestral sequences of the Env protein from various HIV-1 subtypes. FastML is freely available for all academic users and is available online at http://fastml.tau.ac.i
- …