633 research outputs found
CLUSTERED HIERARCHICAL ANOMALY AND OUTLIER DETECTION ALGORITHMS
Anomaly and outlier detection is a long-standing problem in machine learning. In some cases, anomaly detection is easy, such as when data are drawn from well-characterized distributions such as the Gaussian. However, when data occupy high-dimensional spaces, anomaly detection becomes more difficult. We present CLAM (Clustered Learning of Approximate Manifolds), a manifold mapping technique in any metric space. CLAM begins with a fast hierarchical clustering technique and then induces a graph from the cluster tree, based on overlapping clusters as selected using several geometric and topological features. Using these graphs, we implement CHAODA (Clustered Hierarchical Anomaly and Outlier Detection Algorithms), exploring various properties of the graphs and their constituent clusters to find outliers. CHAODA employs a form of transfer learning based on a training set of datasets, and applies this knowledge to a separate test set of datasets of different cardinalities, dimensionalities, and domains. On 24 publicly available datasets, we compare CHAODA (by measure of ROC AUC) to a variety of state-of-the-art unsupervised anomaly-detection algorithms. Six of the datasets are used for training. CHAODA outperforms other approaches on 16 of the remaining 18 datasets. CLAM and CHAODA scale to large, high-dimensional “big data” anomalydetection problems, and generalize across datasets and distance functions. Source code to CLAM and CHAODA are freely available on GitHub1
Separated flow
A brief overview of flow separation phenomena is provided. Langley has many active research programs in flow separation related areas. Three cases are presented which describe specific examples of flow separation research. In each example, a description of the fundamental fluid physics and the complexity of the flow field is presented along with a method of either reducing or controlling the extent of separation. The following examples are discussed: flow over a smooth surface with an adverse pressure gradient; flow over a surface with a geometric discontinuity; and flow with shock-boundary layer interactions. These results will show that improvements are being made in the understanding of flow separation and its control
CLAM-Accelerated K-Nearest Neighbors Entropy-Scaling Search of Large High-Dimensional Datasets via an Actualization of the Manifold Hypothesis
Many fields are experiencing a Big Data explosion, with data collection rates
outpacing the rate of computing performance improvements predicted by Moore's
Law.
Researchers are often interested in similarity search on such data.
We present CAKES (CLAM-Accelerated -NN Entropy Scaling Search), a novel
algorithm for -nearest-neighbor (-NN) search which leverages geometric
and topological properties inherent in large datasets.
CAKES assumes the manifold hypothesis and performs best when data occupy a
low dimensional manifold, even if the data occupy a very high dimensional
embedding space.
We demonstrate performance improvements ranging from hundreds to tens of
thousands of times faster when compared to state-of-the-art approaches such as
FAISS and HNSW, when benchmarked on 5 standard datasets.
Unlike locality-sensitive hashing approaches, CAKES can work with any
user-defined distance function.
When data occupy a metric space, CAKES exhibits perfect recall.Comment: As submitted to IEEE Big Data 202
The Kepler Follow-up Observation Program
The Kepler Mission was launched on March 6, 2009 to perform a photometric
survey of more than 100,000 dwarf stars to search for terrestrial-size planets
with the transit technique. Follow-up observations of planetary candidates
identified by detection of transit-like events are needed both for
identification of astrophysical phenomena that mimic planetary transits and for
characterization of the true planets and planetary systems found by Kepler. We
have developed techniques and protocols for detection of false planetary
transits and are currently conducting observations on 177 Kepler targets that
have been selected for follow-up. A preliminary estimate indicates that between
24% and 62% of planetary candidates selected for follow-up will turn out to be
true planets.Comment: 12 pages, submitted to the Astrophysical Journal Letter
Production of Magnetic Arsenic–Phosphorus Alloy Nanoribbons with Small Band Gaps and High Hole Conductivities
Quasi-1D nanoribbons provide a unique route to diversifying the properties of their parent 2D nanomaterial, introducing lateral quantum confinement and an abundance of edge sites. Here, a new family of nanomaterials is opened with the creation of arsenic–phosphorus alloy nanoribbons (AsPNRs). By ionically etching the layered crystal black arsenic–phosphorus using lithium electride followed by dissolution in amidic solvents, solutions of AsPNRs are formed. The ribbons are typically few-layered, several micrometers long with widths tens of nanometers across, and both highly flexible and crystalline. The AsPNRs are highly electrically conducting above 130 K due to their small band gap (ca. 0.035 eV), paramagnetic in nature, and have high hole mobilities, as measured with the first generation of AsP devices, directly highlighting their properties and utility in electronic devices such as near-infrared detectors, quantum computing, and charge carrier layers in solar cells
KOI-54: The Kepler Discovery of Tidally Excited Pulsations and Brightenings in a Highly Eccentric Binary
Kepler observations of the star HD 187091 (KIC 8112039, hereafter KOI-54) revealed a remarkable light curve exhibiting sharp periodic brightening events every 41.8 days with a superimposed set of oscillations forming a beating pattern in phase with the brightenings. Spectroscopic observations revealed that this is a binary star with a highly eccentric orbit, e = 0.83. We are able to match the Kepler light curve and radial velocities with a nearly face-on (i = 5 degrees.5) binary star model in which the brightening events are caused by tidal distortion and irradiation of nearly identical A stars during their close periastron passage. The two dominant oscillations in the light curve, responsible for the beating pattern, have frequencies that are the 91st and 90th harmonic of the orbital frequency. The power spectrum of the light curve, after removing the binary star brightening component, reveals a large number of pulsations, 30 of which have a signal-to-noise ratio greater than or similar to 7. Nearly all of these pulsations have frequencies that are either integer multiples of the orbital frequency or are tidally split multiples of the orbital frequency. This pattern of frequencies unambiguously establishes the pulsations as resonances between the dynamic tides at periastron and the free oscillation modes of one or both of the stars. KOI-54 is only the fourth star to show such a phenomenon and is by far the richest in terms of excited modes.NASA, Science Mission DirectorateNASA NNX08AR14GEuropean Research Council under the European Community 227224W.M. Keck FoundationMcDonald Observator
A First Comparison of Kepler Planet Candidates in Single and Multiple Systems
In this letter we present an overview of the rich population of systems with
multiple candidate transiting planets found in the first four months of Kepler
data. The census of multiples includes 115 targets that show 2 candidate
planets, 45 with 3, 8 with 4, and 1 each with 5 and 6, for a total of 170
systems with 408 candidates. When compared to the 827 systems with only one
candidate, the multiples account for 17 percent of the total number of systems,
and a third of all the planet candidates. We compare the characteristics of
candidates found in multiples with those found in singles. False positives due
to eclipsing binaries are much less common for the multiples, as expected.
Singles and multiples are both dominated by planets smaller than Neptune; 69
+2/-3 percent for singles and 86 +2/-5 percent for multiples. This result, that
systems with multiple transiting planets are less likely to include a
transiting giant planet, suggests that close-in giant planets tend to disrupt
the orbital inclinations of small planets in flat systems, or maybe even to
prevent the formation of such systems in the first place.Comment: 13 pages, 13 figures, submitted to ApJ Letter
Kepler-20: A Sun-like Star with Three Sub-Neptune Exoplanets and Two Earth-size Candidates
We present the discovery of the Kepler-20 planetary system, which we
initially identified through the detection of five distinct periodic transit
signals in the Kepler light curve of the host star 2MASSJ19104752+4220194. We
find a stellar effective temperature Teff=5455+-100K, a metallicity of
[Fe/H]=0.01+-0.04, and a surface gravity of log(g)=4.4+-0.1. Combined with an
estimate of the stellar density from the transit light curves we deduce a
stellar mass of Mstar=0.912+-0.034 Msun and a stellar radius of
Rstar=0.944^{+0.060}_{-0.095} Rsun. For three of the transit signals, our
results strongly disfavor the possibility that these result from astrophysical
false positives. We conclude that the planetary scenario is more likely than
that of an astrophysical false positive by a factor of 2e5 (Kepler-20b), 1e5
(Kepler-20c), and 1.1e3 (Kepler-20d), sufficient to validate these objects as
planetary companions. For Kepler-20c and Kepler-20d, the blend scenario is
independently disfavored by the achromaticity of the transit: From Spitzer data
gathered at 4.5um, we infer a ratio of the planetary to stellar radii of
0.075+-0.015 (Kepler-20c) and 0.065+-0.011 (Kepler-20d), consistent with each
of the depths measured in the Kepler optical bandpass. We determine the orbital
periods and physical radii of the three confirmed planets to be 3.70d and
1.91^{+0.12}_{-0.21} Rearth for Kepler-20b, 10.85 d and 3.07^{+0.20}_{-0.31}
Rearth for Kepelr-20c, and 77.61 d and 2.75^{+0.17}_{-0.30} Rearth for
Kepler-20d. From multi-epoch radial velocities, we determine the masses of
Kepler-20b and Kepler-20c to be 8.7\+-2.2 Mearth and 16.1+-3.5 Mearth,
respectively, and we place an upper limit on the mass of Kepler-20d of 20.1
Mearth (2 sigma).Comment: accepted by ApJ, 58 pages, 12 figures revised Jan 2012 to correct
table 2 and clarify planet parameter extractio
Modeling Kepler transit light curves as false positives: Rejection of blend scenarios for Kepler-9, and validation of Kepler-9d, a super-Earth-size planet in a multiple system
Light curves from the Kepler Mission contain valuable information on the
nature of the phenomena producing the transit-like signals. To assist in
exploring the possibility that they are due to an astrophysical false positive,
we describe a procedure (BLENDER) to model the photometry in terms of a "blend"
rather than a planet orbiting a star. A blend may consist of a background or
foreground eclipsing binary (or star-planet pair) whose eclipses are attenuated
by the light of the candidate and possibly other stars within the photometric
aperture. We apply BLENDER to the case of Kepler-9, a target harboring two
previously confirmed Saturn-size planets (Kepler-9b and Kepler-9c) showing
transit timing variations, and an additional shallower signal with a 1.59-day
period suggesting the presence of a super-Earth-size planet. Using BLENDER
together with constraints from other follow-up observations we are able to rule
out all blends for the two deeper signals, and provide independent validation
of their planetary nature. For the shallower signal we rule out a large
fraction of the false positives that might mimic the transits. The false alarm
rate for remaining blends depends in part (and inversely) on the unknown
frequency of small-size planets. Based on several realistic estimates of this
frequency we conclude with very high confidence that this small signal is due
to a super-Earth-size planet (Kepler-9d) in a multiple system, rather than a
false positive. The radius is determined to be 1.64 (+0.19/-0.14) R(Earth), and
current spectroscopic observations are as yet insufficient to establish its
mass.Comment: 20 pages in emulateapj format, including 8 tables and 16 figures. To
appear in ApJ, 1 January 2010. Accepted versio
- …