2,046 research outputs found
Using machine learning techniques to automate sky survey catalog generation
We describe the application of machine classification techniques to the development of an automated tool for the reduction of a large scientific data set. The 2nd Palomar Observatory Sky Survey provides comprehensive photographic coverage of the northern celestial hemisphere. The photographic plates are being digitized into images containing on the order of 10(exp 7) galaxies and 10(exp 8) stars. Since the size of this data set precludes manual analysis and classification of objects, our approach is to develop a software system which integrates independently developed techniques for image processing and data classification. Image processing routines are applied to identify and measure features of sky objects. Selected features are used to determine the classification of each object. GID3* and O-BTree, two inductive learning techniques, are used to automatically learn classification decision trees from examples. We describe the techniques used, the details of our specific application, and the initial encouraging results which indicate that our approach is well-suited to the problem. The benefits of the approach are increased data reduction throughput, consistency of classification, and the automated derivation of classification rules that will form an objective, examinable basis for classifying sky objects. Furthermore, astronomers will be freed from the tedium of an intensely visual task to pursue more challenging analysis and interpretation problems given automatically cataloged data
Suppression of quantum oscillations and the dependence on site energies in electronic excitation transfer in the Fenna-Matthews-Olson trimer
Energy transfer in the photosynthetic complex of the Green Sulfur Bacteria
known as the Fenna-Matthews-Olson (FMO) complex is studied theoretically taking
all three subunits (monomers) of the FMO trimer and the recently found eighth
bacteriochlorophyll (BChl) molecule into account. We find that in all
considered cases there is very little transfer between the monomers. Since it
is believed that the eighth BChl is located near the main light harvesting
antenna we look at the differences in transfer between the situation when BChl
8 is initially excited and the usually considered case when BChl 1 or 6 is
initially excited. We find strong differences in the transfer dynamics, both
qualitatively and quantitatively. When the excited state dynamics is
initialized at site eight of the FMO complex, we see a slow exponential-like
decay of the excitation. This is in contrast to the oscillations and a
relatively fast transfer that occurs when only seven sites or initialization at
sites 1 and 6 is considered. Additionally we show that differences in the
values of the electronic transition energies found in the literature lead to a
large difference in the transfer dynamics
Data-Mining a Large Digital Sky Survey: From the Challenges to the Scientific Results
The analysis and an efficient scientific exploration of the Digital Palomar
Observatory Sky Survey (DPOSS) represents a major technical challenge. The
input data set consists of 3 Terabytes of pixel information, and contains a few
billion sources. We describe some of the specific scientific problems posed by
the data, including searches for distant quasars and clusters of galaxies, and
the data-mining techniques we are exploring in addressing them.
Machine-assisted discovery methods may become essential for the analysis of
such multi-Terabyte data sets. New and future approaches involve unsupervised
classification and clustering analysis in the Giga-object data space, including
various Bayesian techniques. In addition to the searches for known types of
objects in this data base, these techniques may also offer the possibility of
discovering previously unknown, rare types of astronomical objects.Comment: Invited paper, to appear in Applications of Digital Image Processing
XX, ed. A. Tescher, Proc. S.P.I.E. vol. 3164, in press; 10 pages, a
self-contained TeX file, and 3 separate postscript figure
The Effect of Elevated [CO2] on Growth and Photosynthesis of Two Eucalyptus Species Exposed to High Temperatures and Water Deficits
Mining gene expression data by interpreting principal components
BACKGROUND: There are many methods for analyzing microarray data that group together genes having similar patterns of expression over all conditions tested. However, in many instances the biologically important goal is to identify relatively small sets of genes that share coherent expression across only some conditions, rather than all or most conditions as required in traditional clustering; e.g. genes that are highly up-regulated and/or down-regulated similarly across only a subset of conditions. Equally important is the need to learn which conditions are the decisive ones in forming such gene sets of interest, and how they relate to diverse conditional covariates, such as disease diagnosis or prognosis. RESULTS: We present a method for automatically identifying such candidate sets of biologically relevant genes using a combination of principal components analysis and information theoretic metrics. To enable easy use of our methods, we have developed a data analysis package that facilitates visualization and subsequent data mining of the independent sources of significant variation present in gene microarray expression datasets (or in any other similarly structured high-dimensional dataset). We applied these tools to two public datasets, and highlight sets of genes most affected by specific subsets of conditions (e.g. tissues, treatments, samples, etc.). Statistically significant associations for highlighted gene sets were shown via global analysis for Gene Ontology term enrichment. Together with covariate associations, the tool provides a basis for building testable hypotheses about the biological or experimental causes of observed variation. CONCLUSION: We provide an unsupervised data mining technique for diverse microarray expression datasets that is distinct from major methods now in routine use. In test uses, this method, based on publicly available gene annotations, appears to identify numerous sets of biologically relevant genes. It has proven especially valuable in instances where there are many diverse conditions (10's to hundreds of different tissues or cell types), a situation in which many clustering and ordering algorithms become problematic. This approach also shows promise in other topic domains such as multi-spectral imaging datasets
Impaired Mitochondrial Function and Insulin Resistance of Skeletal Muscle in Mitochondrial Diabetes
OBJECTIVE - Impaired muscular mitochondrial function is related to common insulin resistance in type 2 diabetes. Mitochondrial diseases frequently lead to diabetes, which is mostly attributed to defective beta-cell mitochondria and secretion. RESEARCH DESIGN AND METHODS - We assessed muscular mitochondrial function and lipid deposition in liver (hepatocellular lipids [HCLs]) and muscle (intramyocellular lipids [IMCLs]) using P-31/H-1 magnetic resonance spectroscopy and insulin sensitivity and endogenous glucose production (EGP) using hyperinsulinemic-euglycemic clamps combined with isotopic tracer dilution in one female patient suffering from MELAS(myopathy,encephalopathy, lactic acidosis, and stroke-like episodes) syndrome and in six control subjects. RESULTS - The MELAS patient showed impaired insulin sensitivity (4.3 vs. 8.6 +/- 0.5 mg . kg(-1) . min(-1)) and suppression of EGP (69 vs. 94 +/- 1%), and her baseline and insulin-stimulated ATP synthesis were reduced (7.3 and 8.9 vs. 10.6 +/- 1.0 and 12.8 +/- 1.3 mu mol . l(-1) . min(-1)) compared with those of the control subjects. HCLs and IMCLs were comparable between the MELAS patient and control subjects. CONCLUSIONS - Impairment of muscle mitochondrial fitness promotes insulin resistance and could thereby contribute to the development of diabetes in some patients with the MELAS syndrome
Multilayer metamaterial absorbers inspired by perfectly matched layers
We derive periodic multilayer absorbers with effective uniaxial properties
similar to perfectly matched layers (PML). This approximate representation of
PML is based on the effective medium theory and we call it an effective medium
PML (EM-PML). We compare the spatial reflection spectrum of the layered
absorbers to that of a PML material and demonstrate that after neglecting gain
and magnetic properties, the absorber remains functional. This opens a route to
create electromagnetic absorbers for real and not only numerical applications
and as an example we introduce a layered absorber for the wavelength of
~m made of SiO and NaCl. We also show that similar cylindrical
core-shell nanostructures derived from flat multilayers also exhibit very good
absorptive and reflective properties despite the different geometry
Risk factors for development of symptoms after autologous transplantation for multiple myeloma
Author Correction:The impact of sleep, physical activity and sedentary behaviour on symptoms of depression and anxiety before and during the COVID-19 pandemic in a sample of South African participants (Scientific reports (2021) 11 1 (24059))
Procedural personas as critics for dungeon generation
This paper introduces a constrained optimization method which uses
procedural personas to evaluate the playability and quality of evolved dungeon
levels. Procedural personas represent archetypical player behaviors, and their
controllers have been evolved to maximize a specific utility which drives their
decisions. A “baseline” persona evaluates whether a level is playable by testing
if it can survive in a worst-case scenario of the playthrough. On the other hand, a
Monster Killer persona or a Treasure Collector persona evaluates playable levels
based on how many monsters it can kill or how many treasures it can collect, respectively.
Results show that the implemented two-population genetic algorithm
discovers playable levels quickly and reliably, while the different personas affect
the layout, difficulty level and tactical depth of the generated dungeons.The research was supported, in part, by the FP7 ICT project C2Learn (project no:
318480) and by the FP7 Marie Curie CIG project AutoGameDesign (project no: 630665).peer-reviewe
- …