2,046 research outputs found

    Using machine learning techniques to automate sky survey catalog generation

    Get PDF
    We describe the application of machine classification techniques to the development of an automated tool for the reduction of a large scientific data set. The 2nd Palomar Observatory Sky Survey provides comprehensive photographic coverage of the northern celestial hemisphere. The photographic plates are being digitized into images containing on the order of 10(exp 7) galaxies and 10(exp 8) stars. Since the size of this data set precludes manual analysis and classification of objects, our approach is to develop a software system which integrates independently developed techniques for image processing and data classification. Image processing routines are applied to identify and measure features of sky objects. Selected features are used to determine the classification of each object. GID3* and O-BTree, two inductive learning techniques, are used to automatically learn classification decision trees from examples. We describe the techniques used, the details of our specific application, and the initial encouraging results which indicate that our approach is well-suited to the problem. The benefits of the approach are increased data reduction throughput, consistency of classification, and the automated derivation of classification rules that will form an objective, examinable basis for classifying sky objects. Furthermore, astronomers will be freed from the tedium of an intensely visual task to pursue more challenging analysis and interpretation problems given automatically cataloged data

    Suppression of quantum oscillations and the dependence on site energies in electronic excitation transfer in the Fenna-Matthews-Olson trimer

    Full text link
    Energy transfer in the photosynthetic complex of the Green Sulfur Bacteria known as the Fenna-Matthews-Olson (FMO) complex is studied theoretically taking all three subunits (monomers) of the FMO trimer and the recently found eighth bacteriochlorophyll (BChl) molecule into account. We find that in all considered cases there is very little transfer between the monomers. Since it is believed that the eighth BChl is located near the main light harvesting antenna we look at the differences in transfer between the situation when BChl 8 is initially excited and the usually considered case when BChl 1 or 6 is initially excited. We find strong differences in the transfer dynamics, both qualitatively and quantitatively. When the excited state dynamics is initialized at site eight of the FMO complex, we see a slow exponential-like decay of the excitation. This is in contrast to the oscillations and a relatively fast transfer that occurs when only seven sites or initialization at sites 1 and 6 is considered. Additionally we show that differences in the values of the electronic transition energies found in the literature lead to a large difference in the transfer dynamics

    Data-Mining a Large Digital Sky Survey: From the Challenges to the Scientific Results

    Get PDF
    The analysis and an efficient scientific exploration of the Digital Palomar Observatory Sky Survey (DPOSS) represents a major technical challenge. The input data set consists of 3 Terabytes of pixel information, and contains a few billion sources. We describe some of the specific scientific problems posed by the data, including searches for distant quasars and clusters of galaxies, and the data-mining techniques we are exploring in addressing them. Machine-assisted discovery methods may become essential for the analysis of such multi-Terabyte data sets. New and future approaches involve unsupervised classification and clustering analysis in the Giga-object data space, including various Bayesian techniques. In addition to the searches for known types of objects in this data base, these techniques may also offer the possibility of discovering previously unknown, rare types of astronomical objects.Comment: Invited paper, to appear in Applications of Digital Image Processing XX, ed. A. Tescher, Proc. S.P.I.E. vol. 3164, in press; 10 pages, a self-contained TeX file, and 3 separate postscript figure

    Mining gene expression data by interpreting principal components

    Get PDF
    BACKGROUND: There are many methods for analyzing microarray data that group together genes having similar patterns of expression over all conditions tested. However, in many instances the biologically important goal is to identify relatively small sets of genes that share coherent expression across only some conditions, rather than all or most conditions as required in traditional clustering; e.g. genes that are highly up-regulated and/or down-regulated similarly across only a subset of conditions. Equally important is the need to learn which conditions are the decisive ones in forming such gene sets of interest, and how they relate to diverse conditional covariates, such as disease diagnosis or prognosis. RESULTS: We present a method for automatically identifying such candidate sets of biologically relevant genes using a combination of principal components analysis and information theoretic metrics. To enable easy use of our methods, we have developed a data analysis package that facilitates visualization and subsequent data mining of the independent sources of significant variation present in gene microarray expression datasets (or in any other similarly structured high-dimensional dataset). We applied these tools to two public datasets, and highlight sets of genes most affected by specific subsets of conditions (e.g. tissues, treatments, samples, etc.). Statistically significant associations for highlighted gene sets were shown via global analysis for Gene Ontology term enrichment. Together with covariate associations, the tool provides a basis for building testable hypotheses about the biological or experimental causes of observed variation. CONCLUSION: We provide an unsupervised data mining technique for diverse microarray expression datasets that is distinct from major methods now in routine use. In test uses, this method, based on publicly available gene annotations, appears to identify numerous sets of biologically relevant genes. It has proven especially valuable in instances where there are many diverse conditions (10's to hundreds of different tissues or cell types), a situation in which many clustering and ordering algorithms become problematic. This approach also shows promise in other topic domains such as multi-spectral imaging datasets

    Impaired Mitochondrial Function and Insulin Resistance of Skeletal Muscle in Mitochondrial Diabetes

    Get PDF
    OBJECTIVE - Impaired muscular mitochondrial function is related to common insulin resistance in type 2 diabetes. Mitochondrial diseases frequently lead to diabetes, which is mostly attributed to defective beta-cell mitochondria and secretion. RESEARCH DESIGN AND METHODS - We assessed muscular mitochondrial function and lipid deposition in liver (hepatocellular lipids [HCLs]) and muscle (intramyocellular lipids [IMCLs]) using P-31/H-1 magnetic resonance spectroscopy and insulin sensitivity and endogenous glucose production (EGP) using hyperinsulinemic-euglycemic clamps combined with isotopic tracer dilution in one female patient suffering from MELAS(myopathy,encephalopathy, lactic acidosis, and stroke-like episodes) syndrome and in six control subjects. RESULTS - The MELAS patient showed impaired insulin sensitivity (4.3 vs. 8.6 +/- 0.5 mg . kg(-1) . min(-1)) and suppression of EGP (69 vs. 94 +/- 1%), and her baseline and insulin-stimulated ATP synthesis were reduced (7.3 and 8.9 vs. 10.6 +/- 1.0 and 12.8 +/- 1.3 mu mol . l(-1) . min(-1)) compared with those of the control subjects. HCLs and IMCLs were comparable between the MELAS patient and control subjects. CONCLUSIONS - Impairment of muscle mitochondrial fitness promotes insulin resistance and could thereby contribute to the development of diabetes in some patients with the MELAS syndrome

    Multilayer metamaterial absorbers inspired by perfectly matched layers

    Get PDF
    We derive periodic multilayer absorbers with effective uniaxial properties similar to perfectly matched layers (PML). This approximate representation of PML is based on the effective medium theory and we call it an effective medium PML (EM-PML). We compare the spatial reflection spectrum of the layered absorbers to that of a PML material and demonstrate that after neglecting gain and magnetic properties, the absorber remains functional. This opens a route to create electromagnetic absorbers for real and not only numerical applications and as an example we introduce a layered absorber for the wavelength of 88~μ\mum made of SiO2_2 and NaCl. We also show that similar cylindrical core-shell nanostructures derived from flat multilayers also exhibit very good absorptive and reflective properties despite the different geometry

    Procedural personas as critics for dungeon generation

    Get PDF
    This paper introduces a constrained optimization method which uses procedural personas to evaluate the playability and quality of evolved dungeon levels. Procedural personas represent archetypical player behaviors, and their controllers have been evolved to maximize a specific utility which drives their decisions. A “baseline” persona evaluates whether a level is playable by testing if it can survive in a worst-case scenario of the playthrough. On the other hand, a Monster Killer persona or a Treasure Collector persona evaluates playable levels based on how many monsters it can kill or how many treasures it can collect, respectively. Results show that the implemented two-population genetic algorithm discovers playable levels quickly and reliably, while the different personas affect the layout, difficulty level and tactical depth of the generated dungeons.The research was supported, in part, by the FP7 ICT project C2Learn (project no: 318480) and by the FP7 Marie Curie CIG project AutoGameDesign (project no: 630665).peer-reviewe
    corecore