3,881 research outputs found

    A spectroscopy of texts for effective clustering

    Get PDF
    For many clustering algorithms, such as k-means, EM, and CLOPE, there is usually a requirement to set some parameters. Often, these parameters directly or indirectly control the number of clusters to return. In the presence of different data characteristics and analysis contexts, it is often difficult for the user to estimate the number of clusters in the data set. This is especially true in text collections such as Web documents, images or biological data. The fundamental question this paper addresses is: ldquoHow can we effectively estimate the natural number of clusters in a given text collection?rdquo. We propose to use spectral analysis, which analyzes the eigenvalues (not eigenvectors) of the collection, as the solution to the above. We first present the relationship between a text collection and its underlying spectra. We then show how the answer to this question enhances the clustering process. Finally, we conclude with empirical results and related work.<br /

    Data Deluge in Astrophysics: Photometric Redshifts as a Template Use Case

    Get PDF
    Astronomy has entered the big data era and Machine Learning based methods have found widespread use in a large variety of astronomical applications. This is demonstrated by the recent huge increase in the number of publications making use of this new approach. The usage of machine learning methods, however is still far from trivial and many problems still need to be solved. Using the evaluation of photometric redshifts as a case study, we outline the main problems and some ongoing efforts to solve them.Comment: 13 pages, 3 figures, Springer's Communications in Computer and Information Science (CCIS), Vol. 82

    Sterile neutrinos in cosmology

    Full text link
    Sterile neutrinos are natural extensions to the standard model of particle physics in neutrino mass generation mechanisms. If they are relatively light, less than approximately 10 keV, they can alter cosmology significantly, from the early Universe to the matter and radiation energy density today. Here, we review the cosmological role such light sterile neutrinos can play from the early Universe, including production of keV-scale sterile neutrinos as dark matter candidates, and dynamics of light eV-scale sterile neutrinos during the weakly-coupled active neutrino era. We review proposed signatures of light sterile neutrinos in cosmic microwave background and large scale structure data. We also discuss keV-scale sterile neutrino dark matter decay signatures in X-ray observations, including recent candidate \sim3.5 keV X-ray line detections consistent with the decay of a \sim7 keV sterile neutrino dark matter particle.Comment: Accepted version of an invited review for Physics Reports. 33 pages, 7 figures, approximately 16,000 words; v3: expanded discussion of low reheating temperature universe models with a new figure, large scale structure effects, scalar decay model

    The Data Big Bang and the Expanding Digital Universe: High-Dimensional, Complex and Massive Data Sets in an Inflationary Epoch

    Get PDF
    Recent and forthcoming advances in instrumentation, and giant new surveys, are creating astronomical data sets that are not amenable to the methods of analysis familiar to astronomers. Traditional methods are often inadequate not merely because of the size in bytes of the data sets, but also because of the complexity of modern data sets. Mathematical limitations of familiar algorithms and techniques in dealing with such data sets create a critical need for new paradigms for the representation, analysis and scientific visualization (as opposed to illustrative visualization) of heterogeneous, multiresolution data across application domains. Some of the problems presented by the new data sets have been addressed by other disciplines such as applied mathematics, statistics and machine learning and have been utilized by other sciences such as space-based geosciences. Unfortunately, valuable results pertaining to these problems are mostly to be found only in publications outside of astronomy. Here we offer brief overviews of a number of concepts, techniques and developments, some "old" and some new. These are generally unknown to most of the astronomical community, but are vital to the analysis and visualization of complex datasets and images. In order for astronomers to take advantage of the richness and complexity of the new era of data, and to be able to identify, adopt, and apply new solutions, the astronomical community needs a certain degree of awareness and understanding of the new concepts. One of the goals of this paper is to help bridge the gap between applied mathematics, artificial intelligence and computer science on the one side and astronomy on the other.Comment: 24 pages, 8 Figures, 1 Table. Accepted for publication: "Advances in Astronomy, special issue "Robotic Astronomy

    SciTech News Volume 71, No. 1 (2017)

    Get PDF
    Columns and Reports From the Editor 3 Division News Science-Technology Division 5 Chemistry Division 8 Engineering Division Aerospace Section of the Engineering Division 9 Architecture, Building Engineering, Construction and Design Section of the Engineering Division 11 Reviews Sci-Tech Book News Reviews 12 Advertisements IEEE

    Measuring cognitive load and cognition: metrics for technology-enhanced learning

    Get PDF
    This critical and reflective literature review examines international research published over the last decade to summarise the different kinds of measures that have been used to explore cognitive load and critiques the strengths and limitations of those focussed on the development of direct empirical approaches. Over the last 40 years, cognitive load theory has become established as one of the most successful and influential theoretical explanations of cognitive processing during learning. Despite this success, attempts to obtain direct objective measures of the theory's central theoretical construct – cognitive load – have proved elusive. This obstacle represents the most significant outstanding challenge for successfully embedding the theoretical and experimental work on cognitive load in empirical data from authentic learning situations. Progress to date on the theoretical and practical approaches to cognitive load are discussed along with the influences of individual differences on cognitive load in order to assess the prospects for the development and application of direct empirical measures of cognitive load especially in technology-rich contexts

    Color-Induced Displacement double stars in SDSS

    Full text link
    We report the first successful application of the astrometric color-induced displacement technique (CID, the displacement of the photocenter between different bandpasses due to a varying contribution of differently colored components to the total light), originally proposed by Wielen (1996) for discovering unresolved binary stars. Using the Sloan Digital Sky Survey (SDSS) Data Release 1 with 2.5 million stars brighter than 21m in the u and g bands, we select 419 candidate binary stars with CID greater than 0.5 arcsec. The SDSS colors of the majority of these candidates are consistent with binary systems including a white dwarf and any main sequence star with spectral type later than ~K7. The astrometric CID method discussed here is complementary to the photometric selection of binary stars in SDSS discussed by Smolcic et al. (2004), but there is considerable overlap (15%) between the two samples of selected candidates. This overlap testifies both to the physical soundness of both methods, as well as to the astrometric and photometric quality of SDSS data.Comment: submitted to A&A, 13 pages, 6 figure
    corecore