300 research outputs found

    Dynamic Key-Value Memory Networks for Knowledge Tracing

    Full text link
    Knowledge Tracing (KT) is a task of tracing evolving knowledge state of students with respect to one or more concepts as they engage in a sequence of learning activities. One important purpose of KT is to personalize the practice sequence to help students learn knowledge concepts efficiently. However, existing methods such as Bayesian Knowledge Tracing and Deep Knowledge Tracing either model knowledge state for each predefined concept separately or fail to pinpoint exactly which concepts a student is good at or unfamiliar with. To solve these problems, this work introduces a new model called Dynamic Key-Value Memory Networks (DKVMN) that can exploit the relationships between underlying concepts and directly output a student's mastery level of each concept. Unlike standard memory-augmented neural networks that facilitate a single memory matrix or two static memory matrices, our model has one static matrix called key, which stores the knowledge concepts and the other dynamic matrix called value, which stores and updates the mastery levels of corresponding concepts. Experiments show that our model consistently outperforms the state-of-the-art model in a range of KT datasets. Moreover, the DKVMN model can automatically discover underlying concepts of exercises typically performed by human annotations and depict the changing knowledge state of a student.Comment: To appear in 26th International Conference on World Wide Web (WWW), 201

    Non-equilibrium relaxation of hot states in organic semiconductors: Impact of mode-selective excitation on charge transfer.

    Get PDF
    The theoretical study of open quantum systems strongly coupled to a vibrational environment remains computationally challenging due to the strongly non-Markovian characteristics of the dynamics. We study this problem in the case of a molecular dimer of the organic semiconductor tetracene, the exciton states of which are strongly coupled to a few hundreds of molecular vibrations. To do so, we employ a previously developed tensor network approach, based on the formalism of matrix product states. By analyzing the entanglement structure of the system wavefunction, we can expand it in a tree tensor network state, which allows us to perform a fully quantum mechanical time evolution of the exciton-vibrational system, including the effect of 156 molecular vibrations. We simulate the dynamics of hot states, i.e., states resulting from excess energy photoexcitation, by constructing various initial bath states, and show that the exciton system indeed has a memory of those initial configurations. In particular, the specific pathway of vibrational relaxation is shown to strongly affect the quantum coherence between exciton states in time scales relevant for the ultrafast dynamics of application-relevant processes such as charge transfer. The preferential excitation of low-frequency modes leads to a limited number of relaxation pathways, thus "protecting" quantum coherence and leading to a significant increase in the charge transfer yield in the dimer structure.A.M.A. acknowledges the support of the Engineering and Physical Sciences Research Council (EPSRC) for funding under Grant No. EP/L015552/1

    Application of Machine Learning Techniques to Parameter Selection for Flight Risk Identification

    Get PDF
    In recent years, the use of data mining and machine learning techniques for safety analysis, incident and accident investigation, and fault detection has gained traction among the aviation community. Flight data collected from recording devices contains a large number of heterogeneous parameters, sometimes reaching up to thousands on modern commercial aircraft. More data is being collected continuously which adds to the ever-increasing pool of data available for safety analysis. However, among the data collected, not all parameters are important from a risk and safety analysis perspective. Similarly, in order to be useful for modern analysis techniques such as machine learning, using thousands of parameters collected at a high frequency might not be computationally tractable. As such, an intelligent and repeatable methodology to select a reduced set of significant parameters is required to allow safety analysts to focus on the right parameters for risk identification. In this paper, a step-by-step methodology is proposed to down-select a reduced set of parameters that can be used for safety analysis. First, correlation analysis is conducted to remove highly correlated, duplicate, or redundant parameters from the data set. Second, a pre-processing step removes metadata and empty parameters. This step also considers requirements imposed by regulatory bodies such as the Federal Aviation Administration and subject matter experts to further trim the list of parameters. Third, a clustering algorithm is used to group similar flights and identify abnormal operations and anomalies. A retrospective analysis is conducted on the clusters to identify their characteristics and impact on flight safety. Finally, analysis of variance techniques are used to identify which parameters were significant in the formation of the clusters. Visualization dashboards were created to analyze the cluster characteristics and parameter significance. This methodology is employed on data from the approach phase of a representative single-aisle aircraft to demonstrate its application and robustness across heterogeneous data sets. It is envisioned that this methodology can be further extended to other phases of flight and aircraft

    IdeaHound: Improving Large-scale Collaborative Ideation with Crowd-powered Real-time Semantic Modeling

    Get PDF
    Prior work on creativity support tools demonstrates how a computational semantic model of a solution space can enable interventions that substantially improve the number, quality and diversity of ideas. However, automated semantic modeling often falls short when people contribute short text snippets or sketches. Innovation platforms can employ humans to provide semantic judgments to construct a semantic model, but this relies on external workers completing a large number of tedious micro tasks. This requirement threatens both accuracy (external workers may lack expertise and context to make accurate semantic judgments) and scalability (external workers are costly). In this paper, we introduce IDEAHOUND, an ideation system that seamlessly integrates the task of defining semantic relationships among ideas into the primary task of idea generation. The system combines implicit human actions with machine learning to create a computational semantic model of the emerging solution space. The integrated nature of these judgments allows IDEAHOUND to leverage the expertise and efforts of participants who are already motivated to contribute to idea generation, overcoming the issues of scalability inherent to existing approaches. Our results show that participants were equally willing to use (and just as productive using) IDEAHOUND compared to a conventional platform that did not require organizing ideas. Our integrated crowdsourcing approach also creates a more accurate semantic model than an existing crowdsourced approach (performed by external crowds). We demonstrate how this model enables helpful creative interventions: providing diverse inspirational examples, providing similar ideas for a given idea and providing a visual overview of the solution space.Engineering and Applied Science

    Spectral Graph Analysis for Process Monitoring

    Get PDF
    Process monitoring is a fundamental task to support operator decisions under ab- normal situations. Most process monitoring approaches, such as Principal Components Analysis and Locality Preserving Projections, are based on dimensionality reduction. In this paper Spectral Graph Analysis Monitoring (SGAM) is introduced. SGAM is a new process monitoring technique that does not require dimensionality reduction techniques. The approach it is based on the spectral graph analysis theory. Firstly, a weighted graph representation of process measurements is developed. Secondly, the process behavior is parameterized by means of graph spectral features, in particular the graph algebraic connectivity and the graph spectral energy. The developed methodology has been illustrated in autocorrelated and non-linear synthetic cases, and applied to the well known Tennessee Eastman process benchmark with promising results.Fil: Musulin, Estanislao. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y Sistemas; Argentin

    Kepler eclipsing binary stars. VII. the catalogue of eclipsing binaries found in the entire Kepler data set

    Get PDF
    The primary Kepler Mission provided nearly continuous monitoring of ~200,000 objects with unprecedented photometric precision. We present the final catalog of eclipsing binary systems within the 105 deg2 Kepler field of view. This release incorporates the full extent of the data from the primary mission (Q0-Q17 Data Release). As a result, new systems have been added, additional false positives have been removed, ephemerides and principal parameters have been recomputed, classifications have been revised to rely on analytical models, and eclipse timing variations have been computed for each system. We identify several classes of systems including those that exhibit tertiary eclipse events, systems that show clear evidence of additional bodies, heartbeat systems, systems with changing eclipse depths, and systems exhibiting only one eclipse event over the duration of the mission. We have updated the period and galactic latitude distribution diagrams and included a catalog completeness evaluation. The total number of identified eclipsing and ellipsoidal binary systems in the Kepler field of view has increased to 2878, 1.3% of all observed Kepler targets

    On the relevance of preprocessing in predictive maintenance for dynamic systems

    Get PDF
    The complexity involved in the process of real-time data-driven monitoring dynamic systems for predicted maintenance is usually huge. With more or less in-depth any data-driven approach is sensitive to data preprocessing, understood as any data treatment prior to the application of the monitoring model, being sometimes crucial for the final development of the employed monitoring technique. The aim of this work is to quantify the sensitiveness of data-driven predictive maintenance models in dynamic systems in an exhaustive way. We consider a couple of predictive maintenance scenarios, each of them defined by some public available data. For each scenario, we consider its properties and apply several techniques for each of the successive preprocessing steps, e.g. data cleaning, missing values treatment, outlier detection, feature selection, or imbalance compensation. The pretreatment configurations, i.e. sequential combinations of techniques from different preprocessing steps, are considered together with different monitoring approaches, in order to determine the relevance of data preprocessing for predictive maintenance in dynamical systems

    Resolving early mesoderm diversification through single-cell expression profiling.

    Get PDF
    In mammals, specification of the three major germ layers occurs during gastrulation, when cells ingressing through the primitive streak differentiate into the precursor cells of major organ systems. However, the molecular mechanisms underlying this process remain unclear, as numbers of gastrulating cells are very limited. In the mouse embryo at embryonic day 6.5, cells located at the junction between the extra-embryonic region and the epiblast on the posterior side of the embryo undergo an epithelial-to-mesenchymal transition and ingress through the primitive streak. Subsequently, cells migrate, either surrounding the prospective ectoderm contributing to the embryo proper, or into the extra-embryonic region to form the yolk sac, umbilical cord and placenta. Fate mapping has shown that mature tissues such as blood and heart originate from specific regions of the pre-gastrula epiblast, but the plasticity of cells within the embryo and the function of key cell-type-specific transcription factors remain unclear. Here we analyse 1,205 cells from the epiblast and nascent Flk1(+) mesoderm of gastrulating mouse embryos using single-cell RNA sequencing, representing the first transcriptome-wide in vivo view of early mesoderm formation during mammalian gastrulation. Additionally, using knockout mice, we study the function of Tal1, a key haematopoietic transcription factor, and demonstrate, contrary to previous studies performed using retrospective assays, that Tal1 knockout does not immediately bias precursor cells towards a cardiac fate.We thank M. de Bruijn, A. Martinez-Arias, J. Nichols and C. Mulas for discussion, the Cambridge Institute for Medical Research Flow Cytometry facility for their expertise in single-cell index sorting, and S. Lorenz from the Sanger Single Cell Genomics Core for supervising purification of Tal1−/− sequencing libraries. ChIP-seq reads were processed by R. Hannah. Research in the authors’ laboratories is supported by the Medical Research Council, Cancer Research UK, the Biotechnology and Biological Sciences Research Council, Bloodwise, the Leukemia and Lymphoma Society, and the Sanger-EBI Single Cell Centre, and by core support grants from the Wellcome Trust to the Cambridge Institute for Medical Research and Wellcome Trust - MRC Cambridge Stem Cell Institute and by core funding from Cancer Research UK and the European Molecular Biology Laboratory. Y.T. was supported by a fellowship from the Japan Society for the Promotion of Science. W.J. is a Wellcome Trust Clinical Research Fellow. A.S. is supported by the Sanger-EBI Single Cell Centre. This work was funded as part of Wellcome Trust Strategic Award 105031/D/14/Z ‘Tracing early mammalian lineage decisions by single-cell genomics’ awarded to W. Reik, S. Teichmann, J. Nichols, B. Simons, T. Voet, S. Srinivas, L. Vallier, B. Göttgens and J. Marioni.This is the author accepted manuscript. The final version is available from Nature Publishing Group via http://dx.doi.org/10.1038/nature1863
    corecore