211,827 research outputs found

    Smooth markets: A basic mechanism for organizing gradient-based learners

    Get PDF
    With the success of modern machine learning, it is becoming increasingly important to understand and control how learning algorithms interact. Unfortunately, negative results from game theory show there is little hope of understanding or controlling general n-player games. We therefore introduce smooth markets (SM-games), a class of n-player games with pairwise zero sum interactions. SM-games codify a common design pattern in machine learning that includes (some) GANs, adversarial training, and other recent algorithms. We show that SM-games are amenable to analysis and optimization using first-order methods.Comment: 18 pages, 3 figure

    The Future: Machine learning

    Get PDF
    Learning is a process or activity of understanding and improving one’s then it may be human or machineability to perform a operations efficiently. Machine learning is one of sub-field of computer science, which enables computers to learn and analyze thinks without being or writing explicitly programmed. It basically evolved from AI(artificial intelligence) via pattern recognition and computational learning theory. It also explores the area of algorithms, which can make high end predictions on data. Currently it deployed in a wide range of computing tasks, where designing efficient algorithms and programs becomes rather difficult, such as email spam filtering, optical character recognition, search engine improvement etc.Advance machine learning will basically concentrate on modeling, generation, and prediction of multiple inter-dependent variables

    Automatic offensive language detection from Twitter data using machine learning and feature selection of metadata

    Get PDF
    The popularity of social networks has only increased in recent years. In theory, the use of social media was proposed so we could share our views online, keep in contact with loved ones or share good moments of life. However, the reality is not so perfect, so you have people sharing hate speech-related messages, or using it to bully specific individuals, for instance, or even creating robots where their only goal is to target specific situations or people. Identifying who wrote such text is not easy and there are several possible ways of doing it, such as using natural language processing or machine learning algorithms that can investigate and perform predictions using the metadata associated with it. In this work, we present an initial investigation of which are the best machine learning techniques to detect offensive language in tweets. After an analysis of the current trend in the literature about the recent text classification techniques, we have selected Linear SVM and Naive Bayes algorithms for our initial tests. For the preprocessing of data, we have used different techniques for attribute selection that will be justified in the literature section. After our experiments, we have obtained 92% of accuracy and 95% of recall to detect offensive language with Naive Bayes and 90% of accuracy and 92% of recall with Linear SVM. From our understanding, these results overcome our related literature and are a good indicative of the importance of the data description approach we have used

    Incorporating a Machine Learning Model into a Web-Based Administrative Decision Support Tool for Predicting Workplace Absenteeism

    Get PDF
    Productivity losses caused by absenteeism at work cost U.S. employers billions of dollars each year. In addition, employers typically spend a considerable amount of time managing employees who perform poorly. By using predictive analytics and machine learning algorithms, organizations can make better decisions, thereby increasing organizational productivity, reducing costs, and im-proving efficiency. Thus, in this paper we propose hybrid optimization methods in order to find the most parsimonious model for absenteeism classification. We utilized data from a Brazilian courier company. In order to categorize absenteeism classes, we preprocessed the data, selected the attributes via multiple methods, balanced the dataset using the synthetic minority over-sampling method, and then employed four methods of machine learning classification: Support Vector Machine (SVM), Multinomial Logistic Regression (MLR), Artificial Neural Network (ANN), and Random Forest (RF). We selected the best model based on several validation scores, and compared its performance against the existing model. Furthermore, project managers may lack experience in machine learning, or may not have the time to spend developing machine learning algorithms. Thus, we propose a web-based interactive tool supported by cognitive analytics management (CAM) theory. The web-based decision tool enables managers to make more informed decisions, and can be used without any prior knowledge of machine learning. Understanding absenteeism patterns can assist managers in revising policies or creating new arrangements to reduce absences in the workplace, financial losses, and the probability of economic insolvency

    Machine Learning As Tool And Theory For Computational Neuroscience

    Get PDF
    Computational neuroscience is in the midst of constructing a new framework for understanding the brain based on the ideas and methods of machine learning. This is effort has been encouraged, in part, by recent advances in neural network models. It is also driven by a recognition of the complexity of neural computation and the challenges that this poses for neuroscience’s methods. In this dissertation, I first work to describe these problems of complexity that have prompted a shift in focus. In particular, I develop machine learning tools for neurophysiology that help test whether tuning curves and other statistical models in fact capture the meaning of neural activity. Then, taking up a machine learning framework for understanding, I consider theories about how neural computation emerges from experience. Specifically, I develop hypotheses about the potential learning objectives of sensory plasticity, the potential learning algorithms in the brain, and finally the consequences for sensory representations of learning with such algorithms. These hypotheses pull from advances in several areas of machine learning, including optimization, representation learning, and deep learning theory. Each of these subfields has insights for neuroscience, offering up links for a chain of knowledge about how we learn and think. Together, this dissertation helps to further an understanding of the brain in the lens of machine learning

    Stability and Generalization of Stochastic Compositional Gradient Descent Algorithms

    Full text link
    Many machine learning tasks can be formulated as a stochastic compositional optimization (SCO) problem such as reinforcement learning, AUC maximization, and meta-learning, where the objective function involves a nested composition associated with an expectation. While a significant amount of studies has been devoted to studying the convergence behavior of SCO algorithms, there is little work on understanding their generalization, i.e., how these learning algorithms built from training examples would behave on future test examples. In this paper, we provide the stability and generalization analysis of stochastic compositional gradient descent algorithms through the lens of algorithmic stability in the framework of statistical learning theory. Firstly, we introduce a stability concept called compositional uniform stability and establish its quantitative relation with generalization for SCO problems. Then, we establish the compositional uniform stability results for two popular stochastic compositional gradient descent algorithms, namely SCGD and SCSC. Finally, we derive dimension-independent excess risk bounds for SCGD and SCSC by trade-offing their stability results and optimization errors. To the best of our knowledge, these are the first-ever-known results on stability and generalization analysis of stochastic compositional gradient descent algorithms

    Causal Inference in Healthcare: Approaches to Causal Modeling and Reasoning through Graphical Causal Models

    Get PDF
    In the era of big data, researchers have access to large healthcare datasets collected over a long period. These datasets hold valuable information, frequently investigated using traditional Machine Learning algorithms or Neural Networks. These algorithms perform great in finding patterns out of datasets (as a predictive machine); however, the models lack extensive interpretability to be used in the healthcare sector (as an explainable machine). Without exploring underlying causal relationships, the algorithms fail to explain their reasoning. Causal Inference, a relatively newer branch of Artificial Intelligence, deals with interpretability and portrays causal relationships in data through graphical models. It explores the issue of causality and works towards an explainability of underlying causal models deeply buried in data. For this dissertation work, the research goal is to use Causal Inference to build an applied framework that lets researchers leverage observational datasets in understanding causal relationships between features. To achieve that, we focus on specific objectives such as (a) the addition of background knowledge to causal structure learning algorithms, (b) the proposal of new causal inference methodologies, (c) generation of theories connecting causality to standard statistical analyses (e.g., Odds Ratio, Survival Analysis), and (d) application of proposed approaches in real-world healthcare problems. This dissertation encapsulates the tasks mentioned above, through various new methodologies and experiments under the rubric of Structural Theory of Causation. We discuss the common research theme in causal inference, historical development, the structural theory of causation, and underlying assumptions. Finally, we explore the impact of these proposed methodologies in real-world treatment controversy of Delirium patients, by examining the efficacy of antipsychotic drugs prescribed in treating Delirium in the ICU, from a curated observational healthcare dataset
    • …
    corecore