39 research outputs found

    Stratified Rule-Aware Network for Abstract Visual Reasoning

    Full text link
    Abstract reasoning refers to the ability to analyze information, discover rules at an intangible level, and solve problems in innovative ways. Raven's Progressive Matrices (RPM) test is typically used to examine the capability of abstract reasoning. The subject is asked to identify the correct choice from the answer set to fill the missing panel at the bottom right of RPM (e.g., a 3×\times3 matrix), following the underlying rules inside the matrix. Recent studies, taking advantage of Convolutional Neural Networks (CNNs), have achieved encouraging progress to accomplish the RPM test. However, they partly ignore necessary inductive biases of RPM solver, such as order sensitivity within each row/column and incremental rule induction. To address this problem, in this paper we propose a Stratified Rule-Aware Network (SRAN) to generate the rule embeddings for two input sequences. Our SRAN learns multiple granularity rule embeddings at different levels, and incrementally integrates the stratified embedding flows through a gated fusion module. With the help of embeddings, a rule similarity metric is applied to guarantee that SRAN can not only be trained using a tuplet loss but also infer the best answer efficiently. We further point out the severe defects existing in the popular RAVEN dataset for RPM test, which prevent from the fair evaluation of the abstract reasoning ability. To fix the defects, we propose an answer set generation algorithm called Attribute Bisection Tree (ABT), forming an improved dataset named Impartial-RAVEN (I-RAVEN for short). Extensive experiments are conducted on both PGM and I-RAVEN datasets, showing that our SRAN outperforms the state-of-the-art models by a considerable margin.Comment: AAAI 2021 paper. Code: https://github.com/husheng12345/SRA

    Graduate employment prediction with bias

    Get PDF
    The failure of landing a job for college students could cause serious social consequences such as drunkenness and suicide. In addition to academic performance, unconscious biases can become one key obstacle for hunting jobs for graduating students. Thus, it is necessary to understand these unconscious biases so that we can help these students at an early stage with more personalized intervention. In this paper, we develop a framework, i.e., MAYA (Multi-mAjor emploYment stAtus) to predict students’ employment status while considering biases. The framework consists of four major components. Firstly, we solve the heterogeneity of student courses by embedding academic performance into a unified space. Then, we apply a generative adversarial network (GAN) to overcome the class imbalance problem. Thirdly, we adopt Long Short-Term Memory (LSTM) with a novel dropout mechanism to comprehensively capture sequential information among semesters. Finally, we design a bias-based regularization to capture the job market biases. We conduct extensive experiments on a large-scale educational dataset and the results demonstrate the effectiveness of our prediction framework. Copyright © 2020, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. **Please note that there are multiple authors for this article therefore only the name of the first 5 including Federation University Australia affiliate “Feng Xia” is provided in this record*

    Lossy and Lossless (L2^2) Post-training Model Size Compression

    Full text link
    Deep neural networks have delivered remarkable performance and have been widely used in various visual tasks. However, their huge size causes significant inconvenience for transmission and storage. Many previous studies have explored model size compression. However, these studies often approach various lossy and lossless compression methods in isolation, leading to challenges in achieving high compression ratios efficiently. This work proposes a post-training model size compression method that combines lossy and lossless compression in a unified way. We first propose a unified parametric weight transformation, which ensures different lossy compression methods can be performed jointly in a post-training manner. Then, a dedicated differentiable counter is introduced to guide the optimization of lossy compression to arrive at a more suitable point for later lossless compression. Additionally, our method can easily control a desired global compression ratio and allocate adaptive ratios for different layers. Finally, our method can achieve a stable 10×10\times compression ratio without sacrificing accuracy and a 20×20\times compression ratio with minor accuracy loss in a short time. Our code is available at https://github.com/ModelTC/L2_Compression

    Lost at starting line : predicting maladaptation of university freshmen based on educational big data

    Get PDF
    The transition from secondary education to higher education could be challenging for most freshmen. For students who fail to adjust to university life smoothly, their status may worsen if the university cannot offer timely and proper guidance. Helping students adapt to university life is a long-term goal for any academic institution. Therefore, understanding the nature of the maladaptation phenomenon and the early prediction of “at-risk” students are crucial tasks that urgently need to be tackled effectively. This article aims to analyze the relevant factors that affect the maladaptation phenomenon and predict this phenomenon in advance. We develop a prediction framework (MAladaptive STudEnt pRediction, MASTER) for the early prediction of students with maladaptation. First, our framework uses the SMOTE (Synthetic Minority Oversampling Technique) algorithm to solve the data label imbalance issue. Moreover, a novel ensemble algorithm, priority forest, is proposed for outputting ranks instead of binary results, which enables us to perform proactive interventions in a prioritized manner where limited education resources are available. Experimental results on real-world education datasets demonstrate that the MASTER framework outperforms other state-of-art methods. © 2022 The Authors. Journal of the Association for Information Science and Technology published by Wiley Periodicals LLC on behalf of Association for Information Science and Technology

    Graduate Employment Prediction with Bias

    Full text link
    The failure of landing a job for college students could cause serious social consequences such as drunkenness and suicide. In addition to academic performance, unconscious biases can become one key obstacle for hunting jobs for graduating students. Thus, it is necessary to understand these unconscious biases so that we can help these students at an early stage with more personalized intervention. In this paper, we develop a framework, i.e., MAYA (Multi-mAjor emploYment stAtus) to predict students' employment status while considering biases. The framework consists of four major components. Firstly, we solve the heterogeneity of student courses by embedding academic performance into a unified space. Then, we apply a generative adversarial network (GAN) to overcome the class imbalance problem. Thirdly, we adopt Long Short-Term Memory (LSTM) with a novel dropout mechanism to comprehensively capture sequential information among semesters. Finally, we design a bias-based regularization to capture the job market biases. We conduct extensive experiments on a large-scale educational dataset and the results demonstrate the effectiveness of our prediction framework

    Learning Agility and Adaptive Legged Locomotion via Curricular Hindsight Reinforcement Learning

    Full text link
    Agile and adaptive maneuvers such as fall recovery, high-speed turning, and sprinting in the wild are challenging for legged systems. We propose a Curricular Hindsight Reinforcement Learning (CHRL) that learns an end-to-end tracking controller that achieves powerful agility and adaptation for the legged robot. The two key components are (I) a novel automatic curriculum strategy on task difficulty and (ii) a Hindsight Experience Replay strategy adapted to legged locomotion tasks. We demonstrated successful agile and adaptive locomotion on a real quadruped robot that performed fall recovery autonomously, coherent trotting, sustained outdoor speeds up to 3.45 m/s, and tuning speeds up to 3.2 rad/s. This system produces adaptive behaviours responding to changing situations and unexpected disturbances on natural terrains like grass and dirt

    A General Review on Longwall Mining-Induced Fractures in Near-Face Regions

    No full text
    It is believed that underground longwall mining usually produces fractures in the surrounding rocks. On the one hand, mining-induced fractures not only degrade the strength of the rock mass but also serve as main channels for fluids (e.g., water and methane). Fractures facilitate the failure of the rock mass and fluid inrush into working spaces. Therefore, mining-induced fractures are significant for the safety evaluation of underground structures and finding feasible solutions. On the other hand, the fractures are also beneficial for methane collection and coal fragmentation, which are essential for the successful operation of longwall top coal caving mining. Therefore, determining the characteristics of induced fractures is significant for underground longwall mining. From a global perspective, longwall mining-induced fractures in the overburden have been well studied, which improves the understanding of the mining pressure and ground control. However, induced fractures near the longwall face, which have more significant effects on mining activities, have not been summarized. The goal of this review paper is to provide a general summary of the current achievements in characterizing mining-induced fractures in near-face regions. The characteristics of mining-induced fractures in the coal wall, chain pillar, immediate roofs and top coal, and floors are reviewed and summarized. Remarks are made on the current progress of, fundamental problems with, and developments in methodologies for characterizing mining-induced fractures using methods such as field observations, small-scale laboratory tests, physical modeling, and numerical modeling. Based on a comprehensive analysis, the advantages and disadvantages of each method are discussed, and the ideal conditions for applying each of these methods are also recommended

    Practice and prospect of fully mechanised mininig technology for thin coal seams under complex conditions in China

    Get PDF
    In China, thin coal seam are rich in resources and complex in conditions, however, the characteristics such as narrow mining space, the low level of mechanised technology, bad working environment and the high cost of mining, directly restrict the development of mining safety and high-efficiency. In thin coal seams with hard gangue which contains concretions of pyrite, LS-DYNA is applied to calculate the rational blasting parameters and carry out the deep-hole pre-splitting blasting technology, the hard gangue is fractured effectively, hence advancing the productivity of thin coal seam mining. In addition, the mining rate is sped up in thin protective layers in extreme close coal seams by enhancing the level of fully mechanised equipment and other effective measures. Safety and high-efficiency mining can be realised in the outburst coal seam. Thin coal seam mining technology faces many problems presently, i.e. the low level of equipment automation, the low advance rate of mixed coal-rock drift, and the big intensity of worker labour. By lowering the labour intensity, improving the efficiency by means of advancing mining automatic equipment and other measures, respectively, thus manless working faces can be successfully realised in thin coal seam mining
    corecore