61 research outputs found

    Achieving non-discrimination in prediction

    Full text link
    Discrimination-aware classification is receiving an increasing attention in data science fields. The pre-process methods for constructing a discrimination-free classifier first remove discrimination from the training data, and then learn the classifier from the cleaned data. However, they lack a theoretical guarantee for the potential discrimination when the classifier is deployed for prediction. In this paper, we fill this gap by mathematically bounding the probability of the discrimination in prediction being within a given interval in terms of the training data and classifier. We adopt the causal model for modeling the data generation mechanism, and formally defining discrimination in population, in a dataset, and in prediction. We obtain two important theoretical results: (1) the discrimination in prediction can still exist even if the discrimination in the training data is completely removed; and (2) not all pre-process methods can ensure non-discrimination in prediction even though they can achieve non-discrimination in the modified training data. Based on the results, we develop a two-phase framework for constructing a discrimination-free classifier with a theoretical guarantee. The experiments demonstrate the theoretical results and show the effectiveness of our two-phase framework

    Achieving Causal Fairness in Machine Learning

    Get PDF
    Fairness is a social norm and a legal requirement in today\u27s society. Many laws and regulations (e.g., the Equal Credit Opportunity Act of 1974) have been established to prohibit discrimination and enforce fairness on several grounds, such as gender, age, sexual orientation, race, and religion, referred to as sensitive attributes. Nowadays machine learning algorithms are extensively applied to make important decisions in many real-world applications, e.g., employment, admission, and loans. Traditional machine learning algorithms aim to maximize predictive performance, e.g., accuracy. Consequently, certain groups may get unfairly treated when those algorithms are applied for decision-making. Therefore, it is an imperative task to develop fairness-aware machine learning algorithms such that the decisions made by them are not only accurate but also subject to fairness requirements. In the literature, machine learning researchers have proposed association-based fairness notions, e.g., statistical parity, disparate impact, equality of opportunity, etc., and developed respective discrimination mitigation approaches. However, these works did not consider that fairness should be treated as a causal relationship. Although it is well known that association does not imply causation, the gap between association and causation is not paid sufficient attention by the fairness researchers and stakeholders. The goal of this dissertation is to study fairness in machine learning, define appropriate fairness notions, and develop novel discrimination mitigation approaches from a causal perspective. Based on Pearl\u27s structural causal model, we propose to formulate discrimination as causal effects of the sensitive attribute on the decision. We consider different types of causal effects to cope with different situations, including the path-specific effect for direct/indirect discrimination, the counterfactual effect for group/individual discrimination, and the path-specific counterfactual effect for general cases. In the attempt to measure discrimination, the unidentifiable situations pose an inevitable barrier to the accurate causal inference. To address this challenge, we propose novel bounding methods to accurately estimate the strength of unidentifiable fairness notions, including path-specific fairness, counterfactual fairness, and path-specific counterfactual fairness. Based on the estimation of fairness, we develop novel and efficient algorithms for learning fair classification models. Besides classification, we also investigate the discrimination issues in other machine learning scenarios, such as ranked data analysis

    Learning Causally Disentangled Representations via the Principle of Independent Causal Mechanisms

    Full text link
    Learning disentangled causal representations is a challenging problem that has gained significant attention recently due to its implications for extracting meaningful information for downstream tasks. In this work, we define a new notion of causal disentanglement from the perspective of independent causal mechanisms. We propose ICM-VAE, a framework for learning causally disentangled representations supervised by causally related observed labels. We model causal mechanisms using learnable flow-based diffeomorphic functions to map noise variables to latent causal variables. Further, to promote the disentanglement of causal factors, we propose a causal disentanglement prior that utilizes the known causal structure to encourage learning a causally factorized distribution in the latent space. Under relatively mild conditions, we provide theoretical results showing the identifiability of causal factors and mechanisms up to permutation and elementwise reparameterization. We empirically demonstrate that our framework induces highly disentangled causal factors, improves interventional robustness, and is compatible with counterfactual generation

    An Improved Modeling for Low-grade Organic Rankine Cycle Coupled with Optimization Design of Radial-inflow Turbine

    Get PDF
    This document is the Accepted Manuscript of the following article: Lijing Zhai, Guoqiang Xu, Jie Wen, Yongkai Quan, Jian Fu, Hongwei Wu, and Tingting Li, ‘An improved modeling for low-grade organic Rankine cycle coupled with optimization design of radial-inflow turbine’, Energy Conversion and Management, Vol. 153: 60-70, December 2017. Under embargo. Embargo end date: 10 October 2018. The final, published version is available online at DOI: https://doi.org/10.1016/j.enconman.2017.09.063. Published by Elsevier Ltd.Organic Rankine cycle (ORC) has been proven to be an effective and promising technology to convert low-grade heat energy into power, attracting rapidly growing interest in recent years. As the key component of the ORC system, turbine significantly influences the overall cycle performance and its efficiency also varies with different working fluids as well as in different operating conditions. However, turbine efficiency is generally assumed to be constant in the conventional cycle design. Aiming at this issue, this paper couples the ORC system design with the radial-inflow turbine design to investigate the thermodynamic performance of the ORC system and the aerodynamic characteristics of radial-inflow turbine simultaneously. The constrained genetic algorithm (GA) is used to optimize the radial-inflow turbine with attention to six design parameters, including degree of reaction, velocity ratio, loading coefficient, flow coefficient, ratio of wheel diameter, and rotational speed. The influence of heat source outlet temperature on the performance of the radial-inflow turbine and the ORC system with constant mass flow rate of the heat source and constant heat source inlet temperature is investigated for four kinds of working fluids. The net electrical powers achieved are from few tens kWs to one hundred kWs. The results show that the turbine efficiency decreases with increasing heat source outlet temperature and that the decreasing rate of turbine efficiency becomes faster in the high temperature region. The optimized turbine efficiency varies from 88.06% (using pentane at the outlet temperature of 105 ºC) to 91.01% (using R245fa at the outlet temperature of 80 ºC), which appears much higher compared to common values reported in the literature. Furthermore, the cycle efficiency increases monotonously with the growth of the heat source outlet temperature, whereas the net power output has the opposite trend. R123 achieves the maximum cycle efficiency of 12.21% at the heat source outlet temperature of 110 ºC. Based on the optimized results, the recommended ranges of the key design parameters for ORC radial-inflow turbine are presented as well.Peer reviewe

    Performance analysis of a new deep super-cooling two-stage organic Rankine cycle

    Get PDF
    This document is the Accepted Manuscript version of the following article: Y. Yuan, G. Xu, Y. Quan, H. Wu, G. Song, W. Gong, and X. Luo, ‘Performance analysis of a new deep super-cooling two-stage organic Rankine cycle’, Energy Conversion and Management, Vol. 148: 305-316, September 2017. The final, definitive version is available online at doi:https://doi.org/10.1016/j.enconman.2017.06.006. Published by Elsevier.In this article, a new deep super-cooling two-stage organic Rankine cycle (DTORC) is proposed and evaluated at high temperature waste heat recovery in order to increase the power output. A thermodynamic model of recuperative organic rankine cycle (ORC) is also established for the purpose of comparison. Furthermore, a new evaluation index, effective heat source utilization, is proposed to reflect the relationship among the heat source, power output and consumption of the waste heat carrier. A simulation model is formulated and analysed under a wide range of operating conditions with the heat resource temperature fixed at 300℃. Hexamethyldisiloxane (MM) and R245fa are used as the working fluid for DTORC, and MM for ORC. In the current work, the comparisons of heat source utilization, net thermal efficiency as well as the total surface area of the heat exchangers between DTORC and RC are discussed in detail. Results show that the DTORC performs better than ORC at high temperature waste heat recovery and it could increase the power output by 150%. Moreover, the maximum net thermal efficiency of DTORC can reach to 23.5% and increased by 30.5% compared with that using ORC, whereas the total surface areas of the heat exchangers are nearly the same.Peer reviewe

    SiDA: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts Models

    Full text link
    Mixture-of-Experts (MoE) has emerged as a favorable architecture in the era of large models due to its inherent advantage, i.e., enlarging model capacity without incurring notable computational overhead. Yet, the realization of such benefits often results in ineffective GPU memory utilization, as large portions of the model parameters remain dormant during inference. Moreover, the memory demands of large models consistently outpace the memory capacity of contemporary GPUs. Addressing this, we introduce SiDA (Sparsity-inspired Data-Aware), an efficient inference approach tailored for large MoE models. SiDA judiciously exploits both the system's main memory, which is now abundant and readily scalable, and GPU memory by capitalizing on the inherent sparsity on expert activation in MoE models. By adopting a data-aware perspective, SiDA achieves enhanced model efficiency with a neglectable performance drop. Specifically, SiDA attains a remarkable speedup in MoE inference with up to 3.93X throughput increasing, up to 75% latency reduction, and up to 80% GPU memory saving with down to 1% performance drop. This work paves the way for scalable and efficient deployment of large MoE models, even in memory-constrained systems

    Pressure-induced Superconductivity and Structure Phase Transition in SnAs-based Zintl Compound SrSn2As2

    Full text link
    Layered SnAs-based Zintl compounds exhibit a distinctive electronic structure, igniting extensive research efforts in areas of superconductivity, topological insulators and quantum magnetism. In this paper, we systematically investigate the crystal structures and electronic properties of the Zintl compound SrSn2As2 under high-pressure. At approximately 20.8 GPa, pressure-induced superconductivity is observed in SrSn2As2 with a characteristic dome-like evolution of Tc. Theoretical calculations together with high pressure synchrotron X-ray diffraction and Raman spectroscopy have identified that SrSn2As2 undergoes a structural transformation from a trigonal to a monoclinic structure. Beyond 28.3 GPa, the superconducting transition temperature is suppressed due to a reduction of the density of state at the Fermi level. The discovery of pressure-induced superconductivity, accompanied by structural transitions in SrSn2As2, greatly expands the physical properties of layered SnAs-based compounds and provides a new ground states upon compression.Comment: 15 pages, 6 figures. arXiv admin note: text overlap with arXiv:2307.1562
    • …
    corecore