120 research outputs found

    Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees

    Full text link
    Many problems, such as online ad display, can be formulated as online bipartite matching. The crucial challenge lies in the nature of sequentially-revealed online item information, based on which we make irreversible matching decisions at each step. While numerous expert online algorithms have been proposed with bounded worst-case competitive ratios, they may not offer satisfactory performance in average cases. On the other hand, reinforcement learning (RL) has been applied to improve the average performance, but it lacks robustness and can perform arbitrarily poorly. In this paper, we propose a novel RL-based approach to edge-weighted online bipartite matching with robustness guarantees (LOMAR), achieving both good average-case and worst-case performance. The key novelty of LOMAR is a new online switching operation which, based on a judicious condition to hedge against future uncertainties, decides whether to follow the expert's decision or the RL decision for each online item. We prove that for any ρ[0,1]\rho\in[0,1], LOMAR is ρ\rho-competitive against any given expert online algorithm. To improve the average performance, we train the RL policy by explicitly considering the online switching operation. Finally, we run empirical experiments to demonstrate the advantages of LOMAR compared to existing baselines. Our code is available at: https://github.com/Ren-Research/LOMARComment: Accepted by ICML 202

    Robust Learning for Smoothed Online Convex Optimization with Feedback Delay

    Full text link
    We study a challenging form of Smoothed Online Convex Optimization, a.k.a. SOCO, including multi-step nonlinear switching costs and feedback delay. We propose a novel machine learning (ML) augmented online algorithm, Robustness-Constrained Learning (RCL), which combines untrusted ML predictions with a trusted expert online algorithm via constrained projection to robustify the ML prediction. Specifically,we prove that RCL is able to guarantee(1+λ)(1+\lambda)-competitiveness against any given expert for anyλ>0\lambda>0, while also explicitly training the ML model in a robustification-aware manner to improve the average-case performance. Importantly,RCL is the first ML-augmented algorithm with a provable robustness guarantee in the case of multi-step switching cost and feedback delay.We demonstrate the improvement of RCL in both robustness and average performance using battery management for electrifying transportationas a case study.Comment: Accepted by NeurIPS 202

    Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions

    Full text link
    We study the tradeoff between consistency and robustness in the context of a single-trajectory time-varying Markov Decision Process (MDP) with untrusted machine-learned advice. Our work departs from the typical approach of treating advice as coming from black-box sources by instead considering a setting where additional information about how the advice is generated is available. We prove a first-of-its-kind consistency and robustness tradeoff given Q-value advice under a general MDP model that includes both continuous and discrete state/action spaces. Our results highlight that utilizing Q-value advice enables dynamic pursuit of the better of machine-learned advice and a robust baseline, thus result in near-optimal performance guarantees, which provably improves what can be obtained solely with black-box advice.Comment: 27 page

    Making AI Less "Thirsty": Uncovering and Addressing the Secret Water Footprint of AI Models

    Full text link
    The growing carbon footprint of artificial intelligence (AI) models, especially large ones such as GPT-3, has been undergoing public scrutiny. Unfortunately, however, the equally important and enormous water (withdrawal and consumption) footprint of AI models has remained under the radar. For example, training GPT-3 in Microsoft's state-of-the-art U.S. data centers can directly evaporate 700,000 liters of clean freshwater, but such information has been kept a secret. More critically, the global AI demand may be accountable for 4.2 -- 6.6 billion cubic meters of water withdrawal in 2027, which is more than the total annual water withdrawal of 4 -- 6 Denmark or half of the United Kingdom. This is very concerning, as freshwater scarcity has become one of the most pressing challenges shared by all of us in the wake of the rapidly growing population, depleting water resources, and aging water infrastructures. To respond to the global water challenges, AI models can, and also must, take social responsibility and lead by example by addressing their own water footprint. In this paper, we provide a principled methodology to estimate the water footprint of AI models, and also discuss the unique spatial-temporal diversities of AI models' runtime water efficiency. Finally, we highlight the necessity of holistically addressing water footprint along with carbon footprint to enable truly sustainable AI.Comment: New updates include discussion on water withdrawal and water consumption, scope definition for water, and new estimates of GPT-3's water footprint based on Microsoft's new WUE and PUE data. Source codes available at: https://github.com/Ren-Research/Making-AI-Less-Thirst

    Optimization of Forged 42CrMo4 Steel Piston Pin Hole Profile Using Finite Element Method

    Get PDF
    The fatigue failure of the piston pin hole is considered as a key factor affecting the service life of engines. In this work, the piston pin hole profile was designed as tapered shape following a power law. By combining finite element analysis and hydraulic pulsating fatigue tests, the pin hole profile was optimized. It has been found that the maximum contact pressure on the pin hole surface was reduced by 16,7% with appropriate increasing the radius enlarging rate of the piston pin hole, the maximum tensile stress of the piston pin seat was reduced by 13,1%, and the piston pin seat fatigue safety factor was increased by 41,4%, the piston pin hole fatigue safety factor was increased by 15,9%. The piston pin hole’s hydraulic pulsating fatigue test results were found to be consistent with the FEA results. It could be concluded that appropriate increasing the radius enlarging rate of the pin hole could significantly weaken the fatigue wear of the pin hole, further improving its fatigue resistance

    An improved local remeshing algorithm for moving boundary problems

    Get PDF
    © 2016 The Author(s). Three issues are tackled in this study to improve the robustness of local remeshing techniques. Firstly, the local remeshing region (hereafter referred to as ‘hole’) is initialized by removing low-quality elements and then continuously expanded until a certain element quality is reached after remeshing. The effect of the number of the expansion cycle on the hole size and element quality after remeshing is experimentally analyzed. Secondly, the grid sources for element size control are attached to moving bodies and will move along with their host bodies to ensure reasonable grid resolution inside the hole. Thirdly, the boundary recovery procedure of a Delaunay grid generation approach is enhanced by a new grid topology transformation technique (namely shell transformation) so that the new grid created inside the hole is therefore free of elements of extremely deformed/skewed shape, whilst also respecting the hole boundary. The proposed local remeshing algorithm has been integrated with an in-house unstructured grid-based simulation system for solving moving boundary problems. The robustness and accuracy of the developed local remeshing technique are successfully demonstrated via industry-scale applications for complex flow simulations

    Forces and Disease: Electrostatic force differences caused by mutations in kinesin motor domains can distinguish between disease-causing and non-disease-causing mutations

    Get PDF
    The ability to predict if a given mutation is disease-causing or not has enormous potential to impact human health. Typically, these predictions are made by assessing the effects of mutation on macromolecular stability and amino acid conservation. Here we report a novel feature: the electrostatic component of the force acting between a kinesin motor domain and tubulin. We demonstrate that changes in the electrostatic component of the binding force are able to discriminate between disease-causing and non-disease-causing mutations found in human kinesin motor domains using the receiver operating characteristic (ROC). Because diseases may originate from multiple effects not related to kinesin-microtubule binding, the prediction rate of 0.843 area under the ROC plot due to the change in magnitude of the electrostatic force alone is remarkable. These results reflect the dependence of kinesin’s function on motility along the microtubule, which suggests a precise balance of microtubule binding forces is required

    Eval-GCSC: A New Metric for Evaluating ChatGPT's Performance in Chinese Spelling Correction

    Full text link
    ChatGPT has demonstrated impressive performance in various downstream tasks. However, in the Chinese Spelling Correction (CSC) task, we observe a discrepancy: while ChatGPT performs well under human evaluation, it scores poorly according to traditional metrics. We believe this inconsistency arises because the traditional metrics are not well-suited for evaluating generative models. Their overly strict length and phonics constraints may lead to underestimating ChatGPT's correction capabilities. To better evaluate generative models in the CSC task, this paper proposes a new evaluation metric: Eval-GCSC. By incorporating word-level and semantic similarity judgments, it relaxes the stringent length and phonics constraints. Experimental results show that Eval-GCSC closely aligns with human evaluations. Under this metric, ChatGPT's performance is comparable to traditional token-level classification models (TCM), demonstrating its potential as a CSC tool. The source code and scripts can be accessed at https://github.com/ktlKTL/Eval-GCSC

    The Influence of Receiver Selection Strategy on Packet Success Probability in Ad Hoc Network

    Get PDF
    Considering the importance of the receiver (RX) selection strategy for the packet success probability (PSP) in ad hoc network, this paper probes into the PSPs with nearest RX selection strategy and farthest RX selection strategy and determines the number of hops with the two strategies. Next, the performance of the successful transmission probability (STP) and PSP were discussed through numerical simulation with the above mentioned two strategies. The simulation results show that the PSP is affected by the terminal density, the RX selection strategy, the packet length and the STP; the number of hops mainly depends on the terminal density, the RX selection strategy, the length between the source TX and the destination RX. Furthermore, the nearest RX selection strategy and the farthest RX selection strategy differ insignificantly in the packet transmission duration between source TX to destination RX at a small terminal density
    corecore