61 research outputs found

    Consensual Resilient Control: Stateless Recovery of Stateful Controllers

    Get PDF
    Safety-critical systems have to absorb accidental and malicious faults to obtain high mean-times-to-failures (MTTFs). Traditionally, this is achieved through re-execution or replication. However, both techniques come with significant overheads, in particular when cold-start effects are considered. Such effects occur after replicas resume from checkpoints or from their initial state. This work aims at improving on the performance of control-task replication by leveraging an inherent stability of many plants to tolerate occasional control-task deadline misses and suggests masking faults just with a detection quorum. To make this possible, we have to eliminate cold-start effects to allow replicas to rejuvenate during each control cycle. We do so, by systematically turning stateful controllers into instants that can be recovered in a stateless manner. We highlight the mechanisms behind this transformation, how it achieves consensual resilient control, and demonstrate on the example of an inverted pendulum how accidental and maliciously-induced faults can be absorbed, even if control tasks run in less predictable environments

    Tropical Backpropagation

    Get PDF
    This work introduces tropicalization, a novel technique that delivers tropical neural networks as tropical limits of deep ReLU networks. Tropicalization transfers the initial weights from real numbers to those in the tropical semiring while maintain- ing the underlying graph of the network. After verifying that tropicalization will not affect the classification capacity of deep neural networks, this study introduces a tropical reformulation of backpropagation via tropical linear algebra. Tropical arithmetic replaces multiplication operations in the network with additions and addition operations with max, and therefore, theoretically, reduces the algorithmic complexity during the training and inference phase. We demonstrate the latter by simulating the tensor multiplication underlying the feed-forward process of state- of-the-art trained neural network architectures and compare the standard forward pass of the models with the tropical ones. Our benchmark results show that tropi- calization speeds up inference by 50 %. Hence, we conclude that tropicalization bears the potential to reduce the training times of large neural networks drastically

    Reading Aloud and First Language Development: A Systematic Review

    Get PDF
    Reading aloud appears to be an important lever for improving language acquisition and development in early childhood, and later in life it strengthens many sub-dimensions of language. However, the availability of numerous variations on reading training, shaped by different methodologies and different lengths of exposure make it difficult to determine the best approaches to follow. The aim of this review is to identify the available literature contributions that examine the association between mediated reading training, first language development and the acquisition of new vocabulary, including other components that could be improved by these interventions, such as cognitive function, emergent literacy and adult-child verbal interactions. The purpose is to compare research highlighting their fundamental characteristics, tools, duration and methodologies used in order to point out the effects that the practice of reading aloud produces on the acquisition and the enhancement of language, particularly in the age of language development. The analysis of the 51 articles included aims to identify the most effective reading strategies in terms of practices, timing and methods, able to produce the most significant gains in the language area

    Clinical predictive factors of pathologic complete response in locally advanced rectal cancer

    Get PDF
    Background: Predictive factors of pathologic complete response (pCR) after neoadjuvant chemoradiotherapy (nCRT) in locally advanced rectal cancer (LARC) are still not identified. The purpose of this study was to define them. Materials and Methods: Data from consecutive LARC patients treated between January 2008 and June 2014 at our Institution were included in the analysis. All patients were treated with a long course of nCRT. Demographics, initial diagnosis and tumor extension details, as well as treatment modalities characteristics were included in the univariate and logistic regression analysis. Results: In total 99 patients received nCRT, of whom 23 patients (23.2%) achieved pCR. Patients with and without pCR were similar in term of age, sex, comobidities, BMI and tumor characteristics. Multivariate logistic regression indicated that pre-treatment tumor size <= 5 cm was a significant predictor for pCR (p = 0.035), whereas clinical N stage only showed a positive trend (p = 0.084). Conclusions: Tumor size at diagnosis could be used to predict pCR, and thus to individualize therapy in LARC patients management. Validation in other studies is needed

    Consensual Resilient Control: Stateless Recovery of Stateful Controllers

    Get PDF
    peer reviewedSafety-critical systems have to absorb accidental and malicious faults to obtain high mean-times-to-failures (MTTFs). Traditionally, this is achieved through re-execution or replication. However, both techniques come with significant overheads, in particular when cold-start effects are considered. Such effects occur after replicas resume from checkpoints or from their initial state. This work aims at improving on the performance of control-task replication by leveraging an inherent stability of many plants to tolerate occasional control-task deadline misses and suggests masking faults just with a detection quorum. To make this possible, we have to eliminate cold-start effects to allow replicas to rejuvenate during each control cycle. We do so, by systematically turning stateful controllers into instants that can be recovered in a stateless manner. We highlight the mechanisms behind this transformation, how it achieves consensual resilient control, and demonstrate on the example of an inverted pendulum how accidental and maliciously-induced faults can be absorbed, even if control tasks run in less predictable environments

    Knowledge Transfer from High-Resource to Low-Resource Programming Languages for Code LLMs

    Full text link
    Over the past few years, Large Language Models of Code (Code LLMs) have started to have a significant impact on programming practice. Code LLMs are also emerging as a building block for research in programming languages and software engineering. However, the quality of code produced by a Code LLM varies significantly by programming languages. Code LLMs produce impressive results on programming languages that are well represented in their training data (e.g., Java, Python, or JavaScript), but struggle with low-resource languages, like OCaml and Racket. This paper presents an effective approach for boosting the performance of Code LLMs on low-resource languages using semi-synthetic data. Our approach generates high-quality datasets for low-resource languages, which can then be used to fine-tune any pretrained Code LLM. Our approach, called MultiPL-T, translates training data from high-resource languages into training data for low-resource languages. We apply our approach to generate tens of thousands of new, validated training items for Racket, OCaml, and Lua from Python. Moreover, we use an open dataset (The Stack) and model (StarCoderBase), which allow us to decontaminate benchmarks and train models on this data without violating the model license. With MultiPL-T generated data, we present fine-tuned versions of StarCoderBase that achieve state-of-the-art performance for Racket, OCaml, and Lua on benchmark problems. For Lua, our fine-tuned model achieves the same performance as StarCoderBase as Python -- a very high-resource language -- on the MultiPL-E benchmarks. For Racket and OCaml, we double their performance on MultiPL-E, bringing their performance close to higher-resource languages such as Ruby and C#

    The 75-Gram Glucose Load in Pregnancy

    Get PDF
    OBJECTIVE—To investigate, in pregnant women without gestational diabetes mellitus (GDM), the relation among obstetric/demographic characteristics; fasting, 1-h, and 2-h plasma glucose values resulting from a 75-g glucose load; and the risk of abnormal neonatal anthropometric features and then to verify the presence of a threshold glucose value for a 75-g glucose load above which there is an increased risk for abnormal neonatal anthropometric characteristics. RESEARCH DESIGN AND METHODS—The study group consisted of 829 Caucasian pregnant women with singleton pregnancy who had no history of pregestational diabetes or GDM, who were tested for GDM with a 75-g, 2-h glucose load, used as a glucose challenge test, in two periods of pregnancy (early, 16–20 weeks; late, 26–30 weeks), and who did not meet the criteria for a GDM diagnosis. In the newborns, the following abnormal anthropometric characteristics were considered as outcome measures: cranial/thoracic circumference (CC/TC) ratio ≤10th percentile for gestational age (GA), ponderal index (birth weight/length3 × 100) ≥90th percentile for GA, and macrosomia (birth weight ≥90th percentile for GA), on the basis of growth standard development for our population. For the first part of the objective, logistic regression models were used to identify 75-g glucose load values as well as obstetric and demographic variables as markers for abnormal neonatal anthropometric characteristics. For the second part, the receiver operating characteristic (ROC) curve was performed for the 75-g glucose load values to determine the plasma glucose threshold value that yielded the highest combined sensitivity and specificity for the prediction of abnormal neonatal anthropometric characteristics. RESULTS—In both early and late periods, maternal age &gt;35 years was a predictor of neonatal CC/TC ratio ≤10th percentile and macrosomia, with fasting 75-g glucose load values being independent predictors of neonatal CC/TC ratio ≤10th percentile. In both periods, 1-h values gave a strong association with all abnormal neonatal anthropometric characteristics chosen as outcome measures, with maternal age &gt;35 years being an independent predictor for macrosomia. The 2-h, 75-g glucose load values were significantly associated in both periods with neonatal CC/TC ratio ≤10th percentile and ponderal index ≥90th percentile, whereas maternal age &gt;35 years was an independent predictor of both neonatal CC/TC ratio ≤10th percentile and macrosomia. In the ROC curves for the prediction of neonatal CC/TC ratio ≤10th percentile for GA in both early and late periods of pregnancy, inflection points were identified for a 1-h, 75-g glucose load threshold value of 150 mg/dl in the early period and 160 mg/dl in the late period. CONCLUSIONS—This study documented a significant association, seen even in the early period of pregnancy, between 1-h, 75-g glucose load values and abnormal neonatal anthropometric features, and provided evidence of a threshold relation between 75-g glucose load results and clinical outcome. Our results would therefore suggest the possibility of using a 75-g, 1-h oral glucose load as a single test for the diagnosis of GDM, adopting a threshold value of 150 mg/dl at 16–20 weeks and 160 mg/dl at 26–30 weeks

    Federated Geometric Monte Carlo Clustering to Counter Non-IID Datasets

    Get PDF
    Federated learning allows clients to collaboratively train models on datasets that are acquired in different locations and that cannot be exchanged because of their size or regulations. Such collected data is increasingly non-independent and non- identically distributed (non-IID), negatively affecting training accuracy. Previous works tried to mitigate the effects of non- IID datasets on training accuracy, focusing mainly on non-IID labels, however practical datasets often also contain non-IID features. To address both non-IID labels and features, we propose FedGMCC1, a novel framework where a central server aggregates client models that it can cluster together. FedGMCC clustering relies on a Monte Carlo procedure that samples the output space of client models, infers their position in the weight space on a loss manifold and computes their geometric connection via an affine curve parametrization. FedGMCC aggregates connected models along their path connectivity to produce a richer global model, incorporating knowledge of all connected client models. FedGMCC outperforms FedAvg and FedProx in terms of convergence rates on the EMNIST62 and a genomic sequence classification datasets (by up to +63%). FedGMCC yields an improved accuracy (+4%) on the genomic dataset with respect to CFL, in high non-IID feature space settings and label incongruency
    • …
    corecore