11 research outputs found

    Self-Supervised Learning with an Information Maximization Criterion

    Full text link
    Self-supervised learning allows AI systems to learn effective representations from large amounts of data using tasks that do not require costly labeling. Mode collapse, i.e., the model producing identical representations for all inputs, is a central problem to many self-supervised learning approaches, making self-supervised tasks, such as matching distorted variants of the inputs, ineffective. In this article, we argue that a straightforward application of information maximization among alternative latent representations of the same input naturally solves the collapse problem and achieves competitive empirical results. We propose a self-supervised learning method, CorInfoMax, that uses a second-order statistics-based mutual information measure that reflects the level of correlation among its arguments. Maximizing this correlative information measure between alternative representations of the same input serves two purposes: (1) it avoids the collapse problem by generating feature vectors with non-degenerate covariances; (2) it establishes relevance among alternative representations by increasing the linear dependence among them. An approximation of the proposed information maximization objective simplifies to a Euclidean distance-based objective function regularized by the log-determinant of the feature covariance matrix. The regularization term acts as a natural barrier against feature space degeneracy. Consequently, beyond avoiding complete output collapse to a single point, the proposed approach also prevents dimensional collapse by encouraging the spread of information across the whole feature space. Numerical experiments demonstrate that CorInfoMax achieves better or competitive performance results relative to the state-of-the-art SSL approaches

    Clinical outcome of preoperative breast MRI for early-stage breast cancer.

    No full text

    Performance analysis of vertical handover using predictable LGD event based on IEEE 802.21.

    Get PDF
    Next Generation Wireless Networks (NGWN) aim to provide any service at any time and anywhere with seamless mobility between homogeneous and heterogeneous networks. IEEE defines the IEEE 802.21 standard to facilitate seamless handover, namely, Media Independent Handover (MIH). IEEE 802.21 provides layer two events to upper layers with a view to enhance the operability and enable them to make the right decision on time. Link Going Down (LGD) is a predictive event triggered when a link quality degradation is expected in the near future. Connectivity losses and quality decreases are usually foreseeable during the handover process. Therefore, in this paper, we analyze the performance of our effective prediction model for generating the Link Going Down (LGD) event. The network performance metrics, such as packet loss, end-to-end delay, and throughput, have been evaluated using the Network Simulator NS2

    Unraveling flux behavior of superhydrophilic loose nanofiltration membranes during textile wastewater treatment

    No full text
    © 2015 Elsevier B.V. Loose nanofiltration (NF) membranes can be used to treat textile wastewater effectively, providing an attracted avenue for resource recovery (i.e., dye purification and salt reuse) at the premise of the integration of high dye retention and salt permeation. However, the issue of membrane fouling has to be adequately addressed in view of its practical application. In this study, superhydrophilic loose NF membranes (Sepro NF 6 and NF 2A, Ultura) were applied for textile wastewater treatment. Synthetic solutions containing dyes and NaCl were used as the feed stream of a NF unit. It was found that two factors, namely cake-enhanced concentration polarization and the formation of a dye cake layer, dramatically deteriorated the flux of NF membranes with a synergic effect. In viewpoint of realistic application, diafiltration of a binary dye/salt mixture indicates that cake-enhanced concentration polarization plays a dominant role for the low membrane flux. As the diafiltration continued, cake-enhanced concentration polarization was alleviated with a decreasing concentration of salt in the feed. At the subsequent post-concentration procedure, the formation of a dye cake layer slightly compromised the membrane flux, but the negative impact of cake-enhanced concentration polarization was negligible due to the small quantity of salt remained in the feed.status: publishe

    Beyond the imitation game: Quantifying and extrapolating the capabilities of language models

    No full text
    Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench currently consists of 204 tasks, contributed by 442 authors across 132 institutions. Task topics are diverse, drawing problems from linguistics, childhood development, math, common-sense reasoning, biology, physics, social bias, software development, and beyond. BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models. We evaluate the behavior of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters. In addition, a team of human expert raters performed all tasks in order to provide a strong baseline. Findings include: model performance and calibration both improve with scale, but are poor in absolute terms (and when compared with rater performance); performance is remarkably similar across model classes, though with benefits from sparsity; tasks that improve gradually and predictably commonly involve a large knowledge or memorization component, whereas tasks that exhibit "breakthrough" behavior at a critical scale often involve multiple steps or components, or brittle metrics; social bias typically increases with scale in settings with ambiguous context, but this can be improved with prompting

    Effects of hospital facilities on patient outcomes after cancer surgery: an international, prospective, observational study

    No full text
    © 2022 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 licenseBackground: Early death after cancer surgery is higher in low-income and middle-income countries (LMICs) compared with in high-income countries, yet the impact of facility characteristics on early postoperative outcomes is unknown. The aim of this study was to examine the association between hospital infrastructure, resource availability, and processes on early outcomes after cancer surgery worldwide. Methods: A multimethods analysis was performed as part of the GlobalSurg 3 study—a multicentre, international, prospective cohort study of patients who had surgery for breast, colorectal, or gastric cancer. The primary outcomes were 30-day mortality and 30-day major complication rates. Potentially beneficial hospital facilities were identified by variable selection to select those associated with 30-day mortality. Adjusted outcomes were determined using generalised estimating equations to account for patient characteristics and country-income group, with population stratification by hospital. Findings: Between April 1, 2018, and April 23, 2019, facility-level data were collected for 9685 patients across 238 hospitals in 66 countries (91 hospitals in 20 high-income countries; 57 hospitals in 19 upper-middle-income countries; and 90 hospitals in 27 low-income to lower-middle-income countries). The availability of five hospital facilities was inversely associated with mortality: ultrasound, CT scanner, critical care unit, opioid analgesia, and oncologist. After adjustment for case-mix and country income group, hospitals with three or fewer of these facilities (62 hospitals, 1294 patients) had higher mortality compared with those with four or five (adjusted odds ratio [OR] 3·85 [95% CI 2·58–5·75]; p<0·0001), with excess mortality predominantly explained by a limited capacity to rescue following the development of major complications (63·0% vs 82·7%; OR 0·35 [0·23–0·53]; p<0·0001). Across LMICs, improvements in hospital facilities would prevent one to three deaths for every 100 patients undergoing surgery for cancer. Interpretation: Hospitals with higher levels of infrastructure and resources have better outcomes after cancer surgery, independent of country income. Without urgent strengthening of hospital infrastructure and resources, the reductions in cancer-associated mortality associated with improved access will not be realised. Funding: National Institute for Health and Care Research
    corecore