10 research outputs found

    Evaluating XGBoost for Balanced and Imbalanced Data: Application to Fraud Detection

    Full text link
    This paper evaluates XGboost's performance given different dataset sizes and class distributions, from perfectly balanced to highly imbalanced. XGBoost has been selected for evaluation, as it stands out in several benchmarks due to its detection performance and speed. After introducing the problem of fraud detection, the paper reviews evaluation metrics for detection systems or binary classifiers, and illustrates with examples how different metrics work for balanced and imbalanced datasets. Then, it examines the principles of XGBoost. It proposes a pipeline for data preparation and compares a Vanilla XGBoost against a random search-tuned XGBoost. Random search fine-tuning provides consistent improvement for large datasets of 100 thousand samples, not so for medium and small datasets of 10 and 1 thousand samples, respectively. Besides, as expected, XGBoost recognition performance improves as more data is available, and deteriorates detection performance as the datasets become more imbalanced. Tests on distributions with 50, 45, 25, and 5 percent positive samples show that the largest drop in detection performance occurs for the distribution with only 5 percent positive samples. Sampling to balance the training set does not provide consistent improvement. Therefore, future work will include a systematic study of different techniques to deal with data imbalance and evaluating other approaches, including graphs, autoencoders, and generative adversarial methods, to deal with the lack of labels.Comment: 17 pages, 8 figures, 9 tables, Presented at NVIDIA GTC, The Conference for the Era of AI and the Metaverse, March 23, 2023. [S51129

    Examining the generalizability of research findings from archival data

    Get PDF
    This initiative examined systematically the extent to which a large set of archival research findings generalizes across contexts. We repeated the key analyses for 29 original strategic management effects in the same context (direct reproduction) as well as in 52 novel time periods and geographies; 45% of the reproductions returned results matching the original reports together with 55% of tests in different spans of years and 40% of tests in novel geographies. Some original findings were associated with multiple new tests. Reproducibility was the best predictor of generalizability—for the findings that proved directly reproducible, 84% emerged in other available time periods and 57% emerged in other geographies. Overall, only limited empirical evidence emerged for context sensitivity. In a forecasting survey, independent scientists were able to anticipate which effects would find support in tests in new samples

    Cullin-5 plays multiple roles in cell fate specification and synapse formation during Drosophila development

    No full text
    We describe a developmental analysis of Drosophila Cullin-5 (Cul-5) identified from the genome sequence on the basis of its high degree of homology to vertebrate and worm sequences. The gene is expressed in a restricted manner in ectodermal cells throughout development suggesting pleiotropic functions. We decided to examine the phenotypes of Cul-5 aberrations in two well-studied developmental systems: the neuromuscular junction (NMJ) and the developing sensory organ. Alteration of Cul-5 levels in motoneurons results in an increase in bouton number at the NMJ. The cells of a sensory organ on the adult notum arise from a single progenitor cell by regulated cell division. Aberrations in Cul-5 affect different steps in the lineage consistent with a role in cell fate determination, proliferation, and death. Such phenotypes highlight the multiple cellular processes in which Cul-5 can participate

    Efficacy and Safety of Naproxen Gel in Musculoskeletal Pain Management: A Prospective Cohort Study

    No full text
    Introduction: Naproxen is effective for various musculoskeletal conditions and has a longer half-life, making it a favourable choice for sustained relief. Additionally, there is a potential unmet need for guidelines on the usage of topical Non-Steroidal Anti-Inflammatory Drugs (NSAIDs) in the Asia-Pacific region. A study on naproxen 10% gel aims to address this need and increase awareness of its therapeutic potential in the region. Aim: To assess the efficacy and safety of naproxen 10% gel in relieving pain associated with lower back, knee, cervical, synovitis, bursitis, muscle sprain, and tendinitis. Materials and Methods: This prospective, cohort, observational, open-label, single-arm, multicentric study was conducted at 458 centres in India, including Ahmedabad, Bengaluru, Chandigarh, Chennai, Delhi, Guwahati, Hyderabad, Indore, Jaipur, Kolkata, Lucknow, Meerut, Mumbai, Patna, and Pune, between February 2023 and May 2023. The data was collected from outpatient settings/clinics of orthopaedicians and clinicians who have been prescribing topical naproxen 10% gel to their patients. The study included patients aged 18 to 60 years of either sex who were suffering from back pain, muscle pain, sprains, frozen shoulder, arthritis, acute low back ache (non-specific), or pain. The data was captured during the scheduled follow-up visits planned by the treating clinician, with data recorded at 3, 5, 10, and 15 days. At the baseline visit, demographic details (age, sex, weight, height, body mass index, and symptoms), Western Ontario and McMaster Universities Arthritis Index (WOMAC) pain subscale score, pain intensity on movement score, Visual Analogue Scale (VAS), and overall pain score were obtained. Results: Out of 10,587 patients, 10,265 completed the present study. The majority of patients had lower back pain (n=3386, 32.99%) and knee pain (n=3184, 31.02%). The average pain intensity on movement score of patients with bursitis significantly decreased from the baseline to 15 days {mean change {95% Confidence Interval (CI)}: 6.04 (5.89, 6.20); p<0.001}. Post-naproxen treatment, the average pain intensity, WOMAC pain score, VAS, and overall pain score significantly decreased from baseline to day 15 in patients with knee pain and lower back pain. A significant improvement in WOMAC, WOMAC pain (5.42 vs 17.98), WOMAC stiffness (1.49 vs 5.75), and WOMAC physical function score (18.93 vs 56.21) at day 15 was observed in patients with a muscle sprain. Adverse Events (AE) were reported in 173 (1.69%) patients overall, with dryness (n=125) being the most common, followed by erythema (n=20) and pruritus (n=17). Conclusion: Naproxen 10% gel is an effective topical treatment for lower back pain, knee pain, cervical pain, synovitis, bursitis, muscle sprain, and tendinitis. It could prove helpful in patients where the side-effects of oral NSAIDs are to be avoided

    ILC Reference Design Report Volume 1 - Executive Summary

    No full text
    The International Linear Collider (ILC) is a 200-500 GeV center-of-mass high-luminosity linear electron-positron collider, based on 1.3 GHz superconducting radio-frequency (SCRF) accelerating cavities. The ILC has a total footprint of about 31 km and is designed for a peak luminosity of 2x10^34 cm^-2s^-1. This report is the Executive Summary (Volume I) of the four volume Reference Design Report. It gives an overview of the physics at the ILC, the accelerator design and value estimate, the detector concepts, and the next steps towards project realization.The International Linear Collider (ILC) is a 200-500 GeV center-of-mass high-luminosity linear electron-positron collider, based on 1.3 GHz superconducting radio-frequency (SCRF) accelerating cavities. The ILC has a total footprint of about 31 km and is designed for a peak luminosity of 2x10^34 cm^-2s^-1. This report is the Executive Summary (Volume I) of the four volume Reference Design Report. It gives an overview of the physics at the ILC, the accelerator design and value estimate, the detector concepts, and the next steps towards project realization

    ILC Reference Design Report Volume 4 - Detectors

    No full text
    This report, Volume IV of the International Linear Collider Reference Design Report, describes the detectors which will record and measure the charged and neutral particles produced in the ILC's high energy e+e- collisions. The physics of the ILC, and the environment of the machine-detector interface, pose new challenges for detector design. Several conceptual designs for the detector promise the needed performance, and ongoing detector R&D is addressing the outstanding technological issues. Two such detectors, operating in push-pull mode, perfectly instrument the ILC interaction region, and access the full potential of ILC physics.This report, Volume IV of the International Linear Collider Reference Design Report, describes the detectors which will record and measure the charged and neutral particles produced in the ILC's high energy e+e- collisions. The physics of the ILC, and the environment of the machine-detector interface, pose new challenges for detector design. Several conceptual designs for the detector promise the needed performance, and ongoing detector R&D is addressing the outstanding technological issues. Two such detectors, operating in push-pull mode, perfectly instrument the ILC interaction region, and access the full potential of ILC physics

    ILC Reference Design Report Volume 3 - Accelerator

    No full text
    The International Linear Collider (ILC) is a 200-500 GeV center-of-mass high-luminosity linear electron-positron collider, based on 1.3 GHz superconducting radio-frequency (SCRF) accelerating cavities. The ILC has a total footprint of about 31 km and is designed for a peak luminosity of 2x10^34 cm^-2 s^-1. The complex includes a polarized electron source, an undulator-based positron source, two 6.7 km circumference damping rings, two-stage bunch compressors, two 11 km long main linacs and a 4.5 km long beam delivery system. This report is Volume III (Accelerator) of the four volume Reference Design Report, which describes the design and cost of the ILC.The International Linear Collider (ILC) is a 200-500 GeV center-of-mass high-luminosity linear electron-positron collider, based on 1.3 GHz superconducting radio-frequency (SCRF) accelerating cavities. The ILC has a total footprint of about 31 km and is designed for a peak luminosity of 2x10^34 cm^-2 s^-1. The complex includes a polarized electron source, an undulator-based positron source, two 6.7 km circumference damping rings, two-stage bunch compressors, two 11 km long main linacs and a 4.5 km long beam delivery system. This report is Volume III (Accelerator) of the four volume Reference Design Report, which describes the design and cost of the ILC

    International Linear Collider Reference Design Report Volume 2: PHYSICS AT THE ILC

    No full text
    This article reviews the physics case for the ILC. Baseline running at 500 GeV as well as possible upgrades and options are discussed. The opportunities on Standard Model physics, Higgs physics, Supersymmetry and alternative theories beyond the Standard Model are described.This article reviews the physics case for the ILC. Baseline running at 500 GeV as well as possible upgrades and options are discussed. The opportunities on Standard Model physics, Higgs physics, Supersymmetry and alternative theories beyond the Standard Model are described
    corecore