
    Machine Learning Regularization for the Minimum Volume Formula of Toric Calabi-Yau 3-folds

    We present a collection of explicit formulas for the minimum volume of Sasaki-Einstein 5-manifolds. The cone over these 5-manifolds is a toric Calabi-Yau 3-fold. These toric Calabi-Yau 3-folds are associated with an infinite class of 4d N=1 supersymmetric gauge theories, which are realized as worldvolume theories of D3-branes probing the toric Calabi-Yau 3-folds. Under the AdS/CFT correspondence, the minimum volume of the Sasaki-Einstein base is inversely proportional to the central charge of the corresponding 4d N=1 superconformal field theories. The presented formulas for the minimum volume are in terms of geometric invariants of the toric Calabi-Yau 3-folds. These explicit results are derived by implementing machine learning regularization techniques that advance beyond previous applications of machine learning for determining the minimum volume. Moreover, the use of machine learning regularization allows us to present interpretable and explainable formulas for the minimum volume. Our work confirms that, even for extensive sets of toric Calabi-Yau 3-folds, the proposed formulas approximate the minimum volume with remarkable accuracy.
    Comment: 15 pages, 9 figures, 4 tables
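    The abstract does not spell out the regression setup, but the general pattern (a sparse, regularized fit over geometric invariants that can be read off as an explicit formula) can be sketched as follows. This is a minimal illustration on synthetic data; the invariant names I1-I3 and the linear form are assumptions, not the paper's formulas.

```python
# Minimal sketch (not the authors' code): L1-regularized regression
# producing a sparse, human-readable formula for a target quantity
# from tabulated geometric invariants. Feature names are hypothetical.
import numpy as np
from sklearn.linear_model import Lasso
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
# Stand-in data: columns could be invariants of the toric diagram;
# the target could be the inverse minimum volume, 1/V_min.
X = rng.uniform(1.0, 10.0, size=(500, 3))
y = 0.7 * X[:, 0] + 0.2 * X[:, 1] + rng.normal(0.0, 0.01, 500)

scaler = StandardScaler().fit(X)
model = Lasso(alpha=0.01).fit(scaler.transform(X), y)

# The L1 penalty drives irrelevant coefficients to zero, so the fitted
# model can be read off as a short explicit linear formula.
for name, w in zip(["I1", "I2", "I3"], model.coef_):
    print(f"{name}: {w:+.3f}")
```

    The L1 penalty is what makes the result interpretable: most coefficients vanish, and the surviving terms form a closed-form expression rather than a black-box predictor.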

    Building Near-Real-Time Processing Pipelines with the Spark-MPI Platform

    Advances in detectors and computational technologies provide new opportunities for applied research and the fundamental sciences. Concurrently, dramatic increases in the three Vs (Volume, Velocity, and Variety) of experimental data and in the scale of computational tasks have produced demand for new real-time processing systems at experimental facilities. Recently, this demand was addressed by the Spark-MPI approach, which connects the Spark data-intensive platform with the MPI high-performance framework. In contrast with existing data management and analytics systems, Spark introduced a new middleware based on resilient distributed datasets (RDDs), which decoupled various data sources from high-level processing algorithms. The RDD middleware significantly advanced the scope of data-intensive applications, spreading from SQL queries to machine learning to graph processing. Spark-MPI further extended the Spark ecosystem with MPI applications via the Process Management Interface. The paper explores this integrated platform within the context of online ptychographic and tomographic reconstruction pipelines.
    Comment: New York Scientific Data Summit, August 6-9, 201
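    As a rough illustration of the RDD middleware idea (plain Spark only, not the Spark-MPI bridge itself, whose API is not shown here), the PySpark sketch below applies the same high-level pipeline to records regardless of where they came from; the frame contents and the reduction are placeholders.

```python
# Minimal PySpark sketch of the RDD decoupling described above: the
# per-frame preprocessing and the cluster-wide reduction are written
# against the RDD, not against any particular data source.
from pyspark.sql import SparkSession
import numpy as np

spark = SparkSession.builder.appName("rdd-pipeline-sketch").getOrCreate()
sc = spark.sparkContext

# Stand-in for detector frames; in a facility pipeline these records
# would arrive from a stream or file system instead.
frames = sc.parallelize([np.ones((4, 4)) * i for i in range(8)])

# High-level processing decoupled from the source: per-frame mean
# subtraction followed by an element-wise sum across the cluster.
total = frames.map(lambda f: f - f.mean()).reduce(lambda a, b: a + b)
print(total)
spark.stop()
```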

    Expression of Cytokines and Chemokines as Predictors of Stroke Outcomes in Acute Ischemic Stroke

    Introduction: Ischemic stroke remains one of the most debilitating diseases and is the fifth leading cause of death in the US. The ability to predict stroke outcomes within the acute period of stroke would be essential for care planning and rehabilitation. The Blood and Clot Thrombectomy Registry and Collaboration (BACTRAC; clinicaltrials.gov NCT03153683) study collects arterial blood immediately distal and proximal to the intracranial thrombus at the time of mechanical thrombectomy. These blood samples are an innovative resource for evaluating acute gene expression changes at the time of ischemic stroke. The purpose of this study was to identify inflammatory genes and important immune factors during mechanical thrombectomy for emergent large vessel occlusion (ELVO), and to determine which patient demographics were predictors of stroke outcomes (infarct and/or edema volume) in acute ischemic stroke patients. Methods: The BACTRAC study is a non-probability sampling of male and female subjects (≥18 years old) treated with mechanical thrombectomy for ELVO. In 28 subjects (66 ± 15.48 years) who underwent thrombectomy, we evaluated relative concentrations of mRNA for 84 inflammatory molecules in arterial blood distal and proximal to the intracranial thrombus. We used the machine learning method Random Forest to predict which inflammatory genes and patient demographics were important features for infarct and edema volumes. To validate the genes overlapping with outcomes, we performed ordinary least squares regression analysis. Results: Machine learning analyses demonstrated that the genes and subject factors CCR4, IFNA2, IL-9, CXCL3, Age, T2DM, IL-7, CCL4, BMI, IL-5, CCR3, TNFα, and IL-27 predicted infarct volume. The genes and subject factor IFNA2, IL-5, CCL11, IL-17C, CCR4, IL-9, IL-7, CCR3, IL-27, T2DM, and CSF2 predicted edema volume. The overlap of genes CCR4, IFNA2, IL-9, IL-7, IL-5, CCR3, and IL-27 with T2DM predicted both infarct and edema volumes. These genes relate to a microenvironment favoring chemoattraction and proliferation of autoimmune cells, particularly Th2 cells and neutrophils. Conclusions: Machine learning algorithms can be employed to develop prognostic predictive biomarkers for stroke outcomes in ischemic stroke patients, particularly with regard to identifying the acute gene expression changes that occur during stroke.
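    The two-step pattern described here (Random Forest feature ranking followed by OLS validation of the top predictors) can be sketched with standard Python tooling. The data, gene columns, and settings below are synthetic stand-ins, not the BACTRAC dataset or the authors' exact pipeline.

```python
# Sketch of the two-step analysis pattern: rank predictors by Random
# Forest importance, then check the top-ranked one with an OLS fit.
# All data and column names below are hypothetical.
import numpy as np
import pandas as pd
import statsmodels.api as sm
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(1)
cols = ["CCR4", "IFNA2", "IL9", "Age", "T2DM"]
df = pd.DataFrame(rng.normal(size=(28, len(cols))), columns=cols)
# Synthetic outcome with one planted signal, mimicking infarct volume.
infarct_volume = 2.0 * df["CCR4"] + rng.normal(0.0, 0.5, 28)

rf = RandomForestRegressor(n_estimators=500, random_state=0)
rf.fit(df, infarct_volume)
ranked = sorted(zip(cols, rf.feature_importances_), key=lambda t: -t[1])
print(ranked)

# OLS on the top-ranked feature to check direction and significance.
top = ranked[0][0]
ols = sm.OLS(infarct_volume, sm.add_constant(df[[top]])).fit()
print(ols.summary().tables[1])
```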

    Performance Evaluation of Apache Spark MLlib Algorithms on an Intrusion Detection Dataset

    The growing use of the Internet and web services, the advent of fifth-generation cellular network technology (5G), and ever-growing Internet of Things (IoT) data traffic will continue to increase global Internet usage. To ensure the security of future networks, machine learning-based intrusion detection and prevention systems (IDPS) must be implemented to detect new attacks, and big data parallel processing tools can be used to handle the huge volumes of training data in these systems. In this paper, Apache Spark, a fast, general-purpose cluster computing platform, is used for processing and training on a large volume of network traffic feature data. The most important features of the CSE-CIC-IDS2018 dataset are used for constructing machine learning models, and the most popular machine learning approaches, namely Logistic Regression, Support Vector Machine (SVM), three different Decision Tree classifiers, and Naive Bayes, are used to train the models on up to eight worker nodes. Our Spark cluster contains seven machines acting as worker nodes and one machine configured as both master and worker. We use the CSE-CIC-IDS2018 dataset to evaluate the overall performance of these algorithms on Botnet attacks, and distributed hyperparameter tuning is used to find the best parameters for a single decision tree. We achieved up to 100% accuracy in our experiments using features selected by the learning method.
    Comment: Journal of Computing and Security (Isfahan University, Iran), Vol. 9, No. 1, 202
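    The distributed tuning step maps naturally onto the public pyspark.ml API. The sketch below runs a cross-validated grid search over decision tree parameters on toy data; the feature names and grid values are assumptions, not those of the paper.

```python
# Hedged sketch of distributed decision-tree tuning with pyspark.ml,
# on toy data rather than CSE-CIC-IDS2018.
from pyspark.sql import SparkSession
from pyspark.ml.feature import VectorAssembler
from pyspark.ml.classification import DecisionTreeClassifier
from pyspark.ml.evaluation import BinaryClassificationEvaluator
from pyspark.ml.tuning import CrossValidator, ParamGridBuilder

spark = SparkSession.builder.appName("ids-tuning-sketch").getOrCreate()
data = spark.createDataFrame(
    [(0.1, 12.0, 0.0), (5.3, 1.0, 1.0), (0.2, 11.0, 0.0), (6.1, 0.5, 1.0)] * 50,
    ["flow_duration", "pkt_rate", "label"],  # hypothetical feature names
)
features = VectorAssembler(
    inputCols=["flow_duration", "pkt_rate"], outputCol="features"
).transform(data)

tree = DecisionTreeClassifier(labelCol="label", featuresCol="features")
grid = (ParamGridBuilder()
        .addGrid(tree.maxDepth, [3, 5, 10])
        .addGrid(tree.impurity, ["gini", "entropy"])
        .build())
# Grid points are evaluated in parallel across the worker nodes.
cv = CrossValidator(estimator=tree, estimatorParamMaps=grid,
                    evaluator=BinaryClassificationEvaluator(labelCol="label"),
                    numFolds=3, parallelism=8)
best = cv.fit(features).bestModel
print(best.explainParam("maxDepth"))
spark.stop()
```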

    Precision measurement of cardiac structure and function in cardiovascular magnetic resonance using machine learning

    BACKGROUND: Measurement of cardiac structure and function from images (e.g. volumes, mass, and derived parameters such as left ventricular (LV) ejection fraction [LVEF]) guides care for millions. This is best assessed using cardiovascular magnetic resonance (CMR), but image analysis is currently performed by individual clinicians, which introduces error. We sought to develop a machine learning algorithm for volumetric analysis of CMR images with demonstrably better precision than human analysis. METHODS: A fully automated machine learning algorithm was trained on 1923 scans (10 scanner models, 13 institutions, 9 clinical conditions, 60,000 contours) and used to segment the LV blood volume and myocardium. Performance was quantified by measuring precision on an independent multi-site validation dataset with multiple pathologies (n = 109 patients, scanned twice). This dataset was augmented with a further 1277 patients scanned as part of routine clinical care to allow qualitative assessment of generalization ability by identifying mis-segmentations. Machine learning algorithm ('machine') performance was compared to three clinicians ('human') and a commercial tool (cvi42, Circle Cardiovascular Imaging). FINDINGS: Machine analysis was quicker (20 s per patient) than human analysis (13 min). The overall machine mis-segmentation rate was 1 in 479 images for the combined dataset, occurring mostly in rare pathologies not encountered in training. Without correcting these mis-segmentations, machine analysis had superior precision to the three clinicians (e.g. scan-rescan coefficients of variation, human vs. machine: LVEF 6.0% vs. 4.2%, LV mass 4.8% vs. 3.6%; both P < 0.05), translating to a 46% reduction in required trial sample size using an LVEF endpoint. CONCLUSION: We present a fully automated algorithm for measuring LV structure and global systolic function that surpasses human performance in both speed and precision.
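    The abstract does not state which scan-rescan estimator underlies the quoted coefficients of variation; a common choice is the within-subject standard deviation from paired differences, divided by the cohort mean. The sketch below applies that definition to synthetic data, not the validation cohort. Because required trial sample size scales with measurement variance, a lower CoV shrinks the needed n roughly by the squared ratio of the CoVs.

```python
# Sketch of a scan-rescan coefficient of variation under one common
# within-subject definition (assumed, not taken from the paper);
# the paired measurements below are synthetic.
import numpy as np

rng = np.random.default_rng(2)
scan1 = rng.normal(60.0, 8.0, 109)         # e.g. LVEF (%), first scan
scan2 = scan1 + rng.normal(0.0, 2.5, 109)  # repeat scan, same patients

# Within-subject SD from paired differences: sd_w = sqrt(mean(d^2) / 2).
d = scan1 - scan2
sd_within = np.sqrt(np.mean(d**2) / 2.0)
cov = 100.0 * sd_within / np.mean(np.concatenate([scan1, scan2]))
print(f"scan-rescan CoV: {cov:.1f}%")
```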

    A generative modeling approach for benchmarking and training shallow quantum circuits

    Hybrid quantum-classical algorithms provide ways to use noisy intermediate-scale quantum computers for practical applications. Expanding the portfolio of such techniques, we propose a quantum circuit learning algorithm that can be used to assist the characterization of quantum devices and to train shallow circuits for generative tasks. The procedure leverages quantum hardware capabilities to their fullest extent by using native gates and their qubit connectivity. We demonstrate that our approach can learn an optimal preparation of the Greenberger-Horne-Zeilinger states, also known as "cat states". We further demonstrate that our approach can efficiently prepare approximate representations of coherent thermal states, wave functions that encode Boltzmann probabilities in their amplitudes. Finally, complementing proposals to characterize the power or usefulness of near-term quantum devices, such as IBM's quantum volume, we provide a new hardware-independent metric called the qBAS score. It is based on the performance yield in a specific sampling task on one of the canonical machine learning data sets, known as Bars and Stripes. We show how entanglement is a key ingredient in encoding the patterns of this data set, making it an ideal benchmark for testing hardware from four qubits upward. We provide experimental results and an evaluation of this metric to probe the trade-off between several architectural circuit designs and circuit depths on an ion-trap quantum computer.
    Comment: 16 pages, 9 figures. Minor revisions. As published in npj Quantum Information
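    On a rough reading of the qBAS idea (the paper's exact definition may differ), the score combines precision (the fraction of circuit samples that are valid Bars-and-Stripes patterns) and recall (the fraction of BAS patterns the sampler covers) into an F1-style number. A plain-Python sketch on the 2x2 instance:

```python
# Hedged sketch of an F1-style score over the Bars-and-Stripes data
# set; this follows our reading of the abstract, not a verified
# reproduction of the qBAS definition.
from itertools import product

def bas_patterns(n, m):
    """All n x m bars-and-stripes bitstrings (constant rows or columns)."""
    pats = set()
    for rows in product([0, 1], repeat=n):   # horizontal stripes
        pats.add("".join(str(r) for r in rows for _ in range(m)))
    for cols in product([0, 1], repeat=m):   # vertical bars
        pats.add("".join(str(c) for _ in range(n) for c in cols))
    return pats

valid = bas_patterns(2, 2)
samples = ["0000", "1111", "0101", "0110", "0101"]  # toy measurements

hits = [s for s in samples if s in valid]
precision = len(hits) / len(samples)       # samples that are BAS patterns
recall = len(set(hits)) / len(valid)       # BAS patterns that were covered
score = 2 * precision * recall / (precision + recall)
print(f"precision={precision:.2f} recall={recall:.2f} score={score:.2f}")
```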

    Predicting the Transfer Efficiency of Stencil Printing by Machine Learning Technique

    An experiment was carried out to acquire data on the transfer efficiency of stencil printing, and a machine learning-based technique (an artificial neural network) was trained to predict that parameter. The input parameter space in the experiment included the printing speed at five different levels (between 20 and 120 mm/s) and the area ratio of stencil apertures from 0.34 to 1.69. Three types of lead-free solder paste were also investigated: Type-3 (particle size range 20–45 μm), Type-4 (20–38 μm), and Type-5 (10–25 μm). The output parameter space included the height and the area of the print deposits and the respective transfer efficiency, which is the ratio of the deposited paste volume to the aperture volume. Finally, an artificial neural network was trained on the empirical data using the Levenberg–Marquardt training algorithm. The optimal tuning factor for the fine-tuning of the network size was found to be approximately 9, resulting in 160 hidden neurons. The trained network was able to predict the output parameters with a mean absolute percentage error (MAPE) below 3%, although the prediction error depended on the values of the input parameters, which the paper elaborates in detail. The research demonstrated the applicability of machine learning techniques to yield prediction in the stencil printing process.
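    The Levenberg–Marquardt trainer used in the paper is standard in MATLAB's neural network toolbox but absent from scikit-learn, so the sketch below swaps in L-BFGS purely for illustration; the data, the target function, and the network size mirror the setup described above only loosely and are synthetic stand-ins.

```python
# Illustrative sketch only: a small regression network predicting a
# transfer-efficiency-like target from printing speed and area ratio,
# scored with MAPE. L-BFGS replaces Levenberg-Marquardt here.
import numpy as np
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(3)
speed = rng.uniform(20, 120, 300)           # printing speed [mm/s]
area_ratio = rng.uniform(0.34, 1.69, 300)   # aperture area ratio
X = np.column_stack([speed, area_ratio])
# Toy stand-in for transfer efficiency (deposit / aperture volume).
y = 1.0 / (1.0 + np.exp(-4 * (area_ratio - 0.6))) - 0.001 * speed

net = MLPRegressor(hidden_layer_sizes=(160,), solver="lbfgs",
                   max_iter=5000, random_state=0).fit(X, y)
pred = net.predict(X)
mape = 100.0 * np.mean(np.abs((y - pred) / y))
print(f"MAPE: {mape:.2f}%")
```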