
    Machine Learning for Fluid Mechanics

    The field of fluid mechanics is rapidly advancing, driven by unprecedented volumes of data from field measurements, experiments, and large-scale simulations at multiple spatiotemporal scales. Machine learning offers a wealth of techniques to extract information from data that could be translated into knowledge about the underlying fluid mechanics. Moreover, machine learning algorithms can augment domain knowledge and automate tasks related to flow control and optimization. This article presents an overview of the history, current developments, and emerging opportunities of machine learning for fluid mechanics. It outlines fundamental machine learning methodologies and discusses their uses for understanding, modeling, optimizing, and controlling fluid flows. The strengths and limitations of these methods are addressed from the perspective of scientific inquiry that considers data as an inherent part of modeling, experimentation, and simulation. Machine learning provides a powerful information processing framework that can enrich, and possibly even transform, current lines of fluid mechanics research and industrial applications. Comment: To appear in the Annual Reviews of Fluid Mechanics, 202

    Rails Quality Data Modelling via Machine Learning-Based Paradigms


    Machine Learning and Integrative Analysis of Biomedical Big Data.

    Recent developments in high-throughput technologies have accelerated the accumulation of massive amounts of omics data from multiple sources: genome, epigenome, transcriptome, proteome, metabolome, etc. Traditionally, data from each source (e.g., genome) are analyzed in isolation using statistical and machine learning (ML) methods. Integrative analysis of multi-omics and clinical data is key to new biomedical discoveries and advancements in precision medicine. However, data integration poses new computational challenges and exacerbates those associated with single-omics studies. Specialized computational approaches are required to effectively and efficiently perform integrative analysis of biomedical data acquired from diverse modalities. In this review, we discuss state-of-the-art ML-based approaches for tackling five specific computational challenges associated with integrative analysis: the curse of dimensionality, data heterogeneity, missing data, class imbalance, and scalability issues.
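    The review itself proposes no single method, but one of the five challenges it names, class imbalance, is easy to illustrate in isolation. The sketch below uses class weighting in scikit-learn on a synthetic stand-in for an integrated multi-omics feature matrix; the data, the logistic-regression model, and the weighting scheme are illustrative assumptions, not taken from the reviewed studies.

```python
# Minimal sketch: handling class imbalance (one of the five challenges above)
# with class weighting. The data are synthetic stand-ins for an integrated
# multi-omics feature matrix, not from any of the reviewed studies.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import balanced_accuracy_score

rng = np.random.default_rng(0)
n_majority, n_minority, n_features = 950, 50, 200   # 19:1 class imbalance
X = rng.normal(size=(n_majority + n_minority, n_features))
y = np.array([0] * n_majority + [1] * n_minority)
X[y == 1] += 0.5                                     # weak signal for the rare class

X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

# class_weight="balanced" re-weights samples inversely to class frequency,
# so the rare class is not ignored during fitting.
clf = LogisticRegression(class_weight="balanced", max_iter=1000)
clf.fit(X_tr, y_tr)
print("balanced accuracy:", balanced_accuracy_score(y_te, clf.predict(X_te)))
```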

    Study on identification of nonlinear systems using Quasi-ARX models

    Degree system: new; report number: Kou 3660; type of degree: Doctor of Engineering; date conferred: 2012/9/15; Waseda University diploma number: Shin 6026. Waseda University

    Analysis of Artificial Neural Networks For Building Automated Surrogate Algorithms

    When pursuing online optimization of complex chemical processes, using first-principles models is rarely an option, since such models demand large computation times. Surrogate models, which can emulate first-principles models, offer a credible solution to this problem by enabling faster optimization. Thus, the entire challenge of enabling online optimization of complex models depends on the construction of efficient surrogate models. Often, surrogate-building algorithms have certain parameters that are fixed based on some heuristic, thereby inviting potential errors into such surrogate models. This work presents an elaborate study of the parameters affecting the predictability of artificial neural networks (ANNs), viz. (a) the architecture of the ANN, (b) the sample size required by the ANN, (c) the maximum possible accuracy of prediction, (d) a robust sampling plan, and (e) the choice of transfer function for node activation. The ANNs are then utilized as surrogates for a highly nonlinear industrial sintering process, the optimization of which is then realized nearly 7 times faster than the optimization study using the expensive phenomenological model.
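    As a rough illustration of the kind of parameter study described above, the sketch below scans a few ANN architectures and transfer functions for a surrogate of a cheap synthetic function. The test function, the uniform sampling plan, and the scikit-learn MLPRegressor settings are illustrative assumptions standing in for the expensive sintering model and the paper's actual procedure.

```python
# Sketch of a surrogate-building parameter scan over architecture and
# transfer function. The "expensive model" is a cheap synthetic function
# standing in for the phenomenological sintering simulator.
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import r2_score

def expensive_model(x):                  # placeholder for a first-principles model
    return np.sin(3 * x[:, 0]) * np.cos(2 * x[:, 1]) + 0.5 * x[:, 0] * x[:, 1]

rng = np.random.default_rng(1)
X = rng.uniform(-1, 1, size=(500, 2))    # sampling plan: simple uniform design
y = expensive_model(X)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for hidden in [(10,), (30,), (30, 30)]:          # candidate architectures
    for activation in ["tanh", "relu"]:          # candidate transfer functions
        ann = MLPRegressor(hidden_layer_sizes=hidden, activation=activation,
                           max_iter=5000, random_state=0).fit(X_tr, y_tr)
        print(hidden, activation, round(r2_score(y_te, ann.predict(X_te)), 3))
```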

    Enhanced parallel Differential Evolution algorithm for problems in computational systems biology

    [Abstract] Many key problems in computational systems biology and bioinformatics can be formulated and solved using a global optimization framework. The complexity of the underlying mathematical models requires the use of efficient solvers in order to obtain satisfactory results in reasonable computation times. Metaheuristics are gaining recognition in this context, with Differential Evolution (DE) as one of the most popular methods. However, for most realistic applications, like those considering parameter estimation in dynamic models, DE still requires excessive computation times. Here we consider this latter class of problems and present several enhancements to DE based on the introduction of additional algorithmic steps and the exploitation of parallelism. In particular, we propose an asynchronous parallel implementation of DE which has been extended with improved heuristics to exploit the specific structure of parameter estimation problems in computational systems biology. The proposed method is evaluated with different types of benchmark problems: (i) black-box global optimization problems and (ii) calibration of non-linear dynamic models of biological systems, obtaining excellent results both in terms of solution quality and in terms of speedup and scalability. Ministerio de Economía y Competitividad; DPI2011-28112-C04-03. Consejo Superior de Investigaciones Científicas; PIE-201170E018. Ministerio de Ciencia e Innovación; TIN2013-42148-P. Galicia. Consellería de Cultura, Educación e Ordenación Universitaria; GRC2013/05
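    For readers unfamiliar with the baseline algorithm, the sketch below is a minimal serial DE (rand/1/bin) loop on a toy objective. It shows only the standard mutation, crossover, and selection steps; the asynchronous parallel implementation and the parameter-estimation heuristics that are the paper's actual contribution are not reproduced, and all settings (population size, F, CR) are illustrative.

```python
# Minimal serial DE (rand/1/bin) on a toy objective, to show the baseline that
# the enhanced asynchronous parallel version described above builds on.
import numpy as np

def differential_evolution(f, bounds, pop_size=30, F=0.8, CR=0.9, n_gen=200, seed=0):
    rng = np.random.default_rng(seed)
    dim = len(bounds)
    lo, hi = np.array(bounds).T
    pop = rng.uniform(lo, hi, size=(pop_size, dim))
    fit = np.array([f(x) for x in pop])
    for _ in range(n_gen):
        for i in range(pop_size):
            a, b, c = pop[rng.choice([j for j in range(pop_size) if j != i], 3, replace=False)]
            mutant = np.clip(a + F * (b - c), lo, hi)          # mutation
            cross = rng.random(dim) < CR
            cross[rng.integers(dim)] = True                    # force at least one gene to cross
            trial = np.where(cross, mutant, pop[i])            # binomial crossover
            f_trial = f(trial)
            if f_trial <= fit[i]:                              # greedy selection
                pop[i], fit[i] = trial, f_trial
    return pop[fit.argmin()], fit.min()

best_x, best_f = differential_evolution(lambda x: np.sum(x**2), bounds=[(-5, 5)] * 5)
print(best_x, best_f)
```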

    Evolutionary Computation and QSAR Research

    [Abstract] The successful high-throughput screening of molecule libraries for a specific biological property is one of the main improvements in drug discovery. Virtual molecular filtering and screening rely greatly on quantitative structure-activity relationship (QSAR) analysis, a mathematical model that correlates the activity of a molecule with molecular descriptors. QSAR models have the potential to reduce the costly failure of drug candidates in advanced (clinical) stages by filtering combinatorial libraries, eliminating candidates with predicted toxic effects and poor pharmacokinetic profiles, and reducing the number of experiments. To obtain a predictive and reliable QSAR model, scientists use methods from various fields such as molecular modeling, pattern recognition, machine learning, and artificial intelligence. QSAR modeling relies on three main steps: codification of the molecular structure into molecular descriptors, selection of relevant variables in the context of the analyzed activity, and search for the optimal mathematical model that correlates the molecular descriptors with a specific activity. Since a variety of techniques from statistics and artificial intelligence can aid the variable selection and model building steps, this review focuses on the evolutionary computation methods supporting these tasks. Thus, this review explains the basics of genetic algorithms and genetic programming as evolutionary computation approaches, selection methods for high-dimensional data in QSAR, methods to build QSAR models, current evolutionary feature selection methods and applications in QSAR, and future trends in joint or multi-task feature selection methods. Instituto de Salud Carlos III, PIO52048. Instituto de Salud Carlos III, RD07/0067/0005. Ministerio de Industria, Comercio y Turismo; TSI-020110-2009-53. Galicia. Consellería de Economía e Industria; 10SIN105004P
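    To make the variable selection step concrete, the sketch below runs a toy genetic algorithm over binary descriptor masks, scoring each subset by cross-validated model fit. The synthetic descriptor matrix, the ridge model, and all GA settings are illustrative assumptions, not a specific method from the review.

```python
# Toy genetic-algorithm feature selection of the kind surveyed above: binary
# chromosomes mark which molecular descriptors enter a model, and the
# cross-validated R^2 of a ridge regression is the fitness. Data are synthetic.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score

X, y = make_regression(n_samples=200, n_features=50, n_informative=8, noise=5.0,
                       random_state=0)
rng = np.random.default_rng(0)

def fitness(mask):
    if mask.sum() == 0:
        return -np.inf
    return cross_val_score(Ridge(), X[:, mask], y, cv=3).mean()

pop = rng.random((20, X.shape[1])) < 0.2             # initial population of masks
for gen in range(30):
    scores = np.array([fitness(ind) for ind in pop])
    parents = pop[scores.argsort()[::-1][:10]]        # truncation selection
    children = []
    for _ in range(10):
        p1, p2 = parents[rng.integers(10, size=2)]
        cut = rng.integers(1, X.shape[1])              # one-point crossover
        child = np.concatenate([p1[:cut], p2[cut:]])
        flip = rng.random(X.shape[1]) < 0.02           # bit-flip mutation
        children.append(np.where(flip, ~child, child))
    pop = np.vstack([parents, children])

best = pop[np.argmax([fitness(ind) for ind in pop])]
print("selected descriptors:", np.flatnonzero(best))
```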

    Co-evolution in Manufacturing Systems Inspired by Biological Analogy

    The artificial world experiences continuous changes that result in the evolution of design features of products and the capabilities of the corresponding manufacturing systems, similar to the changes of species in the natural world. The idea of simulating the artificial world, based on the analogy between the symbiotic behaviour of products and manufacturing systems and the biological co-evolution of different species in nature, is expressed by a model and novel hypotheses regarding the manufacturing co-evolution mechanism, preserving that co-evolution and using it for future planning and prediction. Biological analogy is also employed to drive the mathematical formulation of the model and its algorithms. Cladistics, a biological classification tool, is adapted and used to reveal evolution trends of products and systems, and their symbiosis is illustrated using another biological tool, tree reconciliation. A new mathematical method was developed to capture the co-development relationships between product features and manufacturing capabilities, and it has been used for synthesizing and predicting new species of systems and products. The developed model was validated using machining and assembly case studies. Results have proven the proposed hypotheses, demonstrated the presence of manufacturing symbiosis, and enabled the prediction and synthesis of new systems and products. The model has also been adapted for use in different applications such as system layout design, identification of sustainable design features, and product family redesign to promote modularity. The co-evolution model is significant as it closes the loop connecting products and systems to learn from their shared past development and predict their intertwined future, unlike available unidirectional design strategies. The economic life of manufacturing systems can be extended by better utilizing their available capabilities, since the co-evolution model directs product-system development towards reaching a perfect co-evolution state. This research presents original ideas expressed through innovative co-evolution hypotheses in manufacturing, a new mathematical model and algorithms, and demonstrates their advantages and benefits in a wide range of applications.

    A review of machine learning applications in wildfire science and management

    Artificial intelligence has been applied in wildfire science and management since the 1990s, with early applications including neural networks and expert systems. Since then, the field has rapidly progressed alongside the wide adoption of machine learning (ML) in the environmental sciences. Here, we present a scoping review of ML in wildfire science and management. Our objective is to improve awareness of ML among wildfire scientists and managers, as well as to illustrate the challenging range of problems in wildfire science available to data scientists. We first present an overview of popular ML approaches used in wildfire science to date, and then review their use within six problem domains: 1) fuels characterization, fire detection, and mapping; 2) fire weather and climate change; 3) fire occurrence, susceptibility, and risk; 4) fire behavior prediction; 5) fire effects; and 6) fire management. We also discuss the advantages and limitations of various ML approaches and identify opportunities for future advances in wildfire science and management within a data science context. We identified 298 relevant publications, in which the most frequently used ML methods were random forests, MaxEnt, artificial neural networks, decision trees, support vector machines, and genetic algorithms. Opportunities exist to apply more current ML methods (e.g., deep learning and agent-based learning) in wildfire science. However, despite the ability of ML models to learn on their own, expertise in wildfire science is necessary to ensure realistic modelling of fire processes across multiple scales, while the complexity of some ML methods requires sophisticated knowledge for their application. Finally, we stress that the wildfire research and management community plays an active role in providing relevant, high-quality data for use by practitioners of ML methods. Comment: 83 pages, 4 figures, 3 tables
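    As a minimal, self-contained illustration of the review's most frequently reported method, the sketch below fits a random forest to a fire-occurrence classification task. The covariates, the synthetic ignition rule, and the scikit-learn settings are invented for illustration and are not drawn from any of the 298 reviewed studies.

```python
# Minimal random-forest fire-occurrence classifier on synthetic weather and
# fuel covariates; everything here is a stand-in, not reviewed-study data.
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
n = 2000
df = pd.DataFrame({
    "temperature_c":  rng.normal(22, 8, n),
    "rel_humidity":   rng.uniform(10, 90, n),
    "wind_speed_kmh": rng.gamma(2.0, 6.0, n),
    "fuel_moisture":  rng.uniform(5, 35, n),
})
# Synthetic rule: hotter, drier, windier conditions raise ignition probability.
logit = (0.15 * df.temperature_c - 0.05 * df.rel_humidity
         + 0.08 * df.wind_speed_kmh - 0.10 * df.fuel_moisture - 1.0)
fire = (rng.random(n) < 1 / (1 + np.exp(-logit))).astype(int)

X_tr, X_te, y_tr, y_te = train_test_split(df, fire, stratify=fire, random_state=0)
rf = RandomForestClassifier(n_estimators=300, random_state=0).fit(X_tr, y_tr)
print("AUC:", roc_auc_score(y_te, rf.predict_proba(X_te)[:, 1]))
print(dict(zip(df.columns, rf.feature_importances_.round(3))))
```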

    Parallel based support vector regression for empirical modeling of nonlinear chemical process systems

    In this paper, a support vector regression (SVR) model using a radial basis function (RBF) kernel is proposed within an integrated parallel linear-and-nonlinear model framework for empirical modeling of nonlinear chemical process systems. Using a linear orthonormal basis filters (OBF) model to represent the linear structure, the developed empirical parallel model is tested for its performance under open-loop conditions in a nonlinear continuous stirred-tank reactor simulation case study as well as in a highly nonlinear cascaded tank benchmark system. A comparative study between the SVR and parallel OBF-SVR models is then carried out. The results showed that the proposed parallel OBF-SVR model retained the same modelling efficiency as the SVR, whilst enhancing its generalization to out-of-sample data.
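    The sketch below gives one simplified reading of such a parallel linear-plus-nonlinear structure: an ordinary linear regressor stands in for the OBF block (no orthonormal basis filters are implemented), an RBF-kernel SVR is fitted to its residuals, and the two branch outputs are summed at prediction time. The data, the hyperparameters, and the residual-based way of combining the branches are assumptions for illustration, not the paper's exact formulation.

```python
# Simplified parallel linear-plus-nonlinear model: a plain linear regressor
# stands in for the OBF block, an RBF-kernel SVR captures its residuals, and
# the final prediction is the sum of the two branches.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.svm import SVR

rng = np.random.default_rng(0)
u = rng.uniform(-2, 2, size=(400, 1))                    # process input
y = 1.5 * u[:, 0] + 0.8 * np.tanh(3 * u[:, 0]) + rng.normal(0, 0.05, 400)

linear = LinearRegression().fit(u, y)                    # linear branch
residual = y - linear.predict(u)
nonlinear = SVR(kernel="rbf", C=10.0, gamma="scale").fit(u, residual)  # nonlinear branch

u_new = np.linspace(-2, 2, 5).reshape(-1, 1)             # out-of-sample inputs
y_hat = linear.predict(u_new) + nonlinear.predict(u_new) # parallel combination
print(y_hat.round(3))
```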