222 research outputs found

    Heuristic modeling of macromolecule release from PLGA microspheres

    Get PDF
    Dissolution of protein macromolecules from poly(lactic-co-glycolic acid) (PLGA) particles is a complex process and still not fully understood. As such, there are difficulties in obtaining a predictive model that could be of fundamental significance in design, development, and optimization for medical applications and toxicity evaluation of PLGA-based multiparticulate dosage form. In the present study, two models with comparable goodness of fit were proposed for the prediction of the macromolecule dissolution profile from PLGA micro- and nanoparticles. In both cases, heuristic techniques, such as artificial neural networks (ANNs), feature selection, and genetic programming were employed. Feature selection provided by fscaret package and sensitivity analysis performed by ANNs reduced the original input vector from a total of 300 input variables to 21, 17, 16, and eleven; to achieve a better insight into generalization error, two cut-off points for every method was proposed. The best ANNs model results were obtained by monotone multi-layer perceptron neural network (MON-MLP) networks with a root-mean-square error (RMSE) of 15.4, and the input vector consisted of eleven inputs. The complicated classical equation derived from a database consisting of 17 inputs was able to yield a better generalization error (RMSE) of 14.3. The equation was characterized by four parameters, thus feasible (applicable) to standard nonlinear regression techniques. Heuristic modeling led to the ANN model describing macromolecules release profiles from PLGA microspheres with good predictive efficiency. Moreover genetic programming technique resulted in classical equation with comparable predictability to the ANN model

    Dimensionality reduction, and function approximation of poly (lactic-co-glycolic acid) micro-and nanoparticle dissolution rate

    Get PDF
    Prediction of poly(lactic-co-glycolic acid) (PLGA) micro- and nanoparticles’ dissolution rates plays a significant role in pharmaceutical and medical industries. The prediction of PLGA dissolution rate is crucial for drug manufacturing. Therefore, a model that predicts the PLGA dissolution rate could be beneficial. PLGA dissolution is influenced by numerous factors (features), and counting the known features leads to a dataset with 300 features. This large number of features and high redundancy within the dataset makes the prediction task very difficult and inaccurate. In this study, dimensionality reduction techniques were applied in order to simplify the task and eliminate irrelevant and redundant features. A heterogeneous pool of several regression algorithms were independently tested and evaluated. In addition, several ensemble methods were tested in order to improve the accuracy of prediction. The empirical results revealed that the proposed evolutionary weighted ensemble method offered the lowest margin of error and significantly outperformed the individual algorithms and the other ensemble techniques

    Dimensionality reduction, and function approximation of poly(lactic-co-glycolic acid) micro- and nanoparticle dissolution rate

    Get PDF
    Prediction of poly(lactic-co-glycolic acid) (PLGA) micro- and nanoparticles’ dissolution rates plays a significant role in pharmaceutical and medical industries. The prediction of PLGA dissolution rate is crucial for drug manufacturing. Therefore, a model that predicts the PLGA dissolution rate could be beneficial. PLGA dissolution is influenced by numerous factors (features), and counting the known features leads to a dataset with 300 features. This large number of features and high redundancy within the dataset makes the prediction task very difficult and inaccurate. In this study, dimensionality reduction techniques were applied in order to simplify the task and eliminate irrelevant and redundant features. A heterogeneous pool of several regression algorithms were independently tested and evaluated. In addition, several ensemble methods were tested in order to improve the accuracy of prediction. The empirical results revealed that the proposed evolutionary weighted ensemble method offered the lowest margin of error and significantly outperformed the individual algorithms and the other ensemble techniques.Web of Science101129111

    Improvement of sample classification and metabolite profiling in 1H-NMR by a machine learning-based modelling of signal parameters

    Get PDF
    RMN és una plataforma analítica utilitzada per quantificar els metabòlits presents en les mostres de metabolòmica. Els espectres de 1H-RMN mostren múltiples senyals de metabòlits amb tres paràmetres específics (desplaçament químic, ample mitjà de banda, intensitat) que poden mostrar reactivitat a les condicions de la mostra. Aquesta reactivitat perjudica l'optimització del fitat dels espectres necessari per a realitzar el perfilat automàtic de metabòlits de les mostres. L'objectiu d'aquesta tesi va ser l'exploració de l'ús de tècniques de tendència basades en Machine Learning (ML) amb l'ús de fluxos de treball robustos per modelar i explotar la informació present en els diferents paràmetres de senyal durant el perfilat de metabòlits dels conjunts de dades 1H-NMR. En particular, les aplicacions considerades van ser la millora de la classificació de les mostres en els estudis de metabolòmica i la millora de la qualitat del perfilat automàtic. A més d'assolir aquests objectius, també es van obtenir èxits addicionals (per exemple, la generació d'una nova eina de codi obert capaç de resoldre els reptes en l'elaboració de perfils de matrius complexes).RMN es una plataforma analítica utilizada para cuantificar los metabolitos presentes en las muestras de metabolómica. Los espectros de 1H-RMN muestran múltiples señales de metabolitos con tres parámetros específicos (desplazamiento químico, ancho medio de banda, intensidad) que pueden mostrar reactividad a las condiciones de la muestra. Esta reactividad perjudica a la optimización del fitado de los espectros necesario para realizar el perfilado automático de metabolitos de las muestras. El objetivo de esta tesis fue la exploración del uso de técnicas de tendencia basadas en Machine Learning (ML) con el uso de flujos de trabajo robustos para modelar y explotar la información presente en los diferentes parámetros de señal durante el perfilado de metabolitos de los conjuntos de datos 1H-NMR. En particular, las aplicaciones consideradas fueron la mejora de la clasificación de las muestras en los estudios de metabolómica y la mejora de la calidad del perfilado automático. Además de lograr estos objetivos, también se obtuvieron logros adicionales (por ejemplo, la generación de una nueva herramienta de código abierto capaz de resolver los retos en la elaboración de perfiles de matrices complejas).NMR is an analytical platform used to quantify the metabolites present in metabolomics samples. 1H-NMR spectra show multiple metabolite signals, each one with three parameters (chemical shift, half bandwidth, intensity) which can show reactivity to the sample conditions. This reactivity is a challenge for the optimization of the lineshape fitting of spectra necessary to perform the automatic metabolite profiling of samples. The aim of this PhD thesis was the exploration of the use of trending machine learning (ML)-based techniques and of robust ML-based workflows to model and then exploit the information present in the different parameters collected for each signal during the metabolite profiling of 1H-NMR datasets. In particular, the applications considered were the enhanced classification of samples in metabolomics studies and the enhancement of the quality of automatic profiling in 1H-NMR datasets. in addition to the achievement of these goals, additional achievements (e.g., the generation of a new open-source tool able to solve challenges in the profiling of complex matrices) was also fulfilled

    Multiobjective programming for type-2 hierarchical fuzzy inference trees

    Get PDF
    This paper proposes a design of hierarchical fuzzy inference tree (HFIT). An HFIT produces an optimum tree-like structure. Specifically, a natural hierarchical structure that accommodates simplicity by combining several low-dimensional fuzzy inference systems (FISs). Such a natural hierarchical structure provides a high degree of approximation accuracy. The construction of HFIT takes place in two phases. Firstly, a nondominated sorting based multiobjective genetic programming (MOGP) is applied to obtain a simple tree structure (low model’s complexity) with a high accuracy. Secondly, the differential evolution algorithm is applied to optimize the obtained tree’s parameters. In the obtained tree, each node has a different input’s combination, where the evolutionary process governs the input’s combination. Hence, HFIT nodes are heterogeneous in nature, which leads to a high diversity among the rules generated by the HFIT. Additionally, the HFIT provides an automatic feature selection because it uses MOGP for the tree’s structural optimization that accept inputs only relevant to the knowledge contained in data. The HFIT was studied in the context of both type-1 and type-2 FISs, and its performance was evaluated through six application problems. Moreover, the proposed multiobjective HFIT was compared both theoretically and empirically with recently proposed FISs methods from the literature, such as McIT2FIS, TSCIT2FNN, SIT2FNN, RIT2FNS-WB, eT2FIS, MRIT2NFS, IT2FNN-SVR, etc. From the obtained results, it was found that the HFIT provided less complex and highly accurate models compared to the models produced by most of the other methods. Hence, the proposed HFIT is an efficient and competitive alternative to the other FISs for function approximation and feature selectio

    Discretization approach

    Get PDF
    학위논문(박사)--서울대학교 대학원 :공과대학 화학생물공학부(에너지환경 화학융합기술전공),2019. 8. 이원보.In recent years, many researchers in chemical engineering have made great efforts to develop mathematical models on the theoretical side that are consistent with experimental results. Despite these efforts, however, establishing models for a system with complex phenomena such as multiphase flow or stirred reactors is still considered to be a challenge. In the meantime, an increase in computational efficiency and stability in various numerical methods has allowed us to correctly solve and analyze the system based on the fundamental equations. This leads to the need for a mathematical model to accurately predict the behavior of systems in which there is interdependence among the internal elements. A methodology for building a model based on equations that represent fundamental phenomena can lower technical barriers in system analysis. In this thesis, we propose three mathematical models validated from laboratory or pilot-scale experiments. First, an apparatus for vaporizing liquid natural gas is surrounded with a frost layer formed on the surface during operation, and performance of the apparatus is gradually deteriorated due to the adiabatic effect. Because the system uses ambient air as a heat sink, it is necessary to consider the phase transition and mass transfer of water vapor, and natural gas in the air in order to understand the fluctuation of system characteristics. The model predicts the experimental data of a pilot-scale vaporizer within a mean absolute error of 5.5 %. In addition, we suggest the robust design methodology and optimal design which is able to maintain the efficiency under the weather conditions for a year. Two or more data analysis techniques including discrete waveform transformation and k-means clustering are used to extract features that can represent time series data. Under the settings, the performance in the optimized desgin is improved by 22.92 percentage points compared to that in the conventional system. In the second system, the continuous tubular crystallization reactor has advantages in terms of production capacity and scale-up compared with the conventional batch reactor. However, the tubular system requires a well-designed control system to maintain its stability and durability, and thus; there is a great deal of demand for the mathematical model of this system. We were able to estimate crystal size distribution by considering the population balance model simultaneously with several heat exchanger models. The model constructed based on the first principle reaction scheme successfully predicted the results from the full-factorial experiment. The experiments were conducted with LAM (L-asparagine monohydrate) solution. In the prediction, the average crystal length and standard deviation were within 20% of the results of an experiment where the crystals were not iteratively dissolved in the liquid but maintained a low-level supersaturation. Furthermore, to confirm the controllability of the crystal size distribution in the system, we replaced the LAM solution with HEWL (Hen-egg white lysozyme) solution. Finally, we propose a multi-compartment model to predict the behavior of a high-pressure autoclave reactor for polymer production. In order to simulate a complex polymer synthesis mechanism, the rotation effect of impellers in the reactor on polymerization and the influence caused by polymerization heat were sequentially evaluated. As a result, This model turned out to be able to predict the physical properties of the polymers produced in an industrial-scale reactor within 7% accuracy. In this thesis, all three systems are distributed parameter systems which can be expressed as partial differential equations for time and space. To construct a high order model, the system was interpreted based on discretization approach under minimal assumptions. This methodology can be applied not only to the systems suggested in this thesis but also to those consisting of interpdependent variables. I hope that this thesis provides guidance for further researches of chemical engineering in nearby future.최근에 몇 년에 걸쳐서 많은 연구자들이 이론을 기반으로 실험 결과와 일치하는 수학 모델을 개발하고자 많은 노력을 기울여 왔다. 하지만 이런 노력에도 불구하고 다상 흐름 혹은 교반 반응기와 같은 복잡한 현상을 내포한 시스템을 위한 모델을 수립하는 것은 여전히 화학 공학 분야에서 쉽지 않은 일로 여겨진다. 이 와중에 다양한 수치적 방법에서의 계산 효율의 증가와 안정성의 향상은 기본방정식에 기초한 시스템을 정확하게 해결하고 분석할 수 있게 해주었다. 이로 인하여 내부 요소들 간의 상호 의존성이 존재하는 시스템의 거동을 정확하게 예측하기 위한 수학적 모델의 필요성이 부각되었다. 기본 현상들을 표현할 수 있는 방정식들을 기반으로 모델을 구축하기 위한 방법론은 시스템 해석에 있어서 기술적 장벽을 낮출 수 있다. 이 학위 논문에서 우리는 실험실 또는 파일럿 규모의 실험으로부터 입증된 세 가지 수학적 모델을 제안한다. 첫 번째로, 공기를 사용하여 액상의 천연가스를 기화시키는 장치는 운전 도중에 기화기 표면에 서리 층이 형성되고 그로 인한 단열 효과로 장비의 성능이 서서히 저하된다. 시스템은 주변 공기를 열 흡수원으로 사용하기 때문에 시스템 특성의 변동을 파악하기 위해서는 공기 중 수증기 및 천연 가스의 상전이 및 전달 현상을 동시에 고려하여야 한다. 제시된 수학적 모델에 의해 예측한 결과는 파일럿 규모 기화기로부터 얻은 실험 데이터와 5.5% 평균 절대 오차를 보였다. 이에 더하여, 앞에서 제시한 기화기 모델을 이용하여 1년 동안의 기상 조건에서 운전 효율을 유지하면서 지속 운전이 가능한 기화기의 설계 방법과 결과를 제안하였다. 이산 파형 변환과 k-평균 군집화를 포함하는 두 가지 이상의 데이터 분석 기법을 사용하여 시계열 데이터를 대표할 수 있는 특징을 추출한다. 추출된 특징 아래에서 최적화된 디자인은 기존 제시된 안에 비해 22.92% 만큼 향상된 성능을 보여주었다. 두 번째 시스템은 신 제약 기술 공정인 연속 관형 결정화 반응기는 기존에 널리 쓰이던 회분식 반응기에 비하여 생산 속도 및 스케일 업 측면에서 장점이 많다. 하지만 제어기술이 기반이 되어야한다는 점에 있어서 그 도입이 늦어졌고 이에 따라 자연스럽게 개발된 모델 또한 전무하다. 우리는 이 장치에서 결정 크기 분포를 추산하기 위한 인구 균형 모델을 열 교환 모델과 동시에 고려하여 결정 크기 분포를 추산할 수 있었다. 제 1원리 결정 반응식을 기반으로 구축된 모델은 완전 요인 배치법을 기반으로 실험된 데이터를 성공적으로 예측하였다. 결정이 액상에 용해되지 않으면서 낮은 수준의 과포화 상태를 유지한 실험에 대해서는 평균 결정 길이와 표준편차가 실험 결과와 20% 이내의 오차를 보였다. 앞에서 모델의 검증에 사용된 데이터가 LAM (L-아스파라긴 일 수화물)용액으로부터 얻어진 것이었다면 이후에는 HEWL (달걀 흰자 리소자임)를 사용하여 제품의 결정 크기 분포의 조절 가능성을 보였다. 마지막으로 폴리머 생산을 위한 고압 오토클레이브 반응기의 거동을 예측하기 위한 다중 구획 모델을 제안하였다. 복잡한 고분자 합성 메커니즘을 모사하기 위하여 반응기 내 임펠러의 회전이 중합에 미치는 효과와 중합 열로 인한 영향력을 순차적으로 평가하였다. 제안된 모델은 3D 구조를 가진 산업화된 반응기에서 생산된 두 가지 고분자의 물성을 7%이내 정확도로 예측할 수 있다. 본 학위논문에서는 다루는 시스템은 모두 분포 정수계 시스템으로 시간과 공간에 대하여 편미분방정식으로 표현할 수 있다. 고차 모델을 구축하기 위해 이산화 접근법을 기반으로 최소한의 가정 하에 시스템을 해석하였다. 이는 논문에 제시한 시스템 뿐만 아니라 시공간에서 예측 어려운 분포를 가지는 변수를 가진 모든 시스템에 대하여 적용이 가능하다. 이 논문이 앞으로 화학 공학 분야의 시스템을 해석하는 데 있어서 더 발전된 연구를 위한 지침서가 되기를 희망한다.Abstract i Contents iv List of Figures viii List of Tables xii Chapter 1 1 Introduction 1 1.1 Research motivation 1 1.2 Research objective 3 1.3 Outline of the thesis 4 1.4 Associated publications 9 Chapter 2 10 Distributed parameter system 10 2.1 Introduction 10 2.2 Modeling methods 11 2.3 Conclusion 16 Chapter 3 17 Modeling and design of pilot-scale ambient air vaporizer 17 3.1 Introduction 17 3.2 Modeling and analysis of frost growth in pilot-scale ambient air vaporizer 24 3.2.1 Ambient air vaporizer 24 3.2.2 Experimental measurement 27 3.2.3 Numerical model of the vaporizer 31 3.2.4 Result and discussion 43 3.3 Robust design of ambient air vaporizer based on time-series clustering 58 3.3.1 Background 58 3.3.2 Trend of time-series weather conditions 61 3.3.3 Optimization of AAV structures under time-series weather conditions 63 3.3.4 Results and discussion 76 3.4 Conclusion 93 3.4.1 Modeling and analysis of AAV system 93 3.4.2 Robust design of AAV system 95 Chapter 4 97 Tunable protein crystal size distribution via continuous crystallization 97 4.1 Introduction 97 4.2 Mathematical modeling and experimental verification of fully automated continuous slug-flow crystallizer 101 4.2.1 Experimental methods and equipment setup 101 4.2.2 Mathematical model of crystallizer 109 4.2.3 Results and discussion 118 4.3 Continuous crystallization of a protein: hen egg white lysozyme (HEWL) 132 4.3.1 Introduction 132 4.3.2 Experimental method 135 4.3.3 Results and discussion 145 4.4 Conclusion 164 4.4.1 Mathematical model of continuous crystallizer 164 4.4.2 Tunable continuous protein crystallization process 165 Chapter 5 167 Multi-compartment model of high-pressure autoclave reactor for polymer production: combined CFD mixing model and kinetics of polymerization 167 5.1 Introduction 167 5.2 Method 170 5.2.1 EVA autoclave reactor 170 5.2.2 Multi-compartment model of the autoclave reactor 173 5.2.3 CFD simulation of mixing effects in the autoclave reactor 175 5.2.4 Region-based dividing algorithm 178 5.2.5 Polymerization kinetic model 182 5.3 Results and discussion 191 5.4 Conclusion 203 5.5 Appendix 205 Chapter 6 210 Concluding Remarks 210 6.1 Summary of contributions 210 6.2 Future work 211 Appendix 214 Acknowledgment and collaboration declaration 214 Supplementary materials 217 Reference 227 Abstract in Korean (국문초록) 249Docto

    Principal Component Analysis

    Get PDF
    This book is aimed at raising awareness of researchers, scientists and engineers on the benefits of Principal Component Analysis (PCA) in data analysis. In this book, the reader will find the applications of PCA in fields such as taxonomy, biology, pharmacy,finance, agriculture, ecology, health and architecture

    Uncovering Genes with Divergent mRNA-Protein Dynamics in Streptomyces coelicolor

    Get PDF
    Many biological processes are intrinsically dynamic, incurring profound changes at both molecular and physiological levels. Systems analyses of such processes incorporating large-scale transcriptome or proteome profiling can be quite revealing. Although consistency between mRNA and proteins is often implicitly assumed in many studies, examples of divergent trends are frequently observed. Here, we present a comparative transcriptome and proteome analysis of growth and stationary phase adaptation in Streptomyces coelicolor, taking the time-dynamics of process into consideration. These processes are of immense interest in microbiology as they pertain to the physiological transformations eliciting biosynthesis of many naturally occurring therapeutic agents. A shotgun proteomics approach based on mass spectrometric analysis of isobaric stable isotope labeled peptides (iTRAQ™) enabled identification and rapid quantification of approximately 14% of the theoretical proteome of S. coelicolor. Independent principal component analyses of this and DNA microarray-derived transcriptome data revealed that the prominent patterns in both protein and mRNA domains are surprisingly well correlated. Despite this overall correlation, by employing a systematic concordance analysis, we estimated that over 30% of the analyzed genes likely exhibited significantly divergent patterns, of which nearly one-third displayed even opposing trends. Integrating this data with biological information, we discovered that certain groups of functionally related genes exhibit mRNA-protein discordance in a similar fashion. Our observations suggest that differences between mRNA and protein synthesis/degradation mechanisms are prominent in microbes while reaffirming the plausibility of such mechanisms acting in a concerted fashion at a protein complex or sub-pathway level
    corecore