1,830 research outputs found
Deep generative models for network data synthesis and monitoring
Measurement and monitoring are fundamental tasks in all networks, enabling the down-stream management and optimization of the network.
Although networks inherently
have abundant amounts of monitoring data, its access and effective measurement is
another story. The challenges exist in many aspects. First, the inaccessibility of network monitoring data for external users, and it is hard to provide a high-fidelity dataset
without leaking commercial sensitive information. Second, it could be very expensive
to carry out effective data collection to cover a large-scale network system, considering the size of network growing, i.e., cell number of radio network and the number of
flows in the Internet Service Provider (ISP) network. Third, it is difficult to ensure fidelity and efficiency simultaneously in network monitoring, as the available resources
in the network element that can be applied to support the measurement function are
too limited to implement sophisticated mechanisms. Finally, understanding and explaining the behavior of the network becomes challenging due to its size and complex
structure. Various emerging optimization-based solutions (e.g., compressive sensing)
or data-driven solutions (e.g. deep learning) have been proposed for the aforementioned challenges. However, the fidelity and efficiency of existing methods cannot yet
meet the current network requirements.
The contributions made in this thesis significantly advance the state of the art in
the domain of network measurement and monitoring techniques. Overall, we leverage
cutting-edge machine learning technology, deep generative modeling, throughout the
entire thesis. First, we design and realize APPSHOT , an efficient city-scale network
traffic sharing with a conditional generative model, which only requires open-source
contextual data during inference (e.g., land use information and population distribution). Second, we develop an efficient drive testing system â GENDT, based on generative model, which combines graph neural networks, conditional generation, and quantified model uncertainty to enhance the efficiency of mobile drive testing. Third, we
design and implement DISTILGAN, a high-fidelity, efficient, versatile, and real-time
network telemetry system with latent GANs and spectral-temporal networks. Finally,
we propose SPOTLIGHT , an accurate, explainable, and efficient anomaly detection system of the Open RAN (Radio Access Network) system. The lessons learned through
this research are summarized, and interesting topics are discussed for future work in
this domain. All proposed solutions have been evaluated with real-world datasets and
applied to support different applications in real systems
The Divided Self: Internal Conflict in Literature, Philosophy, Psychology, and Neuroscience
This thematic project examines the notion of self-division, particularly in terms of the conflict between cognition and metacognition, across the fields of philosophy, psychology, and, most recently, the cognitive and neurosciences. The project offers a historic overview of models of self-division, as well as analyses of the various problems presented in theoretical models to date. This work explores how self-division has been depicted in the literary works of Edgar Allan Poe, Don DeLillo, and Mary Shelley. It examines the ways in which artistic renderings alternately assimilate, resist, and/or critique dominant philosophical, psychological, and scientific discourses about the self and its divisions. This dissertation argues that the internal conflict portrayed by the writers of these literary characters is conscious: it is the conflict of the metacognitive âIâ against akratic impulses, unwanted cognitions, and, ultimately, consciousness as a whole
Machine learning applications in search algorithms for gravitational waves from compact binary mergers
Gravitational waves from compact binary mergers are now routinely observed by Earth-bound detectors. These observations enable exciting new science, as they have opened a new window to the Universe.
However, extracting gravitational-wave signals from the noisy detector data is a challenging problem. The most sensitive search algorithms for compact binary mergers use matched filtering, an algorithm that compares the data with a set of expected template signals. As detectors are upgraded and more sophisticated signal models become available, the number of required templates will increase, which can make some sources computationally prohibitive to search for. The computational cost is of particular concern when low-latency alerts should be issued to maximize the time for electromagnetic follow-up observations. One potential solution to reduce computational requirements that has started to be explored in the last decade is machine learning. However, different proposed deep learning searches target varying parameter spaces and use metrics that are not always comparable to existing literature. Consequently, a clear picture of the capabilities of machine learning searches has been sorely missing.
In this thesis, we closely examine the sensitivity of various deep learning gravitational-wave search algorithms and introduce new methods to detect signals from binary black hole and binary neutron star mergers at previously untested statistical confidence levels. By using the sensitive distance as our core metric, we allow for a direct comparison of our algorithms to state-of-the-art search pipelines. As part of this thesis, we organized a global mock data challenge to create a benchmark for machine learning search algorithms targeting compact binaries. This way, the tools developed in this thesis are made available to the greater community by publishing them as open source software.
Our studies show that, depending on the parameter space, deep learning gravitational-wave search algorithms are already competitive with current production search pipelines. We also find that strategies developed for traditional searches can be effectively adapted to their machine learning counterparts. In regions where matched filtering becomes computationally expensive, available deep learning algorithms are also limited in their capability. We find reduced sensitivity to long duration signals compared to the excellent results for short-duration binary black hole signals
Uncertainty Quantification in Machine Learning for Engineering Design and Health Prognostics: A Tutorial
On top of machine learning models, uncertainty quantification (UQ) functions
as an essential layer of safety assurance that could lead to more principled
decision making by enabling sound risk assessment and management. The safety
and reliability improvement of ML models empowered by UQ has the potential to
significantly facilitate the broad adoption of ML solutions in high-stakes
decision settings, such as healthcare, manufacturing, and aviation, to name a
few. In this tutorial, we aim to provide a holistic lens on emerging UQ methods
for ML models with a particular focus on neural networks and the applications
of these UQ methods in tackling engineering design as well as prognostics and
health management problems. Toward this goal, we start with a comprehensive
classification of uncertainty types, sources, and causes pertaining to UQ of ML
models. Next, we provide a tutorial-style description of several
state-of-the-art UQ methods: Gaussian process regression, Bayesian neural
network, neural network ensemble, and deterministic UQ methods focusing on
spectral-normalized neural Gaussian process. Established upon the mathematical
formulations, we subsequently examine the soundness of these UQ methods
quantitatively and qualitatively (by a toy regression example) to examine their
strengths and shortcomings from different dimensions. Then, we review
quantitative metrics commonly used to assess the quality of predictive
uncertainty in classification and regression problems. Afterward, we discuss
the increasingly important role of UQ of ML models in solving challenging
problems in engineering design and health prognostics. Two case studies with
source codes available on GitHub are used to demonstrate these UQ methods and
compare their performance in the life prediction of lithium-ion batteries at
the early stage and the remaining useful life prediction of turbofan engines
Recommended from our members
Policy options for food system transformation in Africa and the role of science, technology and innovation
As recognized by the Science, Technology and Innovation Strategy for Africa â 2024 (STISA-2024), science, technology and innovation (STI) offer many opportunities for addressing the main constraints to embracing transformation in Africa, while important lessons can be learned from successful interventions, including policy and institutional innovations, from those African countries that have already made significant progress towards food system transformation. This chapter identifies opportunities for African countries and the region to take proactive steps to harness the potential of the food and agriculture sector so as to ensure future food and nutrition security by applying STI solutions and by drawing on transformational policy and institutional innovations across the continent. Potential game-changing solutions and innovations for food system transformation serving people and ecology apply to (a) raising production efficiency and restoring and sustainably managing degraded resources; (b) finding innovation in the storage, processing and packaging of foods; (c) improving human nutrition and health; (d) addressing equity and vulnerability at the community and ecosystem levels; and (e) establishing preparedness and accountability systems. To be effective in these areas will require institutional coordination; clear, food safety and health-conscious regulatory environments; greater and timely access to information; and transparent monitoring and accountability systems
Singularity Formation in the High-Dimensional Euler Equations and Sampling of High-Dimensional Distributions by Deep Generative Networks
High dimensionality brings both opportunities and challenges to the study of applied mathematics. This thesis consists of two parts. The first part explores the singularity formation of the axisymmetric incompressible Euler equations with no swirl in ââż, which is closely related to the Millennium Prize Problem on the global singularity of the Navier-Stokes equations. In this part, the high dimensionality contributes to the singularity formation in finite time by enhancing the strength of the vortex stretching term. The second part focuses on sampling from a high-dimensional distribution using deep generative networks, which has wide applications in the Bayesian inverse problem and the image synthesis task. The high dimensionality in this part becomes a significant challenge to the numerical algorithms, known as the curse of dimensionality.
In the first part of this thesis, we consider the singularity formation in two scenarios. In the first scenario, for the axisymmetric Euler equations with no swirl, we consider the case when the initial condition for the angular vorticity is Cα Hölder continuous. We provide convincing numerical examples where the solutions develop potential self-similar blow-up in finite time when the Hölder exponent α < α*, and this upper bound α* can asymptotically approach 1 - 2/n. This result supports a conjecture from Drivas and Elgindi [37], and generalizes it to the high-dimensional case. This potential blow-up is insensitive to the perturbation of initial data. Based on assumptions summarized from numerical experiments, we study a limiting case of the Euler equations, and obtain α* = 1 - 2/n which agrees with the numerical result. For the general case, we propose a relatively simple one-dimensional model and numerically verify its approximation to the Euler equations. This one-dimensional model might suggest a possible way to show this finite-time blow-up scenario analytically. Compared to the first proved blow-up result of the 3D axisymmetric Euler equations with no swirl and Hölder continuous initial data by Elgindi in [40], our potential blow-up scenario has completely different scaling behavior and regularity of the initial condition. In the second scenario, we consider using smooth initial data, but modify the Euler equations by adding a factor Δ as the coefficient of the convection terms to weaken the convection effect. The new model is called the weak convection model. We provide convincing numerical examples of the weak convection model where the solutions develop potential self-similar blow-up in finite time when the convection strength Δ < Δ*, and this upper bound Δ* should be close to 1 - 2/n. This result is closely related to the infinite-dimensional case of an open question [37] stated by Drivas and Elgindi. Our numerical observations also inspire us to approximate the weak convection model with a one-dimensional model. We give a rigorous proof that the one-dimensional model will develop finite-time blow-up if Δ < 1 - 2/n, and study the approximation quality of the one-dimensional model to the weak convection model numerically, which could be beneficial to a rigorous proof of the potential finite-time blow-up.
In the second part of the thesis, we propose the Multiscale Invertible Generative Network (MsIGN) to sample from high-dimensional distributions by exploring the low-dimensional structure in the target distribution. The MsIGN models a transport map from a known reference distribution to the target distribution, and thus is very efficient in generating uncorrelated samples compared to MCMC-type methods. The MsIGN captures multiple modes in the target distribution by generating new samples hierarchically from a coarse scale to a fine scale with the help of a novel prior conditioning layer. The hierarchical structure of the MsIGN also allows training in a coarse-to-fine scale manner. The Jeffreys divergence is used as the objective function in training to avoid mode collapse. Importance sampling based on the prior conditioning layer is leveraged to estimate the Jeffreys divergence, which is intractable in previous deep generative networks. Numerically, when applied to two Bayesian inverse problems, the MsIGN clearly captures multiple modes in the high-dimensional posterior and approximates the posterior accurately, demonstrating its superior performance compared with previous methods. We also provide an ablation study to show the necessity of our proposed network architecture and training algorithm for the good numerical performance. Moreover, we also apply the MsIGN to the image synthesis task, where it achieves superior performance in terms of bits-per-dimension value over other flow-based generative models and yields very good interpretability of its neurons in intermediate layers.</p
Geoarchaeological Investigations of Late Pleistocene Physical Environments and Impacts of Prehistoric Foragers on the Ecosystem in Northern Malawi and Austria
A growing body of research shows that not only did environmental changes play an important role in human evolution, but humans in turn have impacted ecosystems and landscape evolution since the Late Pleistocene. This thesis presents collaborative work on Late Pleistocene open-air sites in the Karonga District of northern Malawi, in which new aspects of forager behavior came to light through the reconstruction of physical environments. My work has helped recognize that late Middle Stone Age (MSA) activity and tool production occurred in locally more open riparian environments within evergreen gallery forest, surrounded by a regional vegetation dominated by miombo woodlands and savanna. Additionally, MSA hunter-gatherers exploited the confluence of river and wetland areas along the shores of Lake Malawi, which likely served as important corridors for the dispersal of biota. By comparing data from the archaeological investigations with lake core records, we were able to identify effects of anthropogenic burning on vegetation structures and sedimentation in the region as early as 80 thousand years ago. These findings not only proved it possible to uncover early impacts of human activity on the ecosystem, but also emphasize the importance of fire in the lives of early foragers.
Publications contained within this dissertation:
A. Wright, D.K., Thompson, J.C., Schilt, F.C., Cohen, A., Choi, J-H., Mercader, J., Nightingale, S., Miller, C.E., Mentzer, S.M., Walde, D., Welling, M., and Gomani-Chindebvu, E. âApproaches to Middle Stone Age landscape archaeology in tropical Africaâ. Special issue Geoarchaeology of the Tropics of Journal of Archaeological Science 77:64-77. http://dx.doi.org/10.1016/j.jas.2016.01.014
B. Schilt, F.C., Verpoorte, A., Antl, W. âMicromorphology of an Upper Paleolithic cultural layer at Grub-Kranawetberg, Austriaâ. Journal of Archaeological Science: Reports 14:152-162. http://dx.doi.org/10.1016/j.jasrep.2017.05.041
C. Nightingale, S., Schilt, F.C., Thompson, J.C., Wright, D.K., Forman, S., Mercader, J., Moss, P., Clarke, S. Itambu, M., Gomani-Chindebvu, E., Welling, M. Late Middle Stone Age Behavior and Environments at Chaminade I (Karonga, Malawi). Journal of Paleolithic Archaeology 2-3:258-397. https://doi.org/10.1007/s41982-019-00035-3
D. Thompson, J.C.*, Wright, D.K.*, Ivory, S.J.*, Choi, J-H., Nightingale, S., Mackay, A., Schilt, F.C., OtĂĄrola-Castillo, E., Mercader, J., Forman, S.L., Pietsch, T., Cohen, A.S., Arrowsmith, J.R., Welling, M., Davis, J., Schiery, B., Kaliba, P., Malijani, O., Blome, M.W., OâDriscoll, C., Mentzer, S.M., Miller, C., Heo, S., Choi, J., Tembo, J., Mapemba, F., Simengwa, D., and Gomani-Chindebvu, E. âEarly human impacts and ecosystem reorganization in southern-central Africaâ. Science Advances 7(19): eabf9776. *equal contribution https://doi.org/10.1126/sciadv.abf9776
E. Schilt, F.C., Miller, C.M., Wright, D.K., Mentzer, S.M., Mercader, J., Moss, Choi, J.-H., Siljedal, G., Clarke, S., Mwambwiga, A., Thomas, K., Barbieri, A., Kaliba, P., Gomani-Chindebvu, E., Thompson, J.C. âHunter-gatherer environments at the Late Pleistocene sites of Bruce and MwangandaÂŽs Village, northern Malawiâ. Quaternary Science Reviews 292: 107638. https://www.sciencedirect.com/science/article/pii/S0277379122002694 [untranslated
Machine Learning Applications in Advanced Additive Manufacturing: Process Modeling, Microstructure Analysis, and Defect Detection
Non-destructive evaluation (NDE) techniques are critical for assessing the integrity, health, and mechanical properties of materials manufactured from various methods. High fidelity NDE techniques are essential for quality control but often lead to massive data generation. Such a vast data load cannot be manually processed, this leads to a severe bottleneck for process engineers. Machine learning (ML) offers a solution to this problem by providing powerful and adaptable algorithms capable of learning patterns, identifying features, and finding hidden relationships in large sets of data. Various ML models are used in this work to improve predictions, improve measurements, detect anomalies, classify anomalies, segment images, determine material health, and directly model behavior. These neural network or ML models are implemented to perform these tasks by utilizing data gathered through various NDE techniques. Additive manufacturing enables the production of complex geometries and customized parts with reduced waste and lead times. The development of new material printing capability and techniques is necessary to expand its capabilities to produce high performance parts with unique properties and functionality. Contributions to advanced additive manufacturing are made via the application of customized machine learning algorithms in this work. The development of a novel grain image generation method was completed to improve grain and grain boundary image segmentation methods on microstructure images. Convolutional Neural Networks (CNNs) were also applied to datasets of Stainless Steel Powder to help identify, qualify, and classify the health of the powder prior to print application. A feasibility study of the implementation of Binder Jetting (BJT) is conducted on Martian and Lunar regolith using a simplistic binder in this work. The need for efficient techniques to process data gathered from NDE methods is crucial to enhance the accuracy, efficiency, and speed of the analysis of this data. This will lead to faster development and implementation of advanced manufacturing techniques
Biofidelic simulations of embryonic joint growth and morphogenesis
During skeletal development, the opposing surfaces in the joint mould into interlocking and reciprocal shapes in a process called morphogenesis. Morphogenesis is critical to the health and function of the joint, and yet, little is known about the process of joint morphogenesis. For example, how do different joints acquire their specific shapes? Which cellular processes underlie joint shaping and how are they regulated? However, it is known that fetal movements are critical to joint development, with alterations or absences of movement being implicated in multiple pre- and post-natal musculoskeletal conditions. This doctorate explored the cell-level dynamics governing joint growth and the implication of movements in regulating them, using novel biofidelic and mechanobiological models of joint growth.
Cell-level data from wild type zebrafish larvae were tracked and synthesised in a biofidelic simulation of zebrafish jaw joint growth. Growth characteristics were quantified revealing a strong anisotropy (Chapter 3). Next, zebrafish larvae were immobilised using drug treatment. The material properties of the zebrafish jaw cartilage were measured using nano-indentation in the presence or absence of movement showing a delay in cartilage stiffening in immobilised larvae (Chapter 4). Then, I developed a novel mechanobiological model of zebrafish jaw joint growth, which identified a correlation between growth characteristics and the dynamic patterns of mechanical stimuli experienced by joint elements over jaw motion (Chapter 5). Finally, local growth rates were characterised in the mouse elbow in the presence or absence of skeletal muscles. Spatial heterogeneity in the growth rates correlated with the emergence of specific shape features at the level of the condyles. Immobilisation led to disruption of the local growth rates correlated with failed shape differentiation of the condyles. The relative contribution of key cell activities to growth such as cell volume expansion, cell number increases and extracellular matrix expansion, were shown to vary over time in both wild types and muscleless-limbs and to be altered in the absence of skeletal muscles (Chapter 6).
This research offers avenues for improvement in simulations of joint development and potentially other organs. It provides fundamental advance in our understanding of mechanoregulation in the developing joint and increases our understanding of the origins of musculoskeletal abnormalities.Open Acces
- âŠ