467 research outputs found
Optimisation and information-theoretic principles in multiplex networks.
PhD ThesesThe multiplex network paradigm has proven very helpful in the study of many real-world
complex systems, by allowing to retain full information about all the different possible kinds of
relationships among the elements of a system. As a result, new non-trivial structural patterns
have been found in diverse multi-dimensional networked systems, from transportation networks to
the human brain. However, the analysis of multiplex structural and dynamical properties often
requires more sophisticated algorithms and takes longer time to run compared to traditional
single network methods. As a consequence, relying on a multiplex formulation should be the
outcome of a trade-off between the level of information and the resources required to store it.
In the first part of the thesis, we address the problem of quantifying and comparing the
amount of information contained in multiplex networks. We propose an algorithmic informationtheoretic
approach to evaluate the complexity of multiplex networks, by assessing to which extent
a given multiplex representation of a system is more informative than a single-layer graph. Then,
we demonstrate that the same measure is able to detect redundancy in a multiplex network and
to obtain meaningful lower-dimensional representations of a system. We finally show that such
method allows us to retain most of the structural complexity of the original system as well as
the salient characteristics determining the behaviour of dynamical processes happening on it.
In the second part of the thesis, we shift the focus to the modelling and analysis of some structural
features of real-world multiplex systems throughout optimisation principles. We demonstrate
that Pareto optimal principles provide remarkable tools not only to model real-world
multiplex transportation systems but also to characterise the robustness of multiplex systems
against targeted attacks in the context of optimal percolation
Finding emergence in data by maximizing effective information
Quantifying emergence and modeling emergent dynamics in a data-driven manner
for complex dynamical systems is challenging due to the lack of direct
observations at the micro-level. Thus, it's crucial to develop a framework to
identify emergent phenomena and capture emergent dynamics at the macro-level
using available data. Inspired by the theory of causal emergence (CE), this
paper introduces a machine learning framework to learn macro-dynamics in an
emergent latent space and quantify the degree of CE. The framework maximizes
effective information, resulting in a macro-dynamics model with enhanced causal
effects. Experimental results on simulated and real data demonstrate the
effectiveness of the proposed framework. It quantifies degrees of CE
effectively under various conditions and reveals distinct influences of
different noise types. It can learn a one-dimensional coarse-grained
macro-state from fMRI data, to represent complex neural activities during movie
clip viewing. Furthermore, improved generalization to different test
environments is observed across all simulation data
Backwards is the way forward: feedback in the cortical hierarchy predicts the expected future
Clark offers a powerful description of the brain as a prediction machine, which offers progress on two distinct levels. First, on an abstract conceptual level, it provides a unifying framework for perception, action, and cognition (including subdivisions such as attention, expectation, and imagination). Second, hierarchical prediction offers progress on a concrete descriptive level for testing and constraining conceptual elements and mechanisms of predictive coding models (estimation of predictions, prediction errors, and internal models)
Thirty Years of Machine Learning: The Road to Pareto-Optimal Wireless Networks
Future wireless networks have a substantial potential in terms of supporting
a broad range of complex compelling applications both in military and civilian
fields, where the users are able to enjoy high-rate, low-latency, low-cost and
reliable information services. Achieving this ambitious goal requires new radio
techniques for adaptive learning and intelligent decision making because of the
complex heterogeneous nature of the network structures and wireless services.
Machine learning (ML) algorithms have great success in supporting big data
analytics, efficient parameter estimation and interactive decision making.
Hence, in this article, we review the thirty-year history of ML by elaborating
on supervised learning, unsupervised learning, reinforcement learning and deep
learning. Furthermore, we investigate their employment in the compelling
applications of wireless networks, including heterogeneous networks (HetNets),
cognitive radios (CR), Internet of things (IoT), machine to machine networks
(M2M), and so on. This article aims for assisting the readers in clarifying the
motivation and methodology of the various ML algorithms, so as to invoke them
for hitherto unexplored services as well as scenarios of future wireless
networks.Comment: 46 pages, 22 fig
Safety quantification in gene editing experiments using machine learning on rationally designed feature spaces
With ongoing development of the CRISPR/Cas programmable nuclease system, applications in the area of \textit{in vivo} therapeutic gene editing are increasingly within reach. However, non-negligible off-target effects remain a major concern for clinical applications.
Even though a multitude of off-target cleavage datasets have been published, a comprehensive, transparent overview tool has not yet been established. The first part of this thesis presents the creation of crisprSQL (http://www.crisprsql.com), a large, diverse, interactive and bioinformatically enhanced collection of CRISPR/Cas9 off-target cleavage studies aimed at enriching the fields of cleavage profiling, gene editing safety analysis and transcriptomics.
Having established this data source, we use it to train novel deep learning algorithms and explore feature encodings for off-target prediction, systematically sampling the resulting model space in order to find optimal models and inform future modelling efforts. We lay emphasis on physically informed features which capture the biological environment of the cleavage site, hence terming our approach piCRISPR. We find that our best-performing model highlights the importance of sequence context and chromatin accessibility for cleavage prediction and compares favourably with state-of-the-art prediction performance. We further show that our novel, environmentally sensitive features are crucial to accurate prediction on sequence-identical locus pairs, making them highly relevant for clinical guide design.
We then turn our attention to the cell-intrinsic repair mechanisms that follow CRISPR/Cas-induced cleavage and provide a prediction algorithm for the outcome genotype distribution based on thermodynamic features of the DNA repair process. In a pioneering approach, we utilise structural calculations for the generation of these features and show that this novel approach surpasses published outcome prediction algorithms within our testing regime. Through interpretation of the trained model, we elucidate the thermodynamic factors driving DNA repair and provide a computational tool that allows experts to assess the severity of the genotypic changes predicted for a given edit.
Together, these efforts provide a comprehensive, one-stop computational source to assess and improve CRISPR/Cas9 gene editing safety
Generative AI-driven Semantic Communication Networks: Architecture, Technologies and Applications
Generative artificial intelligence (GAI) has emerged as a rapidly burgeoning
field demonstrating significant potential in creating diverse contents
intelligently and automatically. To support such artificial
intelligence-generated content (AIGC) services, future communication systems
should fulfill much more stringent requirements (including data rate,
throughput, latency, etc.) with limited yet precious spectrum resources. To
tackle this challenge, semantic communication (SemCom), dramatically reducing
resource consumption via extracting and transmitting semantics, has been deemed
as a revolutionary communication scheme. The advanced GAI algorithms facilitate
SemCom on sophisticated intelligence for model training, knowledge base
construction and channel adaption. Furthermore, GAI algorithms also play an
important role in the management of SemCom networks. In this survey, we first
overview the basics of GAI and SemCom as well as the synergies of the two
technologies. Especially, the GAI-driven SemCom framework is presented, where
many GAI models for information creation, SemCom-enabled information
transmission and information effectiveness for AIGC are discussed separately.
We then delve into the GAI-driven SemCom network management involving with
novel management layers, knowledge management, and resource allocation.
Finally, we envision several promising use cases, i.e., autonomous driving,
smart city, and the Metaverse for a more comprehensive exploration
Dynamic processes on networks and higher-order structures
Higher-order interactions are increasingly recognized as a critical aspect in the modeling of complex systems. Higher-order networks provide a framework for studying the relationship between the structure of higher-order interactions and the function of the complex system. However, little is known about how higher-order interactions affect dynamic processes. In this thesis, we develop general frameworks of percolation aiming at understanding the interplay between higher-order network structures and the critical properties of dynamics. We reveal that degree correlations strongly affect the percolation threshold on higher-order networks and interestingly, the effect of correlations is different on ordinary percolation and higher-order percolation. We further elucidate the mechanisms responsible for the emergence of discontinuous transitions on higher-order networks. Moreover, we show that triadic regulatory interaction, as a general type of higher-order interaction found widely in nature, can turn percolation into a fully-fledged dynamic process that exhibits period doubling and a route to chaos. As an important example of dynamic processes, we further investigate the role of network topology on epidemic spreading. We show that higher-order interactions can induce a non-linear infection kernel in a pandemic, which results in a discontinuous phase transition, hysteresis, and superexponential spreading. Finally, we propose an epidemic model to evaluate the role of automated contact-and-tracing with mobile apps as a new containment measure to mitigate a pandemic. We reveal the non-linear effect on the reduction of the incidence provided by a certain fraction of app adoption in the population and we propose the optimal strategy to mitigate the pandemic with limited resources. Altogether, the thesis provides new insights into the interplay between the topology of higher-order networks and their dynamics. The results obtained may shed light on the research in other areas of interest such as brain functions and epidemic spreading
Advancing Applications of Satellite Photogrammetry: Novel Approaches for Built-up Area Modeling and Natural Environment Monitoring using Stereo/Multi-view Satellite Image-derived 3D Data
With the development of remote sensing technology in recent decades,
spaceborne sensors with sub-meter and meter spatial resolution (Worldview and
PlanetScope) have achieved a considerable image quality to generate 3D
geospatial data via a stereo matching pipeline. These achievements have
significantly increased the data accessibility in 3D, necessitating adapting
these 3D geospatial data to analyze human and natural environments. This
dissertation explores several novel approaches based on stereo and multi-view
satellite image-derived 3D geospatial data, to deal with remote sensing
application issues for built-up area modeling and natural environment
monitoring, including building model 3D reconstruction, glacier dynamics
tracking, and lake algae monitoring. Specifically, the dissertation introduces
four parts of novel approaches that deal with the spatial and temporal
challenges with satellite-derived 3D data. The first study advances LoD-2
building modeling from satellite-derived Orthophoto and DSMs with a novel
approach employing a model-driven workflow that generates building rectangular
3D geometry models. Secondly, we further enhanced our building reconstruction
framework for dense urban areas and non-rectangular purposes, we implemented
deep learning for unit-level segmentation and introduced a gradient-based
circle reconstruction for circular buildings to develop a polygon composition
technique for advanced building LoD2 reconstruction. Our third study utilizes
high-spatiotemporal resolution PlanetScope satellite imagery for glacier
tracking at 3D level in mid-latitude regions. Finally, we proposed a term as
"Algal Behavior Function" to refine the quantification of chlorophyll-a
concentrations from satellite imagery in water quality monitoring, addressing
algae fluctuations and timing discrepancies between satellite observations and
field measurements, thus enhancing the precision of underwater algae volume
estimates. Overall, this dissertation demonstrates the extensive potential of
satellite photogrammetry applications in addressing urban and environmental
challenges. It further showcases innovative analytical methodologies that
enhance the applicability of adapting stereo and multi-view very
high-resolution satellite-derived 3D data. (See full abstract in the document)Comment: Ph.D. Dissertation, Geospatial Data Analytics Lab, The Ohio State
University, 2024, offical version is available in OhioLIN
Supervisory Control System Architecture for Advanced Small Modular Reactors
This technical report was generated as a product of the Supervisory Control for Multi-Modular SMR Plants project within the Instrumentation, Control and Human-Machine Interface technology area under the Advanced Small Modular Reactor (SMR) Research and Development Program of the U.S. Department of Energy. The report documents the definition of strategies, functional elements, and the structural architecture of a supervisory control system for multi-modular advanced SMR (AdvSMR) plants. This research activity advances the state-of-the art by incorporating decision making into the supervisory control system architectural layers through the introduction of a tiered-plant system approach. The report provides a brief history of hierarchical functional architectures and the current state-of-the-art, describes a reference AdvSMR to show the dependencies between systems, presents a hierarchical structure for supervisory control, indicates the importance of understanding trip setpoints, applies a new theoretic approach for comparing architectures, identifies cyber security controls that should be addressed early in system design, and describes ongoing work to develop system requirements and hardware/software configurations
Development of Computational Techniques for Identification of Regulatory DNA Motif
Identifying precise transcription factor binding sites (TFBS) or regulatory DNA motif (motif) plays a fundamental role in researching transcriptional regulatory mechanism in cells and helping construct regulatory networks for biological investigation. Chromatin immunoprecipitation combined with sequencing (ChIP-seq) and lambda exonuclease digestion followed by high-throughput sequencing (ChIP-exo) enables researchers to identify TFBS on a genome-scale with improved resolution. Several algorithms have been developed to perform motif identification, employing widely different methods and often giving divergent results. In addition, these existing methods still suffer from prediction accuracy. Thesis focuses on the development of improved regulatory DNA motif identification techniques. We designed an integrated framework, WTSA, that can reliably combine the experimental signals from ChIP-exo data in base pair (bp) resolution to predict the statistically significant DNA motifs. The algorithm improves the prediction accuracy and extends the scope of applicability of the existing methods. We have applied the framework to Escherichia coli k12 genome and evaluated WTSA prediction performance through comparison with seven existing programs. The performance evaluation indicated that WTSA provides reliable predictive power for regulatory motifs using ChIP-exo data. An important application of DNA motif identification is to identify transcriptional regulatory mechanisms. The rapid development of single-cell RNA-Sequencing (scRNAseq) technologies provides an unprecedented opportunity to discover the gene transcriptional regulation at the single-cell level. In the scRNA-seq analyses, a critical step is to identify the cell-type-specific regulons (CTS-Rs), each of which is a group of genes co-regulated by the same transcription regulator in a specific cell type. We developed a web server, IRIS3 (Integrated Cell-type-specific Regulon Inference Server from Single-cell RNA-Seq), to solve this problem by the integration of data preprocessing, cell type prediction, gene module identification, and cis-regulatory motif analyses. Compared with other packages, IRIS3 predicts more efficiently and provides more accurate regulon from scRNA-seq data. These CTS-Rs can substantially improve the elucidation of heterogeneous regulatory mechanisms among various cell types and allow reliable constructions of global transcriptional regulation networks encoded in a specific cell type. Also presented in this thesis is DESSO (DEep Sequence and Shape mOtif (DESSO), using deep neural networks and the binomial distribution model to identify DNA motifs, DESSO outperformed existing tools, including DeepBind, in 690 human ENCODE ChIP-Sequencing datasets. DESSO also further expanded motif identification power by integrating the detection of DNA shape features
- …