724 research outputs found
An Interpretable Generalization Mechanism for Accurately Detecting Anomaly and Identifying Networking Intrusion Techniques
Recent advancements in Intrusion Detection Systems (IDS), integrating
Explainable AI (XAI) methodologies, have led to notable improvements in system
performance via precise feature selection. However, a thorough understanding of
cyber-attacks requires inherently explainable decision-making processes within
IDS. In this paper, we present the Interpretable Generalization Mechanism (IG),
poised to revolutionize IDS capabilities. IG discerns coherent patterns, making
it interpretable in distinguishing between normal and anomalous network
traffic. Further, the synthesis of coherent patterns sheds light on intricate
intrusion pathways, providing essential insights for cybersecurity forensics.
By experiments with real-world datasets NSL-KDD, UNSW-NB15, and UKM-IDS20, IG
is accurate even at a low ratio of training-to-test. With 10%-to-90%, IG
achieves Precision (PRE)=0.93, Recall (REC)=0.94, and Area Under Curve
(AUC)=0.94 in NSL-KDD; PRE=0.98, REC=0.99, and AUC=0.99 in UNSW-NB15; and
PRE=0.98, REC=0.98, and AUC=0.99 in UKM-IDS20. Notably, in UNSW-NB15, IG
achieves REC=1.0 and at least PRE=0.98 since 40%-to-60%; in UKM-IDS20, IG
achieves REC=1.0 and at least PRE=0.88 since 20%-to-80%. Importantly, in
UKM-IDS20, IG successfully identifies all three anomalous instances without
prior exposure, demonstrating its generalization capabilities. These results
and inferences are reproducible. In sum, IG showcases superior generalization
by consistently performing well across diverse datasets and training-to-test
ratios (from 10%-to-90% to 90%-to-10%), and excels in identifying novel
anomalies without prior exposure. Its interpretability is enhanced by coherent
evidence that accurately distinguishes both normal and anomalous activities,
significantly improving detection accuracy and reducing false alarms, thereby
strengthening IDS reliability and trustworthiness
The construction of bifunctional fusion xylanolytic enzymes and the prediction of optimum reaction conditions for the enzyme activity
Four chimeric xylanolytic enzymes were formed by fusion of a thermally stable xylanase XynCDBFV either to the N-terminus or C-terminus of a thermally stable acetylxylan esterase AxeS20E, with or without a Gly-rich flexible linker (S2). The three-dimensional (3D) structures of the chimeric enzymes were predicted using the I-TASSER server, and the results indicated that the structures of Axe-S2-Xyn and Xyn-S2-Axe were more similar to the native structures than were those of Axe-Xyn and Xyn-Axe. Axe-S2-Xyn and Xyn-S2-Axe were expressed in Escherichia coli and purified by means of affinity chromatography. Response surface modeling (RSM), combined with central composite design (CCD) and regression analysis, was then employed to optimize the xylanase activities of the chimeric enzymes. Under the optimal conditions, Xyn-S2-Axe had greater hydrolytic activities on natural xylans and rice straw than did the parental enzymes. These results suggested that the chimeric enzyme Xyn-S2-Axe could be effective at hydrolyzing xylan in biomass and that it has potential to be used in a range of biotechnological applications
Sustained proliferation in cancer: mechanisms and novel therapeutic targets
Proliferation is an important part of cancer development and progression. This is manifest by altered expression and/or activity of cell cycle related proteins. Constitutive activation of many signal transduction pathways also stimulates cell growth. Early steps in tumor development are associated with a fibrogenic response and the development of a hypoxic environment which favors the survival and proliferation of cancer stem cells. Part of the survival strategy of cancer stem cells may manifested by alterations in cell metabolism. Once tumors appear, growth and metastasis may be supported by overproduction of appropriate hormones (in hormonally dependent cancers), by promoting angiogenesis, by undergoing epithelial to mesenchymal transition, by triggering autophagy, and by taking cues from surrounding stromal cells. A number of natural compounds (e.g., curcumin, resveratrol, indole-3-carbinol, brassinin, sulforaphane, epigallocatechin-3-gallate, genistein, ellagitannins, lycopene and quercetin) have been found to inhibit one or more pathways that contribute to proliferation (e.g., hypoxia inducible factor 1, nuclear factor kappa B, phosphoinositide 3 kinase/Akt, insulin-like growth factor receptor 1, Wnt, cell cycle associated proteins, as well as androgen and estrogen receptor signaling). These data, in combination with bioinformatics analyses, will be very important for identifying signaling pathways and molecular targets that may provide early diagnostic markers and/or critical targets for the development of new drugs or drug combinations that block tumor formation and progression
Materials for Pharmaceutical Dosage Forms: Molecular Pharmaceutics and Controlled Release Drug Delivery Aspects
Controlled release delivery is available for many routes of administration and offers many advantages (as microparticles and nanoparticles) over immediate release delivery. These advantages include reduced dosing frequency, better therapeutic control, fewer side effects, and, consequently, these dosage forms are well accepted by patients. Advances in polymer material science, particle engineering design, manufacture, and nanotechnology have led the way to the introduction of several marketed controlled release products and several more are in pre-clinical and clinical development
First narrow-band search for continuous gravitational waves from known pulsars in advanced detector data
Spinning neutron stars asymmetric with respect to their rotation axis are potential sources of
continuous gravitational waves for ground-based interferometric detectors. In the case of known pulsars a
fully coherent search, based on matched filtering, which uses the position and rotational parameters
obtained from electromagnetic observations, can be carried out. Matched filtering maximizes the signalto-
noise (SNR) ratio, but a large sensitivity loss is expected in case of even a very small mismatch
between the assumed and the true signal parameters. For this reason, narrow-band analysis methods have
been developed, allowing a fully coherent search for gravitational waves from known pulsars over a
fraction of a hertz and several spin-down values. In this paper we describe a narrow-band search of
11 pulsars using data from Advanced LIGO’s first observing run. Although we have found several initial
outliers, further studies show no significant evidence for the presence of a gravitational wave signal.
Finally, we have placed upper limits on the signal strain amplitude lower than the spin-down limit for 5 of
the 11 targets over the bands searched; in the case of J1813-1749 the spin-down limit has been beaten for
the first time. For an additional 3 targets, the median upper limit across the search bands is below the
spin-down limit. This is the most sensitive narrow-band search for continuous gravitational waves carried
out so far
Detection and Alignment of 3D Domain Swapping Proteins Using Angle-Distance Image-Based Secondary Structural Matching Techniques
This work presents a novel detection method for three-dimensional domain swapping (DS), a mechanism for forming protein quaternary structures that can be visualized as if monomers had “opened” their “closed” structures and exchanged the opened portion to form intertwined oligomers. Since the first report of DS in the mid 1990s, an increasing number of identified cases has led to the postulation that DS might occur in a protein with an unconstrained terminus under appropriate conditions. DS may play important roles in the molecular evolution and functional regulation of proteins and the formation of depositions in Alzheimer's and prion diseases. Moreover, it is promising for designing auto-assembling biomaterials. Despite the increasing interest in DS, related bioinformatics methods are rarely available. Owing to a dramatic conformational difference between the monomeric/closed and oligomeric/open forms, conventional structural comparison methods are inadequate for detecting DS. Hence, there is also a lack of comprehensive datasets for studying DS. Based on angle-distance (A-D) image transformations of secondary structural elements (SSEs), specific patterns within A-D images can be recognized and classified for structural similarities. In this work, a matching algorithm to extract corresponding SSE pairs from A-D images and a novel DS score have been designed and demonstrated to be applicable to the detection of DS relationships. The Matthews correlation coefficient (MCC) and sensitivity of the proposed DS-detecting method were higher than 0.81 even when the sequence identities of the proteins examined were lower than 10%. On average, the alignment percentage and root-mean-square distance (RMSD) computed by the proposed method were 90% and 1.8Å for a set of 1,211 DS-related pairs of proteins. The performances of structural alignments remain high and stable for DS-related homologs with less than 10% sequence identities. In addition, the quality of its hinge loop determination is comparable to that of manual inspection. This method has been implemented as a web-based tool, which requires two protein structures as the input and then the type and/or existence of DS relationships between the input structures are determined according to the A-D image-based structural alignments and the DS score. The proposed method is expected to trigger large-scale studies of this interesting structural phenomenon and facilitate related applications
Growth of nanostructures by cluster deposition : a review
This paper presents a comprehensive analysis of simple models useful to
analyze the growth of nanostructures obtained by cluster deposition. After
detailing the potential interest of nanostructures, I extensively study the
first stages of growth (the submonolayer regime) by kinetic Monte-Carlo
simulations. These simulations are performed in a wide variety of experimental
situations : complete condensation, growth with reevaporation, nucleation on
defects, total or null cluster-cluster coalescence... The main scope of the
paper is to help experimentalists analyzing their data to deduce which of those
processes are important and to quantify them. A software including all these
simulation programs is available at no cost on request to the author. I
carefully discuss experiments of growth from cluster beams and show how the
mobility of the clusters on the surface can be measured : surprisingly high
values are found. An important issue for future technological applications of
cluster deposition is the relation between the size of the incident clusters
and the size of the islands obtained on the substrate. An approximate formula
which gives the ratio of the two sizes as a function of the melting temperature
of the material deposited is given. Finally, I study the atomic mechanisms
which can explain the diffusion of the clusters on a substrate and the result
of their mutual interaction (simple juxtaposition, partial or total
coalescence...)Comment: To be published Rev Mod Phys, Oct 99, RevTeX, 37 figure
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Large language models (LLMs) have been shown to be able to perform new tasks
based on a few demonstrations or natural language instructions. While these
capabilities have led to widespread adoption, most LLMs are developed by
resource-rich organizations and are frequently kept from the public. As a step
towards democratizing this powerful technology, we present BLOOM, a
176B-parameter open-access language model designed and built thanks to a
collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer
language model that was trained on the ROOTS corpus, a dataset comprising
hundreds of sources in 46 natural and 13 programming languages (59 in total).
We find that BLOOM achieves competitive performance on a wide variety of
benchmarks, with stronger results after undergoing multitask prompted
finetuning. To facilitate future research and applications using LLMs, we
publicly release our models and code under the Responsible AI License
Robust estimation of bacterial cell count from optical density
Optical density (OD) is widely used to estimate the density of cells in liquid culture, but cannot be compared between instruments without a standardized calibration protocol and is challenging to relate to actual cell count. We address this with an interlaboratory study comparing three simple, low-cost, and highly accessible OD calibration protocols across 244 laboratories, applied to eight strains of constitutive GFP-expressing E. coli. Based on our results, we recommend calibrating OD to estimated cell count using serial dilution of silica microspheres, which produces highly precise calibration (95.5% of residuals <1.2-fold), is easily assessed for quality control, also assesses instrument effective linear range, and can be combined with fluorescence calibration to obtain units of Molecules of Equivalent Fluorescein (MEFL) per cell, allowing direct comparison and data fusion with flow cytometry measurements: in our study, fluorescence per cell measurements showed only a 1.07-fold mean difference between plate reader and flow cytometry data
- …