88,012 research outputs found
Experimental Biological Protocols with Formal Semantics
Both experimental and computational biology is becoming increasingly
automated. Laboratory experiments are now performed automatically on
high-throughput machinery, while computational models are synthesized or
inferred automatically from data. However, integration between automated tasks
in the process of biological discovery is still lacking, largely due to
incompatible or missing formal representations. While theories are expressed
formally as computational models, existing languages for encoding and
automating experimental protocols often lack formal semantics. This makes it
challenging to extract novel understanding by identifying when theory and
experimental evidence disagree due to errors in the models or the protocols
used to validate them. To address this, we formalize the syntax of a core
protocol language, which provides a unified description for the models of
biochemical systems being experimented on, together with the discrete events
representing the liquid-handling steps of biological protocols. We present both
a deterministic and a stochastic semantics to this language, both defined in
terms of hybrid processes. In particular, the stochastic semantics captures
uncertainties in equipment tolerances, making it a suitable tool for both
experimental and computational biologists. We illustrate how the proposed
protocol language can be used for automated verification and synthesis of
laboratory experiments on case studies from the fields of chemistry and
molecular programming
Collaboration Enabling Internet Resource Collection-Building Software and Technologies
Over the last decade the Library of the University of California, Riverside
and its collaborators have developed a number of systems, service designs,
and projects that utilize innovative technologies to foster better Internet
finding tools in libraries and more cooperative and efficient effort in Internet
link and metadata collection building. The open-source software
and projects discussed represent appropriate technologies and sustainable
strategies that we believe will help Internet portals, digital libraries, virtual libraries,
library catalogs-with-portal-like-capabilities (IPDVLCs), and related
collection-building efforts in academia to better scale and more accurately
anticipate and meet the needs of scholarly and educational users.published or submitted for publicatio
Automated legal sensemaking: the centrality of relevance and intentionality
Introduction: In a perfect world, discovery would ideally be conducted by the senior litigator who is
responsible for developing and fully understanding all nuances of their client’s legal strategy. Of
course today we must deal with the explosion of electronically stored information (ESI) that
never is less than tens-of-thousands of documents in small cases and now increasingly involves
multi-million-document populations for internal corporate investigations and litigations.
Therefore scalable processes and technologies are required as a substitute for the authority’s
judgment. The approaches taken have typically either substituted large teams of surrogate
human reviewers using vastly simplified issue coding reference materials or employed
increasingly sophisticated computational resources with little focus on quality metrics to insure
retrieval consistent with the legal goal. What is required is a system (people, process, and
technology) that replicates and automates the senior litigator’s human judgment.
In this paper we utilize 15 years of sensemaking research to establish the minimum acceptable
basis for conducting a document review that meets the needs of a legal proceeding. There is
no substitute for a rigorous characterization of the explicit and tacit goals of the senior litigator.
Once a process has been established for capturing the authority’s relevance criteria, we argue
that literal translation of requirements into technical specifications does not properly account for
the activities or states-of-affairs of interest. Having only a data warehouse of written records, it
is also necessary to discover the intentions of actors involved in textual communications. We
present quantitative results for a process and technology approach that automates effective
legal sensemaking
Learning Fast and Slow: PROPEDEUTICA for Real-time Malware Detection
In this paper, we introduce and evaluate PROPEDEUTICA, a novel methodology
and framework for efficient and effective real-time malware detection,
leveraging the best of conventional machine learning (ML) and deep learning
(DL) algorithms. In PROPEDEUTICA, all software processes in the system start
execution subjected to a conventional ML detector for fast classification. If a
piece of software receives a borderline classification, it is subjected to
further analysis via more performance expensive and more accurate DL methods,
via our newly proposed DL algorithm DEEPMALWARE. Further, we introduce delays
to the execution of software subjected to deep learning analysis as a way to
"buy time" for DL analysis and to rate-limit the impact of possible malware in
the system. We evaluated PROPEDEUTICA with a set of 9,115 malware samples and
877 commonly used benign software samples from various categories for the
Windows OS. Our results show that the false positive rate for conventional ML
methods can reach 20%, and for modern DL methods it is usually below 6%.
However, the classification time for DL can be 100X longer than conventional ML
methods. PROPEDEUTICA improved the detection F1-score from 77.54% (conventional
ML method) to 90.25%, and reduced the detection time by 54.86%. Further, the
percentage of software subjected to DL analysis was approximately 40% on
average. Further, the application of delays in software subjected to ML reduced
the detection time by approximately 10%. Finally, we found and discussed a
discrepancy between the detection accuracy offline (analysis after all traces
are collected) and on-the-fly (analysis in tandem with trace collection). Our
insights show that conventional ML and modern DL-based malware detectors in
isolation cannot meet the needs of efficient and effective malware detection:
high accuracy, low false positive rate, and short classification time.Comment: 17 pages, 7 figure
Web Service Discovery in a Semantically Extended UDDI Registry: the Case of FUSION
Service-oriented computing is being adopted at an unprecedented rate, making the effectiveness of automated service discovery an increasingly important challenge. UDDI has emerged as a de facto industry standard and fundamental building block within SOA infrastructures. Nevertheless, conventional UDDI registries lack means to provide unambiguous, semantically rich representations of Web service capabilities, and the logic inference power required for facilitating automated service discovery. To overcome this important limitation, a number of approaches have been proposed towards augmenting Web service discovery with semantics. This paper discusses the benefits of semantically extending Web service descriptions and UDDI registries, and presents an overview of the approach put forward in project FUSION, towards semantically-enhanced publication and discovery of services based on SAWSDL
What Can Artificial Intelligence Do for Scientific Realism?
The paper proposes a synthesis between human scientists and artificial representation learning models as a way of augmenting epistemic warrants of realist theories against various anti-realist attempts. Towards this end, the paper fleshes out unconceived alternatives not as a critique of scientific realism but rather a reinforcement, as it rejects the retrospective interpretations of scientific progress, which brought about the problem of alternatives in the first place. By utilising adversarial machine learning, the synthesis explores possibility spaces of available evidence for unconceived alternatives providing modal knowledge of what is possible therein. As a result, the epistemic warrant of synthesised realist theories should emerge bolstered as the underdetermination by available evidence gets reduced. While shifting the realist commitment away from theoretical artefacts towards modalities of the possibility spaces, the synthesis comes out as a kind of perspectival modelling
- …