75 research outputs found
Code-Style In-Context Learning for Knowledge-Based Question Answering
Current methods for Knowledge-Based Question Answering (KBQA) usually rely on
complex training techniques and model frameworks, leading to many limitations
in practical applications. Recently, the emergence of In-Context Learning (ICL)
capabilities in Large Language Models (LLMs) provides a simple and
training-free semantic parsing paradigm for KBQA: Given a small number of
questions and their labeled logical forms as demo examples, LLMs can understand
the task intent and generate the logic form for a new question. However,
current powerful LLMs have little exposure to logic forms during pre-training,
resulting in a high format error rate. To solve this problem, we propose a
code-style in-context learning method for KBQA, which converts the generation
process of unfamiliar logical form into the more familiar code generation
process for LLMs. Experimental results on three mainstream datasets show that
our method dramatically mitigated the formatting error problem in generating
logic forms while realizing a new SOTA on WebQSP, GrailQA, and GraphQ under the
few-shot setting.Comment: work in progres
Towards Multi-perspective Conformance Checking with Fuzzy Sets
Nowadays organizations often need to employ data-driven techniques to audit their business processes and ensure they comply with laws and internal/external regulations. Failing in complying with the expected process behavior can indeed pave the way to inefficiencies or, worse, to frauds or abuses. An increasingly popular approach to automatically assess the compliance of the executions of organization processes is represented by alignment-based conformance checking. These techniques are able to compare real process executions with models representing the expected behaviors, providing diagnostics able to pinpoint possible discrepancies. However, the diagnostics generated by state of the art techniques still suffer from some limitations. They perform a crisp evaluation of process compliance, marking process behavior either as compliant or deviant, without taking into account the severity of the identified deviation. This hampers the accuracy of the obtained diagnostics and can lead to misleading results, especially in contexts where there is some tolerance with respect to violations of the process guidelines. In the present work, we discuss the impact and the drawbacks of a crisp deviation assessment approach. Then, we propose a novel conformance checking approach aimed at representing actors’ tolerance with respect to process deviations, taking it into account when assessing the severity of the deviations. As a proof of concept, we performed a set of synthetic experiments to assess the approach. The obtained results point out the potential of the usage of a more flexible evaluation of process deviations, and its impact on the quality and the interpretation of the obtained diagnostics
Re-ordered fuzzy conformance checking for uncertain clinical records
Modern hospitals implement clinical pathways to standardize patients’ treatments. Conformance checking techniques provide an automated tool to assess whether the actual executions of clinical processes comply with the corresponding clinical pathways. However, clinical processes are typically characterized by a high degree of uncertainty, both in their execution and recording. This paper focuses on uncertainty related to logging clinical processes. The logging of the activities executed during a clinical process in the hospital information system is often performed manually by the involved actors (e.g., the nurses). However, such logging can occur at a different time than the actual execution time, which hampers the reliability of the diagnostics provided by conformance checking techniques. To address this issue, we propose a novel conformance checking algorithm that leverages principles of fuzzy set theory to incorporate experts’ knowledge when generating conformance diagnostics. We exploit this knowledge to define a fuzzy tolerance in a time window, which is then used to assess the magnitude of timestamp violations of the recorded activities when evaluating the overall process execution compliance. Experiments conducted on a real-life case study in a Dutch hospital show that the proposed method obtains more accurate diagnostics than the state-of-the-art approaches. We also consider how our diagnostics can be used to stimulate discussion with domain experts on possible strategies to mitigate logging uncertainty in the clinical practice.</p
A General SIMD-based Approach to Accelerating Compression Algorithms
Compression algorithms are important for data oriented tasks, especially in
the era of Big Data. Modern processors equipped with powerful SIMD instruction
sets, provide us an opportunity for achieving better compression performance.
Previous research has shown that SIMD-based optimizations can multiply decoding
speeds. Following these pioneering studies, we propose a general approach to
accelerate compression algorithms. By instantiating the approach, we have
developed several novel integer compression algorithms, called Group-Simple,
Group-Scheme, Group-AFOR, and Group-PFD, and implemented their corresponding
vectorized versions. We evaluate the proposed algorithms on two public TREC
datasets, a Wikipedia dataset and a Twitter dataset. With competitive
compression ratios and encoding speeds, our SIMD-based algorithms outperform
state-of-the-art non-vectorized algorithms with respect to decoding speeds
Spinel type MCo2O4 (M = Mn, Mg, Ni, Cu, Fe and Zn) for chemoresistance gas sensors
peer reviewedGas sensors present significant research value and broad applicability for environmental monitoring, medical diagnosis and agricultural production. As a p-type spinel ternary semiconductor metal oxide (SMOX), MCo2O4 (M = Mn, Mg, Ni, Cu, Fe and Zn) chemoresistance gas sensors possess a satisfactory sensing performance to diverse hazardous gases. Owning to the superior potential and widespread applications, this work provides a critical review of the current development for MCo2O4 chemoresistance gas sensors. Basic information of MCo2O4 and evaluation criteria of corresponding gas sensors were described primarily. Then the synthesis, morphology, characterization and sensing properties of MCo2O4 gas sensors were elaborated on the basis of different microtopography dimensions under zero dimension (0D), one dimension (1D), two dimension (2D) and three dimension (3D). Various efficient tactics for improving sensing performance and relevant transducing mechanism were demonstrated as well. Finally, perspectives on developing MCo2O4 synthesis and applications in gas sensors were elaborated
Genome dynamics and diversity of Shigella species, the etiologic agents of bacillary dysentery
The Shigella bacteria cause bacillary dysentery, which remains a significant threat to public health. The genus status and species classification appear no longer valid, as compelling evidence indicates that Shigella, as well as enteroinvasive Escherichia coli, are derived from multiple origins of E.coli and form a single pathovar. Nevertheless, Shigella dysenteriae serotype 1 causes deadly epidemics but Shigella boydii is restricted to the Indian subcontinent, while Shigella flexneri and Shigella sonnei are prevalent in developing and developed countries respectively. To begin to explain these distinctive epidemiological and pathological features at the genome level, we have carried out comparative genomics on four representative strains. Each of the Shigella genomes includes a virulence plasmid that encodes conserved primary virulence determinants. The Shigella chromosomes share most of their genes with that of E.coli K12 strain MG1655, but each has over 200 pseudogenes, 300∼700 copies of insertion sequence (IS) elements, and numerous deletions, insertions, translocations and inversions. There is extensive diversity of putative virulence genes, mostly acquired via bacteriophage-mediated lateral gene transfer. Hence, via convergent evolution involving gain and loss of functions, through bacteriophage-mediated gene acquisition, IS-mediated DNA rearrangements and formation of pseudogenes, the Shigella spp. became highly specific human pathogens with variable epidemiological and pathological features
Physiological and Pathological Role of Alpha-synuclein in Parkinson’s Disease Through Iron Mediated Oxidative Stress; The Role of a Putative Iron-responsive Element
Parkinson’s disease (PD) is the second most common progressive neurodegenerative disorder after Alzheimer’s disease (AD) and represents a large health burden to society. Genetic and oxidative risk factors have been proposed as possible causes, but their relative contribution remains unclear. Dysfunction of alpha-synuclein (α-syn) has been associated with PD due to its increased presence, together with iron, in Lewy bodies. Brain oxidative damage caused by iron may be partly mediated by α-syn oligomerization during PD pathology. Also, α-syn gene dosage can cause familial PD and inhibition of its gene expression by blocking translation via a newly identified Iron Responsive Element-like RNA sequence in its 5’-untranslated region may provide a new PD drug target
Sharp constant of Hardy operators corresponding to general positive measures
Abstract We investigate a new kind of Hardy operator Hμ with respect to arbitrary positive measures μ and prove that Hμ is bounded on Lp(dμ) with an upper constant p/(p−1) . Moreover, we characterize a sufficient condition about the measure which makes p/(p−1) to be the Lp -norm of Hμ
- …