2,444 research outputs found
Multi-Dimensional Explanation of Target Variables from Documents
Automated predictions require explanations to be interpretable by humans.
Past work used attention and rationale mechanisms to find words that predict
the target variable of a document. Often though, they result in a tradeoff
between noisy explanations or a drop in accuracy. Furthermore, rationale
methods cannot capture the multi-faceted nature of justifications for multiple
targets, because of the non-probabilistic nature of the mask. In this paper, we
propose the Multi-Target Masker (MTM) to address these shortcomings. The
novelty lies in the soft multi-dimensional mask that models a relevance
probability distribution over the set of target variables to handle
ambiguities. Additionally, two regularizers guide MTM to induce long,
meaningful explanations. We evaluate MTM on two datasets and show, using
standard metrics and human annotations, that the resulting masks are more
accurate and coherent than those generated by the state-of-the-art methods.
Moreover, MTM is the first to also achieve the highest F1 scores for all the
target variables simultaneously.Comment: Accepted in AAAI 2021. 18 pages, 14 figures, 9 table
Non-Destructive Technologies for Detecting Insect Infestation in Fruits and Vegetables under Postharvest Conditions: A Critical Review
In the last two decades, food scientists have attempted to develop new technologies that can improve the detection of insect infestation in fruits and vegetables under postharvest conditions using a multitude of non-destructive technologies. While consumers\u27 expectations for higher nutritive and sensorial value of fresh produce has increased over time, they have also become more critical on using insecticides or synthetic chemicals to preserve food quality from insects\u27 attacks or enhance the quality attributes of minimally processed fresh produce. In addition, the increasingly stringent quarantine measures by regulatory agencies for commercial import-export of fresh produce needs more reliable technologies for quickly detecting insect infestation in fruits and vegetables before their commercialization. For these reasons, the food industry investigates alternative and non-destructive means to improve food quality. Several studies have been conducted on the development of rapid, accurate, and reliable insect infestation monitoring systems to replace invasive and subjective methods that are often inefficient. There are still major limitations to the effective in-field, as well as postharvest on-line, monitoring applications. This review presents a general overview of current non-destructive techniques for the detection of insect damage in fruits and vegetables and discusses basic principles and applications. The paper also elaborates on the specific post-harvest fruit infestation detection methods, which include principles, protocols, specific application examples, merits, and limitations. The methods reviewed include those based on spectroscopy, imaging, acoustic sensing, and chemical interactions, with greater emphasis on the noninvasive methods. This review also discusses the current research gaps as well as the future research directions for non-destructive methods\u27 application in the detection and classification of insect infestation in fruits and vegetables
TCM-SD: A Benchmark for Probing Syndrome Differentiation via Natural Language Processing
Traditional Chinese Medicine (TCM) is a natural, safe, and effective therapy
that has spread and been applied worldwide. The unique TCM diagnosis and
treatment system requires a comprehensive analysis of a patient's symptoms
hidden in the clinical record written in free text. Prior studies have shown
that this system can be informationized and intelligentized with the aid of
artificial intelligence (AI) technology, such as natural language processing
(NLP). However, existing datasets are not of sufficient quality nor quantity to
support the further development of data-driven AI technology in TCM. Therefore,
in this paper, we focus on the core task of the TCM diagnosis and treatment
system -- syndrome differentiation (SD) -- and we introduce the first public
large-scale dataset for SD, called TCM-SD. Our dataset contains 54,152
real-world clinical records covering 148 syndromes. Furthermore, we collect a
large-scale unlabelled textual corpus in the field of TCM and propose a
domain-specific pre-trained language model, called ZY-BERT. We conducted
experiments using deep neural networks to establish a strong performance
baseline, reveal various challenges in SD, and prove the potential of
domain-specific pre-trained language model. Our study and analysis reveal
opportunities for incorporating computer science and linguistics knowledge to
explore the empirical validity of TCM theories.Comment: 10 main pages + 2 reference pages, to appear at CCL202
Advances in Patient Classification for Traditional Chinese Medicine: A Machine Learning Perspective
As a complementary and alternative medicine in medical field, traditional Chinese medicine (TCM) has drawn great attention in the domestic field and overseas. In practice, TCM provides a quite distinct methodology to patient diagnosis and treatment compared to western medicine (WM). Syndrome (ZHENG or pattern) is differentiated by a set of symptoms and signs
examined from an individual by four main diagnostic methods: inspection, auscultation and olfaction, interrogation, and palpation which reflects the pathological and physiological changes of
disease occurrence and development. Patient classification is to divide patients into several classes based on different criteria. In this paper, from the machine learning perspective, a survey on
patient classification issue will be summarized on three major aspects of TCM: sign classification, syndrome differentiation, and disease classification. With the consideration of different diagnostic
data analyzed by different computational methods, we present the overview for four subfields of TCM diagnosis, respectively. For each subfield, we design a rectangular reference list with applications in the horizontal direction and machine learning algorithms in the longitudinal direction. According to the current development of objective TCM diagnosis for patient classification, a discussion of the research issues around machine learning techniques with applications to TCM diagnosis is given to facilitate the further research for TCM patient classification
Unsupervised Learning and Multipartite Network Models: A Promising Approach for Understanding Traditional Medicine
The ultimate goal of precision medicine is to determine right treatment for right patients based on precise diagnosis. To achieve this goal, correct stratification of patients using molecular features and clinical phenotypes is crucial. During the long history of medical science, our understanding on disease classification has been improved greatly by chemistry and molecular biology. Nowadays, we gain access to large scale patient-derived data by high-throughput technologies, generating a greater need for data science including unsupervised learning and network modeling. Unsupervised learning methods such as clustering could be a better solution to stratify patients when there is a lack of predefined classifiers. In network modularity analysis, clustering methods can be also applied to elucidate the complex structure of biological and disease networks at the systems level. In this review, we went over the main points of clustering analysis and network modeling, particularly in the context of Traditional Chinese medicine (TCM). We showed that this approach can provide novel insights on the rationale of classification for TCM herbs. In a case study, using a modularity analysis of multipartite networks, we illustrated that the TCM classifications are associated with the chemical properties of the herb ingredients. We concluded that multipartite network modeling may become a suitable data integration tool for understanding the mechanisms of actions of traditional medicine.Peer reviewe
Artificial Intelligence in Oral Health
This Special Issue is intended to lay the foundation of AI applications focusing on oral health, including general dentistry, periodontology, implantology, oral surgery, oral radiology, orthodontics, and prosthodontics, among others
Integrating heterogeneous data into electronic medical record analysis
Electronic medical records (EMRs) are the digital equivalent of paper records at a clinician's office. They contain patient information such as treatment and medical history, and have been shown to have a wide variety of benefits.
However, EMRs typically contain a multitude of diverse data, including images, doctor notes, medical test results, and genomic data. This heterogeneity generates high dimensionality and data sparsity, which are two of the most prevalent culprits that exacerbate already difficult computational problems. Additionally, domain-specific characteristics, such as the existence of synonyms in the medical vocabulary, introduce ambiguity. This can further reduce the data mining potential of EMRs.
This thesis is a systematic study that addresses these issues associated with EMRs. In particular, I utilized heterogeneous data sources that are typically incompatible, and then developed frameworks in which these data sources complement one another. As a result, these methods have the potential for direct clinical translation, paving the way for improving healthcare from a data-driven perspective.
To improve a variety of downstream healthcare applications, such as patient subcategorization, survival analysis, and visualization, I used external networks of domain knowledge consisting of drug-symptom relationships, protein-protein interactions, and genetic information to enhance patient records. I found that this enhancement process increased the data mining capabilities as well as the interpretability of the EMRs.
To improve EMR retrieval systems, I developed a query expansion method that frames symptoms and treatments as two different languages. I found that a topic modeling method that follows this dual-language framework yielded the highest performance. Lastly, I showed that due to pathological similarities, jointly studying Alzheimer's disease and Parkinson's disease resulted in higher computational power by effectively increasing the size of the training datasets. This allowed for the accurate prediction of the onset of dementia in both diseases.
Each of these results can lay the groundwork for applications that have the potential to be implemented directly in clinical practice, improving the safety and quality of patient care
3D printed neuromorphic sensing systems
Thanks to the high energy efficiency, neuromorphic devices are spotlighted recently by mimicking the calculation principle of the human brain through the parallel computation and the memory function. Various bio-inspired \u27in-memory computing\u27 (IMC) devices were developed during the past decades, such as synaptic transistors for artificial synapses. By integrating with specific sensors, neuromorphic sensing systems are achievable with the bio-inspired signal perception function. A signal perception process is possible by a combination of stimuli sensing, signal conversion/transmission, and signal processing. However, most neuromorphic sensing systems were demonstrated without signal conversion/transmission functions. Therefore, those cannot fully mimic the function provides by the sensory neuron in the biological system. This thesis aims to design a neuromorphic sensing system with a complete function as biological sensory neurons. To reach such a target, 3D printed sensors, electrical oscillators, and synaptic transistors were developed as functions of artificial receptors, artificial neurons, and artificial synapses, respectively. Moreover, since the 3D printing technology has demonstrated a facile process due to fast prototyping, the proposed 3D neuromorphic sensing system was designed as a 3D integrated structure and fabricated by 3D printing technologies. A novel multi-axis robot 3D printing system was also utilized to increase the fabrication efficiency with the capability of printing on vertical and tilted surfaces seamlessly. Furthermore, the developed 3D neuromorphic system was easily adapted to the application of tactile sensing. A portable neuromorphic system was integrated with a tactile sensing system for the intelligent tactile sensing application of the humanoid robot. Finally, the bio-inspired reflex arc for the unconscious response was also demonstrated by training the neuromorphic tactile sensing system
Smart Gas Sensors: Materials, Technologies, Practical ‎Applications, and Use of Machine Learning – A Review
The electronic nose, popularly known as the E-nose, that combines gas sensor arrays (GSAs) with machine learning has gained a strong foothold in gas sensing technology. The E-nose designed to mimic the human olfactory system, is used for the detection and identification of various volatile compounds. The GSAs develop a unique signal fingerprint for each volatile compound to enable pattern recognition using machine learning algorithms. The inexpensive, portable and non-invasive characteristics of the E-nose system have rendered it indispensable within the gas-sensing arena. As a result, E-noses have been widely employed in several applications in the areas of the food industry, health management, disease diagnosis, water and air quality control, and toxic gas leakage detection. This paper reviews the various sensor fabrication technologies of GSAs and highlights the main operational framework of the E-nose system. The paper details vital signal pre-processing techniques of feature extraction, feature selection, in addition to machine learning algorithms such as SVM, kNN, ANN, and Random Forests for determining the type of gas and estimating its concentration in a competitive environment. The paper further explores the potential applications of E-noses for diagnosing diseases, monitoring air quality, assessing the quality of food samples and estimating concentrations of volatile organic compounds (VOCs) in air and in food samples. The review concludes with some challenges faced by E-nose, alternative ways to tackle them and proposes some recommendations as potential future work for further development and design enhancement of E-noses
- …