2,444 research outputs found

    Multi-Dimensional Explanation of Target Variables from Documents

    Full text link
    Automated predictions require explanations to be interpretable by humans. Past work used attention and rationale mechanisms to find words that predict the target variable of a document. Often though, they result in a tradeoff between noisy explanations or a drop in accuracy. Furthermore, rationale methods cannot capture the multi-faceted nature of justifications for multiple targets, because of the non-probabilistic nature of the mask. In this paper, we propose the Multi-Target Masker (MTM) to address these shortcomings. The novelty lies in the soft multi-dimensional mask that models a relevance probability distribution over the set of target variables to handle ambiguities. Additionally, two regularizers guide MTM to induce long, meaningful explanations. We evaluate MTM on two datasets and show, using standard metrics and human annotations, that the resulting masks are more accurate and coherent than those generated by the state-of-the-art methods. Moreover, MTM is the first to also achieve the highest F1 scores for all the target variables simultaneously.Comment: Accepted in AAAI 2021. 18 pages, 14 figures, 9 table

    Non-Destructive Technologies for Detecting Insect Infestation in Fruits and Vegetables under Postharvest Conditions: A Critical Review

    Get PDF
    In the last two decades, food scientists have attempted to develop new technologies that can improve the detection of insect infestation in fruits and vegetables under postharvest conditions using a multitude of non-destructive technologies. While consumers\u27 expectations for higher nutritive and sensorial value of fresh produce has increased over time, they have also become more critical on using insecticides or synthetic chemicals to preserve food quality from insects\u27 attacks or enhance the quality attributes of minimally processed fresh produce. In addition, the increasingly stringent quarantine measures by regulatory agencies for commercial import-export of fresh produce needs more reliable technologies for quickly detecting insect infestation in fruits and vegetables before their commercialization. For these reasons, the food industry investigates alternative and non-destructive means to improve food quality. Several studies have been conducted on the development of rapid, accurate, and reliable insect infestation monitoring systems to replace invasive and subjective methods that are often inefficient. There are still major limitations to the effective in-field, as well as postharvest on-line, monitoring applications. This review presents a general overview of current non-destructive techniques for the detection of insect damage in fruits and vegetables and discusses basic principles and applications. The paper also elaborates on the specific post-harvest fruit infestation detection methods, which include principles, protocols, specific application examples, merits, and limitations. The methods reviewed include those based on spectroscopy, imaging, acoustic sensing, and chemical interactions, with greater emphasis on the noninvasive methods. This review also discusses the current research gaps as well as the future research directions for non-destructive methods\u27 application in the detection and classification of insect infestation in fruits and vegetables

    PowerGuard{reg_sign} Advanced Manufacturing; PVMaT Phase 1 Final Technical Report: June 1, 1998 to September 30, 1999

    Full text link

    TCM-SD: A Benchmark for Probing Syndrome Differentiation via Natural Language Processing

    Full text link
    Traditional Chinese Medicine (TCM) is a natural, safe, and effective therapy that has spread and been applied worldwide. The unique TCM diagnosis and treatment system requires a comprehensive analysis of a patient's symptoms hidden in the clinical record written in free text. Prior studies have shown that this system can be informationized and intelligentized with the aid of artificial intelligence (AI) technology, such as natural language processing (NLP). However, existing datasets are not of sufficient quality nor quantity to support the further development of data-driven AI technology in TCM. Therefore, in this paper, we focus on the core task of the TCM diagnosis and treatment system -- syndrome differentiation (SD) -- and we introduce the first public large-scale dataset for SD, called TCM-SD. Our dataset contains 54,152 real-world clinical records covering 148 syndromes. Furthermore, we collect a large-scale unlabelled textual corpus in the field of TCM and propose a domain-specific pre-trained language model, called ZY-BERT. We conducted experiments using deep neural networks to establish a strong performance baseline, reveal various challenges in SD, and prove the potential of domain-specific pre-trained language model. Our study and analysis reveal opportunities for incorporating computer science and linguistics knowledge to explore the empirical validity of TCM theories.Comment: 10 main pages + 2 reference pages, to appear at CCL202

    Advances in Patient Classification for Traditional Chinese Medicine: A Machine Learning Perspective

    Get PDF
    As a complementary and alternative medicine in medical field, traditional Chinese medicine (TCM) has drawn great attention in the domestic field and overseas. In practice, TCM provides a quite distinct methodology to patient diagnosis and treatment compared to western medicine (WM). Syndrome (ZHENG or pattern) is differentiated by a set of symptoms and signs examined from an individual by four main diagnostic methods: inspection, auscultation and olfaction, interrogation, and palpation which reflects the pathological and physiological changes of disease occurrence and development. Patient classification is to divide patients into several classes based on different criteria. In this paper, from the machine learning perspective, a survey on patient classification issue will be summarized on three major aspects of TCM: sign classification, syndrome differentiation, and disease classification. With the consideration of different diagnostic data analyzed by different computational methods, we present the overview for four subfields of TCM diagnosis, respectively. For each subfield, we design a rectangular reference list with applications in the horizontal direction and machine learning algorithms in the longitudinal direction. According to the current development of objective TCM diagnosis for patient classification, a discussion of the research issues around machine learning techniques with applications to TCM diagnosis is given to facilitate the further research for TCM patient classification

    Unsupervised Learning and Multipartite Network Models: A Promising Approach for Understanding Traditional Medicine

    Get PDF
    The ultimate goal of precision medicine is to determine right treatment for right patients based on precise diagnosis. To achieve this goal, correct stratification of patients using molecular features and clinical phenotypes is crucial. During the long history of medical science, our understanding on disease classification has been improved greatly by chemistry and molecular biology. Nowadays, we gain access to large scale patient-derived data by high-throughput technologies, generating a greater need for data science including unsupervised learning and network modeling. Unsupervised learning methods such as clustering could be a better solution to stratify patients when there is a lack of predefined classifiers. In network modularity analysis, clustering methods can be also applied to elucidate the complex structure of biological and disease networks at the systems level. In this review, we went over the main points of clustering analysis and network modeling, particularly in the context of Traditional Chinese medicine (TCM). We showed that this approach can provide novel insights on the rationale of classification for TCM herbs. In a case study, using a modularity analysis of multipartite networks, we illustrated that the TCM classifications are associated with the chemical properties of the herb ingredients. We concluded that multipartite network modeling may become a suitable data integration tool for understanding the mechanisms of actions of traditional medicine.Peer reviewe

    Artificial Intelligence in Oral Health

    Get PDF
    This Special Issue is intended to lay the foundation of AI applications focusing on oral health, including general dentistry, periodontology, implantology, oral surgery, oral radiology, orthodontics, and prosthodontics, among others

    Integrating heterogeneous data into electronic medical record analysis

    Get PDF
    Electronic medical records (EMRs) are the digital equivalent of paper records at a clinician's office. They contain patient information such as treatment and medical history, and have been shown to have a wide variety of benefits. However, EMRs typically contain a multitude of diverse data, including images, doctor notes, medical test results, and genomic data. This heterogeneity generates high dimensionality and data sparsity, which are two of the most prevalent culprits that exacerbate already difficult computational problems. Additionally, domain-specific characteristics, such as the existence of synonyms in the medical vocabulary, introduce ambiguity. This can further reduce the data mining potential of EMRs. This thesis is a systematic study that addresses these issues associated with EMRs. In particular, I utilized heterogeneous data sources that are typically incompatible, and then developed frameworks in which these data sources complement one another. As a result, these methods have the potential for direct clinical translation, paving the way for improving healthcare from a data-driven perspective. To improve a variety of downstream healthcare applications, such as patient subcategorization, survival analysis, and visualization, I used external networks of domain knowledge consisting of drug-symptom relationships, protein-protein interactions, and genetic information to enhance patient records. I found that this enhancement process increased the data mining capabilities as well as the interpretability of the EMRs. To improve EMR retrieval systems, I developed a query expansion method that frames symptoms and treatments as two different languages. I found that a topic modeling method that follows this dual-language framework yielded the highest performance. Lastly, I showed that due to pathological similarities, jointly studying Alzheimer's disease and Parkinson's disease resulted in higher computational power by effectively increasing the size of the training datasets. This allowed for the accurate prediction of the onset of dementia in both diseases. Each of these results can lay the groundwork for applications that have the potential to be implemented directly in clinical practice, improving the safety and quality of patient care

    3D printed neuromorphic sensing systems

    Get PDF
    Thanks to the high energy efficiency, neuromorphic devices are spotlighted recently by mimicking the calculation principle of the human brain through the parallel computation and the memory function. Various bio-inspired \u27in-memory computing\u27 (IMC) devices were developed during the past decades, such as synaptic transistors for artificial synapses. By integrating with specific sensors, neuromorphic sensing systems are achievable with the bio-inspired signal perception function. A signal perception process is possible by a combination of stimuli sensing, signal conversion/transmission, and signal processing. However, most neuromorphic sensing systems were demonstrated without signal conversion/transmission functions. Therefore, those cannot fully mimic the function provides by the sensory neuron in the biological system. This thesis aims to design a neuromorphic sensing system with a complete function as biological sensory neurons. To reach such a target, 3D printed sensors, electrical oscillators, and synaptic transistors were developed as functions of artificial receptors, artificial neurons, and artificial synapses, respectively. Moreover, since the 3D printing technology has demonstrated a facile process due to fast prototyping, the proposed 3D neuromorphic sensing system was designed as a 3D integrated structure and fabricated by 3D printing technologies. A novel multi-axis robot 3D printing system was also utilized to increase the fabrication efficiency with the capability of printing on vertical and tilted surfaces seamlessly. Furthermore, the developed 3D neuromorphic system was easily adapted to the application of tactile sensing. A portable neuromorphic system was integrated with a tactile sensing system for the intelligent tactile sensing application of the humanoid robot. Finally, the bio-inspired reflex arc for the unconscious response was also demonstrated by training the neuromorphic tactile sensing system

    Smart Gas Sensors: Materials, Technologies, Practical ‎Applications, and Use of Machine Learning – A Review

    Get PDF
    The electronic nose, popularly known as the E-nose, that combines gas sensor arrays (GSAs) with machine learning has gained a strong foothold in gas sensing technology. The E-nose designed to mimic the human olfactory system, is used for the detection and identification of various volatile compounds. The GSAs develop a unique signal fingerprint for each volatile compound to enable pattern recognition using machine learning algorithms. The inexpensive, portable and non-invasive characteristics of the E-nose system have rendered it indispensable within the gas-sensing arena. As a result, E-noses have been widely employed in several applications in the areas of the food industry, health management, disease diagnosis, water and air quality control, and toxic gas leakage detection. This paper reviews the various sensor fabrication technologies of GSAs and highlights the main operational framework of the E-nose system. The paper details vital signal pre-processing techniques of feature extraction, feature selection, in addition to machine learning algorithms such as SVM, kNN, ANN, and Random Forests for determining the type of gas and estimating its concentration in a competitive environment. The paper further explores the potential applications of E-noses for diagnosing diseases, monitoring air quality, assessing the quality of food samples and estimating concentrations of volatile organic compounds (VOCs) in air and in food samples. The review concludes with some challenges faced by E-nose, alternative ways to tackle them and proposes some recommendations as potential future work for further development and design enhancement of E-noses
    • …
    corecore