77,563 research outputs found

    Learning representations for information mining from text corpora with applications to cyber threat intelligence

    Get PDF
    Doctor of PhilosophyDepartment of Computer ScienceWilliam H HsuThis research develops learning representations and architectures for natural language understanding, within an information mining framework for analysis of open-source cyber threat intelligence (CTI). Both contextual (sequential) and topological (graph-based) encodings of short text documents are modeled. To accomplish this goal, a series of machine learning tasks are defined, and learning representations are developed to detect crucial information in these documents: cyber threat entities, types, and events. Using hybrid transformer-based implementations of these learning models, CTI-relevant key phrases are identified, and specific cyber threats are classified using classification models based upon graph neural networks (GNNs). The central scientific goal here is to learn features from corpora consisting of short texts for multiple document categorization and information extraction sub-tasks to improve the accuracy, precision, recall, and F1 score of a multimodal framework. To address a performance gap (e.g., classification accuracy) for text classification, a novel multi-dimensional Feature Attended Parametric Kernel Graph Neural Network (APKGNN) layer is introduced to construct a GNN model in this dissertation where the text classification task is transformed into a graph node classification task. To extract key phrases, contextual semantic tagging with text sequences as input to transformers is used which improves a transformer's learning representation. By deriving a set of characteristics ranging from low-level (lexical) natural language features to summative extracts, this research focuses on reducing human effort by adopting a combination of semi-supervised approaches for learning syntactic, semantic, and topological feature representation. The following central research questions are addressed: can CTI-relevant key phrases be identified effectively with reduced human effort; whether threats be classified into different types; and can threat events be detected and ranked from social media like Twitter data and other benchmark data sets. Developing an integrated system to answer these research questions showed that user-specific information in shared social media content, and connections (followers and followees) are effective and crucial for algorithmically tracing active CTI user accounts from open-source social network data. All these components, used in combination, facilitate the understanding of key analytical tasks and objectives of open-source cyber-threat intelligence

    Graph Convolutional Networks for Predictive Healthcare using Clinical Notes

    Get PDF
    ν•™μœ„λ…Όλ¬Έ (석사) -- μ„œμšΈλŒ€ν•™κ΅ λŒ€ν•™μ› : κ³΅κ³ΌλŒ€ν•™ 컴퓨터곡학뢀, 2020. 8. κΉ€μ„ .Clinical notes in Electronic Health Record(EHR) system are recorded in free text forms with different styles and abbreviations of personal preference. Thus, it is very difficult to extract clinically meaningful information from EHR clinical notes. There are many computational methods developed for tasks such as medical text normalization, medical entity extraction, and patient-level prediction tasks. Existing methods for the patient-level prediction task focus on capturing the contextual or sequential information from clinical texts, but they are not designed to capture global and non-consecutive information in the clinical texts. Recently, graph convolutional neural networks(GCNs) are successfully used for text-based classification since GCN can extract the global and long-distance information among the whole texts. However, application of GCN for mining clinical notes is yet to be fully explored. In this study, we propose an end-to-end framework for the analysis of clinical notes using graph neural network-based techniques to predict whether a patient is with MRSA (Methicillin-Resistant Staphylococcus Aureus) positive infection or negative infection. For this MRSA infection prediction, it is critical to capture the patient-specific and global non-consecutive information from patient clinical notes. The clinical notes of a patient are processed to construct a patient-level graph, and each patient-level graph is fed into the GCN-based framework for graph-level supervised learning. The proposed framework consists of a graph convolutional network layer, a graph pooling layer, and a readout layer, followed by a fully connected layer. We tested various settings of the GCN-based framework with various combinations of graph convolution operations and graph pooling methods and we evaluated the performance of each variant framework. In experiments with MRSA infection data, all of the variant frameworks with graph structure information outperformed several baseline methods without using graph structure information with a margin of 2.93%∼11.81%. We also investigated graphs in the pooling step to conduct interpretable analysis in population-based statistical and patient-specific aspects, respectively. With this inspection, we found long-distance word pairs that are distinct for MRSA positive patients and we also showed the pooled graph of the patient that contributes to the patient-specific prediction. Moreover, the Adaboost algorithm was used to improve the performance further. As a result, the framework proposed in this paper reached the highest performance of 85.70%, which is higher than the baseline methods with a margin of 3.71%∼12.59%.μ „μž 건강 기둝은 디지털 ν˜•νƒœλ‘œ μ²΄κ³„μ μœΌλ‘œ μˆ˜μ§‘λœ ν™˜μžμ˜ 건강 정보닀. μ „μž 건강 기둝이 ν™˜μžμ˜ μƒνƒœλ₯Ό ν‘œν˜„ ν•˜λŠ” λ‹¨μ–΄λ“€λ‘œ κ΅¬μ„±λœ λ¬Έμ„œμ˜ μ§‘ν•©μ΄κΈ°λ•Œλ¬Έμ— μžμ—°μ–΄ 처리 뢄야에 μ μš©λ˜λŠ” λ‹€μ–‘ν•œ κΈ°κ³„ν•™μŠ΅μ  방법듀이 μ μš©λ˜μ–΄μ™”λ‹€. 특히, λ”₯λŸ¬λ‹ 기술의 λ°œμ „μœΌλ‘œ 인해, μ΄λ―Έμ§€λ‚˜ ν…μŠ€νŠΈ λΆ„μ•Όμ—μ„œ ν™œμš© 되던 λ”₯λŸ¬λ‹ 기술 듀이생λͺ…μ •λ³΄λ°μ˜ν•™μ •λ³΄λΆ„μ•Όμ—μ μ°¨μ μš©λ˜κ³ μžˆλ‹€.ν•˜μ§€λ§Œ,κΈ°μ‘΄μ˜μ΄λ―Έμ§€λ‚˜ ν…μŠ€νŠΈλ°μ΄ν„°μ™€λŠ” λ‹€λ₯΄κ²Œ, μ „μž 건강 기둝 λ°μ΄ν„°λŠ” μž‘μ„±μž 및 ν™˜μž 개개인의 μƒνƒœμ— λ”°λΌμ„œ, λ°μ΄ν„°μ˜ ν™˜μž νŠΉμ΄μ„±μ΄ λ†’λ‹€. λ˜ν•œ, μœ μ‚¬ν•œ 의미λ₯Ό μ§€λ‹ˆλŠ” 건강 κΈ°λ‘λ“€κ°„μ˜ 상관관계λ₯Ό κ³ λ €ν•΄μ•Ό ν•  ν•„μš”κ°€μžˆλ‹€. λ³Έμ—°κ΅¬μ—μ„œλŠ” μ „μž 건강 기둝 λ°μ΄ν„°μ˜ ν™˜μžνŠΉμ΄μ„±μ„ κ³ λ €ν•œ κ·Έλž˜ν”„ 기반 λ”₯λŸ¬λ‹ λͺ¨λΈμ„ κ³ μ•ˆν•˜μ˜€λ‹€. ν™˜μžμ˜ μ „μž 건강 기둝 데이터와 의료 λ¬Έμ„œλ“€μ˜ 곡톡 μΆœν˜„ λΉˆλ„λ₯Ό ν™œμš© ν•˜μ—¬ ν™˜μž 특이적 κ·Έλž˜ν”„λ₯Ό μƒμ„±ν•˜μ˜€λ‹€. 이λ₯Ό 기반으둜, κ·Έλž˜ν”„ μ»¨λ³Όλ£¨μ…˜ λ„€νŠΈμ›Œν¬λ₯Ό μ‚¬μš©ν•˜μ—¬ ν™˜μžμ˜ λ³‘λ¦¬ν•™μ μƒνƒœλ₯Όμ˜ˆμΈ‘ν•˜λŠ”λͺ¨λΈμ„κ³ μ•ˆν•˜μ˜€λ‹€. μ—°κ΅¬μ—μ„œ μ‚¬μš©ν•œ λ°μ΄ν„°λŠ” Methicillin-Resistant Staphylococcus Aureus(MRSA) 감염 μ—¬λΆ€λ₯Ό μΈ‘μ •ν•œ 데이터이닀. κ³ μ•ˆν•œ κ·Έλž˜ν”„κΈ°λ°˜ λ”₯λŸ¬λ‹ λͺ¨λΈμ„ 톡해 ν™˜μžμ˜ 내성을 μ˜ˆμΈ‘ν•œ κ²°κ³Ό, κ·Έλž˜ν”„μ •λ³΄λ₯Ό ν™œμš© ν•˜μ§€ μ•Šμ€ κΈ°μ‘΄λͺ¨λΈλ“€ 보닀 2.93%∼11.81% λ›°μ–΄λ‚œμ„±λŠ₯μ„λ³΄μ˜€λ‹€. λ˜ν•œ 해석 κ°€λŠ₯ν•œ 뢄석을 μˆ˜ν–‰ν•˜κΈ° μœ„ν•΄ 풀링 λ‹¨κ³„μ—μ„œ κ·Έλž˜ν”„λ₯Ό μ‘°μ‚¬ν–ˆλ‹€.이λ₯Ό 톡해 MRSA μ–‘μ„± ν™˜μžμ— λŒ€ν•΄ κ΅¬λ³„λ˜λŠ” μž₯거리 λ‹¨μ–΄νŒ¨ν„΄μ„ μ°Ύμ•˜μœΌλ©° ν™˜μžλ³„ μ˜ˆμΈ‘μ— κΈ°μ—¬ν•˜λŠ” ν™˜μžμ˜ 합동 κ·Έλž˜ν”„λ₯Ό 보여 μ£Όμ—ˆλ‹€. μ„±λŠ₯을 λ”μš± ν–₯μƒμ‹œν‚€κΈ° μœ„ν•΄ μ•„λ‹€λΆ€μŠ€νŠΈ μ•Œκ³ λ¦¬μ¦˜μ„ μ‚¬μš©ν•˜μ˜€λ‹€. λ³Έ λ…Όλ¬Έμ—μ„œ μ œμ•ˆλœ κ²°κ³ΌλŠ” 85.70%둜 κ°€μž₯ 높은 μ„±λŠ₯을 κΈ°λ‘ν–ˆμœΌλ©°, μ΄λŠ” κΈ°μ‘΄ λͺ¨λΈλ³΄λ‹€ 3.71%∼12.59%의 ν–₯상 μ‹œμΌ°μŒμ„ λ³΄μ—¬μ£Όμ—ˆλ‹€.Chapter 1 Introduction 1 1.1 Background 1 1.1.1 EHR Clinical Text Data 1 1.1.2 Current methods and limitations 3 1.2 Problem Statement and Contributions 4 Chapter 2 Related Works 6 2.1 Traditional Methods 6 2.2 Deep Learning Methods 7 2.3 Graph Neural Networks 8 2.3.1 Graph Convolutional Networks 8 2.3.2 Graph Pooling Methods 9 2.3.3 Applications of GNN 10 Chapter 3 Methods and Materials 12 3.1 Notation and Problem Definition 12 3.2 Patient Graph Construction Process 14 3.2.1 Parsing and Filtering 15 3.2.2 Word Co-occurrence Finding 16 3.2.3 Patient-level Graph Representation 16 3.3 Word Embedding 17 3.4 Model Architecture 18 3.4.1 Graph Convolutional Network layer 19 3.4.2 Graph Pooling layer 22 3.4.3 Readout Layer 24 3.5 Prediction and Loss Function 25 3.6 Adaboost algorithm 25 Chapter 4 Experiments 27 4.1 EHR Dataset 27 4.1.1 Introduction to MIMIC-III Dataset 27 4.1.2 MRSA Data Collection 28 4.2 Hyper Parameter Settings 28 4.2.1 Model Training 29 4.3 Baseline Models 30 Chapter 5 Results 32 5.1 Performance Comparisons with baseline models 32 5.2 Performance Comparisons with graph networks 33 5.3 Interpretable analysis 34 5.4 Adaboost Result 38 Chapter 6 Conclusion 40 ꡭ문초둝 49 κ°μ‚¬μ˜ κΈ€ 50Maste

    Contextualized Non-local Neural Networks for Sequence Learning

    Full text link
    Recently, a large number of neural mechanisms and models have been proposed for sequence learning, of which self-attention, as exemplified by the Transformer model, and graph neural networks (GNNs) have attracted much attention. In this paper, we propose an approach that combines and draws on the complementary strengths of these two methods. Specifically, we propose contextualized non-local neural networks (CN3^{\textbf{3}}), which can both dynamically construct a task-specific structure of a sentence and leverage rich local dependencies within a particular neighborhood. Experimental results on ten NLP tasks in text classification, semantic matching, and sequence labeling show that our proposed model outperforms competitive baselines and discovers task-specific dependency structures, thus providing better interpretability to users.Comment: Accepted by AAAI201
    • …
    corecore