268 research outputs found

    Multimodal Short Video Rumor Detection System Based on Contrastive Learning

    Full text link
    With short video platforms becoming one of the important channels for news sharing, major short video platforms in China have gradually become new breeding grounds for fake news. However, it is not easy to distinguish short video rumors due to the great amount of information and features contained in short videos, as well as the serious homogenization and similarity of features among videos. In order to mitigate the spread of short video rumors, our group decides to detect short video rumors by constructing multimodal feature fusion and introducing external knowledge after considering the advantages and disadvantages of each algorithm. The ideas of detection are as follows: (1) dataset creation: to build a short video dataset with multiple features; (2) multimodal rumor detection model: firstly, we use TSN (Temporal Segment Networks) video coding model to extract video features; then, we use OCR (Optical Character Recognition) and ASR (Automatic Character Recognition) to extract video features. Recognition) and ASR (Automatic Speech Recognition) fusion to extract text, and then use the BERT model to fuse text features with video features (3) Finally, use contrast learning to achieve distinction: first crawl external knowledge, then use the vector database to achieve the introduction of external knowledge and the final structure of the classification output. Our research process is always oriented to practical needs, and the related knowledge results will play an important role in many practical scenarios such as short video rumor identification and social opinion control

    ์†Œ์…œ ๋ฏธ๋””์–ด ์† ๋ฃจ๋จธ ํƒ์ง€๋ฅผ ์œ„ํ•œ ๊ทธ๋ž˜ํ”„ ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง๊ณผ ์–ดํ…์…˜ ๋ฉ”์ปค๋‹ˆ์ฆ˜

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (์„์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ปดํ“จํ„ฐ๊ณตํ•™๋ถ€, 2020. 8. ๊น€์ข…๊ถŒ.Social media has been a great disseminator for new information and thoughts. Due to its accessibility of sharing information, however, social media has also become an ideal platform for propagations of rumors, fake news, and misinformation. Rumors on social media not only mislead the users of online but also affects the real world immensely. Thus, detecting the rumors and preventing their spread became an essential task. Previous learning-based rumor detection methods adopted to use contents, users, or propagation features of rumors. However, the methods are limited to represent rumor propagation as static graphs, which arent optimal for capturing the dynamic information of the rumors. In this study, we propose a novel graph convolutional networks with attention mechanism model named, Dynamic GCN, for rumor detection. We first represent rumor posts with their responsive posts as dynamic graphs. The temporal information is used to generate a sequence of graph snapshots. The representation learning on graph snapshots with attention mechanism captures both structural and temporal information of rumor spreads. The conducted experiments on two real-world datasets demonstrate that our model, Dynamic GCN, achieves superior results over the state-of-the-art models in the rumor detection task.์†Œ์…œ ๋ฏธ๋””์–ด๋Š” ๊ฐ•๋ ฅํ•œ ์ •๋ณด ์ „๋‹ฌ๋ ฅ์„ ๊ฐ€์ง„ ๋งค์ฒด๋กœ ์ƒˆ๋กœ์šด ์ •๋ณด์™€ ์ƒ๊ฐ์˜ ์ „ํŒŒ ์ฐฝ๊ตฌ์ด๋‹ค. ์†Œ์…œ ๋ฏธ๋””์–ด์˜ ํŠน์ง•์ธ ์ ‘๊ทผ์„ฑ์€ ๋•Œ๋ก  ๋ฃจ๋จธ, ๊ฐ€์งœ ๋‰ด์Šค, ์ž˜๋ชป๋œ ์ •๋ณด์˜ ์ „ํŒŒ์—์„œ๋„ ์ด์ƒ์ ์ธ ํ”Œ๋žซํผ์ด ๋œ๋‹ค. ์†Œ์…œ ๋ฏธ๋””์–ด ์† ๋ฃจ๋จธ๋Š” ์˜จ๋ผ์ธ ์‚ฌ์šฉ์ž๋ฅผ ์˜ค๋„ํ•  ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ๋•Œ๋ก  ํ˜„์‹ค ์„ธ๊ณ„์—๋„ ํฐ ์˜ํ–ฅ์„ ๋ฏธ์นœ๋‹ค. ๋”ฐ๋ผ์„œ, ๋ฃจ๋จธ๋ฅผ ํƒ์ง€ํ•˜๊ณ  ๊ทธ ์ „ํŒŒ๋ฅผ ๋ง‰๋Š” ๋…ธ๋ ฅ์ด ์š”๊ตฌ๋œ๋‹ค. ๊ธฐ์กด์˜ ๋ฃจ๋จธ ํƒ์ง€ ๋ฐฉ๋ฒ•์€ ๋ฃจ๋จธ์˜ ๋‚ด์šฉ, ์‚ฌ์šฉ์ž, ๋˜๋Š” ์ „ํŒŒ ๊ณผ์ •์˜ ์ •๋ณด๋ฅผ ํŠน์„ฑ์œผ๋กœ ์ด์šฉํ•œ๋‹ค. ์ด๋Ÿฌํ•œ ๋ฐฉ๋ฒ•์€ ๋ฃจ๋จธ์˜ ์ „ํŒŒ๋ฅผ ์ •์  ๊ทธ๋ž˜ํ”„๋กœ ํ‘œํ˜„ํ•˜๋ฉฐ ๊ทธ ๊ตฌ์กฐ์  ํŠน์„ฑ์„ ์ด์šฉํ•œ๋‹ค. ํ•˜์ง€๋งŒ ์ด๋Š” ๋ฃจ๋จธ์˜ ๋™์  ํŠน์„ฑ์„ ํฌ์ฐฉํ•˜์ง€ ๋ชปํ•œ๋‹ค๋Š” ํ•œ๊ณ„๋ฅผ ๊ฐ€์ง€๊ณ  ์žˆ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ๊ทธ๋ž˜ํ”„ ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง (graph convolutional networks: GCN)๊ณผ ์–ดํ…์…˜ ๋ฉ”์ปค๋‹ˆ์ฆ˜ (attention mechanism)์„ ํ™œ์šฉํ•œ ๋™์  ๊ทธ๋ž˜ํ”„ ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง (Dynamic GCN) ๋ฃจ๋จธ ํƒ์ง€ ๋ชจ๋ธ์„ ์ œ์‹œํ•œ๋‹ค. ๋จผ์ €, ์†Œ์…œ ๋ฏธ๋””์–ด ์† ๋ฃจ๋จธ ๊ฒŒ์‹œ๊ธ€๋“ค (posts) ๊ณผ ๊ทธ์˜ ๋‹ต์žฅ์ด ๋˜๋Š” ๊ธ€๋“ค(responsive posts)์„ ์ด์šฉํ•˜์—ฌ ๋ฃจ๋จธ์˜ ์ „ํŒŒ ๊ณผ์ •์„ ์ •์  ๊ทธ๋ž˜ํ”„๋กœ ํ‘œํ˜„ํ•œ๋‹ค. ์‹œ๊ฐ„ ์ •๋ณด๋ฅผ ํ†ตํ•ด ์ „ํŒŒ ๊ณผ์ •์„ ํฌํ•จํ•˜๊ณ  ์žˆ๋Š” ์ •์  ๊ทธ๋ž˜ํ”„์˜ ์ง‘ํ•ฉ์ธ ๊ทธ๋ž˜ํ”„ ์Šค๋ƒ…์ˆ (graph snapshot) ์‹œํ€€์Šค (sequence)๋ฅผ ๋งŒ๋“ค๊ฒŒ ๋œ๋‹ค. ์–ดํ…์…˜ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ํ™œ์šฉํ•œ ๊ทธ๋ž˜ํ”„ ์Šค๋ƒ…์ˆ ํ‘œํ˜„ ํ•™์Šต์€ ๋ฃจ๋จธ ์ „ํŒŒ์˜ ๊ตฌ์กฐ์  ์‹œ๊ฐ„์  ์ •๋ณด๋ฅผ ๋ชจ๋‘ ํšจ๊ณผ์ ์œผ๋กœ ๋ฐ˜์˜ํ•œ๋‹ค. ์‹ค์ œ ํŠธ์œ„ํ„ฐ ๋ฐ์ดํ„ฐ๋ฅผ ์ด์šฉํ•œ ์‹คํ—˜์„ ํ†ตํ•˜์—ฌ ์ œ์‹œ๋œ ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์ด ๋‹ค๋ฅธ ๋น„๊ต ๋ชจ๋ธ๋“ค๋ณด๋‹ค ๋†’์Œ์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค.Chapter I Introduction 1 Chapter II Related Work 5 2.1 Rumor Detection 5 2.2 Graph Convolutional Networks 6 2.3 Learning Sequences & Attention Mechanism 7 Chapter III Problem Definition 9 Chapter IV Dynamic GCN with Attention Mechanism 11 4.1 Snapshot Generation 13 4.2 Graph Convolutional Networks 14 4.3 Readout Layer 15 4.4 Attention Mechanism 16 4.5 Training & Prediction 17 Chapter V Experiments 18 5.1 Datasets 18 5.2 Baselines 20 5.3 Experimental Setup & Implementation Details 21 5.4 Performance Evaluations 24 5.5 Ablation Study 25 Chapter VI Conclusion 30 Bibliography 31 ์ดˆ ๋ก 40Maste

    Towards Evaluating Veracity of Textual Statements on the Web

    Get PDF
    The quality of digital information on the web has been disquieting due to the absence of careful checking. Consequently, a large volume of false textual information is being produced and disseminated with misstatements of facts. The potential negative influence on the public, especially in time-sensitive emergencies, is a growing concern. This concern has motivated this thesis to deal with the problem of veracity evaluation. In this thesis, we set out to develop machine learning models for the veracity evaluation of textual claims based on stance and user engagements. Such evaluation is achieved from three aspects: news stance detection engaged user replies in social media and the engagement dynamics. First of all, we study stance detection in the context of online news articles where a claim is predicted to be true if it is supported by the evidential articles. We propose to manifest a hierarchical structure among stance classes: the high-level aims at identifying relatedness, while the low-level aims at classifying, those identified as related, into the other three classes, i.e., agree, disagree, and discuss. This model disentangles the semantic difference of related/unrelated and the other three stances and helps address the class imbalance problem. Beyond news articles, user replies on social media platforms also contain stances and can infer claim veracity. Claims and user replies in social media are usually short and can be ambiguous; to deal with semantic ambiguity, we design a deep latent variable model with a latent distribution to allow multimodal semantic distribution. Also, marginalizing the latent distribution enables the model to be more robust in relatively smalls-sized datasets. Thirdly, we extend the above content-based models by tracking the dynamics of user engagement in misinformation propagation. To capture these dynamics, we formulate user engagements as a dynamic graph and extract its temporal evolution patterns and geometric features based on an attention-modified Temporal Point Process. This allows to forecast the cumulative number of engaged users and can be useful in assessing the threat level of an individual piece of misinformation. The ability to evaluate veracity and forecast the scale growth of engagement networks serves to practically assist the minimization of online false informationโ€™s negative impacts
    • โ€ฆ
    corecore