
552 research outputs found

Information Retrieval: Recent Advances and Beyond

Author: Hambarde Kailash A.
Proenca Hugo
Publication date: 01/01/2023

In this paper, we provide a detailed overview of the models used for information retrieval in the first and second stages of the typical processing chain. We discuss the current state-of-the-art models, including term-based, semantic retrieval, and neural methods. Additionally, we delve into the key topics related to the learning process of these models. In this way, the survey offers a comprehensive understanding of the field and is of interest to researchers and practitioners entering or working in the information retrieval domain.

arXiv.org e-Print Archive

Directory of Open Access Journals

Ambiguitas Machine Translation pada Cross Language Chatbot Bea Cukai (Machine Translation Ambiguity in a Cross-Language Customs and Excise Chatbot)

Author: Al Haromainy Muhammad Muharrom
Arifin Agus Zainal
Setyawan Dimas Ari
Waluya Onny Kartika
Publication venue: 'Universitas Pesantren Tinggi Darul Ulum (Unipdu)'
Publication date: 01/01/2019

Information retrieval (IR) and chatbot systems are increasingly being developed, and cross-language support is one of the most studied aspects. A key problem in cross-language systems is that machine translation errors produce output whose meaning does not match natural language, so users receive wrong answers or, not infrequently, no answer at all. This study proposes a new machine translation scheme that aims to improve performance on ambiguous terms. The translation engine first checks the correctness of the keywords and then applies Part-of-Speech (POS) tagging to identify nouns. Synonyms of each detected noun are looked up and added to form alternative queries, and the query with the highest confidence value is assumed to be the most appropriate. In testing, adding the proposed method to machine translation improved chatbot accuracy compared with not using it: accuracy rose from 73% to 77% (reported as a 5% gain).
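The scheme summarized in the abstract above (tag nouns, look up their synonyms, form alternative queries, keep the one with the highest confidence) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the noun set stands in for a POS tagger, the synonym table stands in for a thesaurus, the FAQ list stands in for the chatbot's knowledge base, and the Jaccard overlap stands in for the confidence value, none of which are specified in the abstract.

```python
from itertools import product

# Hypothetical toy resources; the paper's actual tagger, thesaurus,
# knowledge base, and confidence scorer are not given in the abstract.
NOUNS = {"duty", "goods", "fee"}
SYNONYMS = {"duty": ["tax", "tariff"], "goods": ["items"]}
FAQ = [
    "how is import tariff on items calculated",
    "where do i pay the customs fee",
]


def expand_query(query):
    """Return the original query plus variants with nouns swapped for synonyms."""
    tokens = query.lower().split()
    # Each token keeps itself as an option; nouns also get their synonyms.
    options = [[t] + SYNONYMS.get(t, []) if t in NOUNS else [t] for t in tokens]
    return [" ".join(combo) for combo in product(*options)]


def confidence(query, document):
    """Toy confidence value: Jaccard overlap between the two token sets."""
    q, d = set(query.split()), set(document.split())
    return len(q & d) / len(q | d)


def best_query(query):
    """Pick the variant whose best FAQ match has the highest confidence."""
    scored = [
        (max(confidence(variant, doc) for doc in FAQ), variant)
        for variant in expand_query(query)
    ]
    return max(scored)[1]
```

Calling `best_query("how is import duty on goods calculated")` tries every synonym combination for the two nouns and keeps the variant that best matches an FAQ entry. In a real system the confidence value would presumably come from the retrieval or chatbot backend rather than from token overlap.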

Jurnal Online Unipdu Jombang (Universitas Pesantren Tinggi Darul 'Ulum)

PersoNER: Persian named-entity recognition

Author: Abdous M
Borzeshi EZ
Piccardi M
Poostchi H
Publication date: 01/01/2016

© 1963-2018 ACL. Named-Entity Recognition (NER) is still a challenging task for languages with low digital resources. The main difficulties arise from the scarcity of annotated corpora and the consequent difficulty of training an effective NER pipeline. To bridge this gap, in this paper we target the Persian language, which is spoken by over a hundred million people worldwide. We first present and provide ArmanPersoNERCorpus, the first manually-annotated Persian NER corpus. Then, we introduce PersoNER, an NER pipeline for Persian that leverages a word embedding and a sequential max-margin classifier. The experimental results show that the proposed approach is capable of achieving interesting MUC7 and CoNLL scores while outperforming two alternatives based on a CRF and a recurrent neural network.

OPUS - University of Technology Sydney

Representation Learning for Natural Language Processing

Author: Lin Yankai
Liu Zhiyuan
Sun Maosong
Publication venue: 'Springer Science and Business Media LLC'

This open access book provides an overview of the recent advances in representation learning theory, algorithms and applications for natural language processing (NLP). It is divided into three parts. Part I presents the representation learning techniques for multiple language entries, including words, phrases, sentences and documents. Part II then introduces the representation techniques for those objects that are closely related to NLP, including entity-based world knowledge, sememe-based linguistic knowledge, networks, and cross-modal entries. Lastly, Part III provides open resource tools for representation learning techniques, and discusses the remaining challenges and future research directions. The theories and algorithms of representation learning presented can also benefit other related domains such as machine learning, social network analysis, semantic Web, information retrieval, data mining and computational biology. This book is intended for advanced undergraduate and graduate students, post-doctoral fellows, researchers, lecturers, and industrial engineers, as well as anyone interested in representation learning and natural language processing.

OAPEN Library

Foundations and Recent Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions

Author: Liang Paul Pu
Morency Louis-Philippe
Zadeh Amir
Publication date: 07/09/2022

Multimodal machine learning is a vibrant multi-disciplinary research field that aims to design computer agents with intelligent capabilities such as understanding, reasoning, and learning through integrating multiple communicative modalities, including linguistic, acoustic, visual, tactile, and physiological messages. With the recent interest in video understanding, embodied autonomous agents, text-to-image generation, and multisensor fusion in application domains such as healthcare and robotics, multimodal machine learning has brought unique computational and theoretical challenges to the machine learning community given the heterogeneity of data sources and the interconnections often found between modalities. However, the breadth of progress in multimodal research has made it difficult to identify the common themes and open questions in the field. By synthesizing a broad range of application domains and theoretical frameworks from both historical and recent perspectives, this paper is designed to provide an overview of the computational and theoretical foundations of multimodal machine learning. We start by defining two key principles of modality heterogeneity and interconnections that have driven subsequent innovations, and propose a taxonomy of 6 core technical challenges: representation, alignment, reasoning, generation, transference, and quantification, covering historical and recent trends. Recent technical achievements will be presented through the lens of this taxonomy, allowing researchers to understand the similarities and differences across new approaches. We end by motivating several open problems for future research as identified by our taxonomy.

arXiv.org e-Print Archive

A Continuously Growing Dataset of Sentential Paraphrases

Author: He Hua
Lan Wuwei
Qiu Siyu
Xu Wei
Publication date: 01/01/2017

A major challenge in paraphrase research is the lack of parallel corpora. In this paper, we present a new method to collect large-scale sentential paraphrases from Twitter by linking tweets through shared URLs. The main advantage of our method is its simplicity, as it gets rid of the classifier or human in the loop that previous work needed to select data before annotation and subsequent application of paraphrase identification algorithms. We present the largest human-labeled paraphrase corpus to date of 51,524 sentence pairs and the first cross-domain benchmarking for automatic paraphrase identification. In addition, we show that more than 30,000 new sentential paraphrases can be easily and continuously captured every month at ~70% precision, and demonstrate their utility for downstream NLP tasks through phrasal paraphrase extraction. We make our code and data freely available. Comment: 11 pages, accepted to EMNLP 201
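The core idea of linking tweets through shared URLs can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' released code: the function name `candidate_pairs`, the URL regular expression, and the choice to strip URLs and drop exact duplicate texts are all hypothetical, and the resulting pairs are merely candidates that would still need labeling.

```python
import re
from collections import defaultdict
from itertools import combinations

# Simple illustrative pattern; real tweets would need more careful URL handling.
URL_PATTERN = re.compile(r"https?://\S+")


def candidate_pairs(tweets):
    """Group tweets by the URLs they contain and pair tweets that share a URL."""
    by_url = defaultdict(list)
    for tweet in tweets:
        # Keep only the sentence text; the URL serves as the grouping key.
        text = URL_PATTERN.sub("", tweet).strip()
        for url in set(URL_PATTERN.findall(tweet)):
            by_url[url].append(text)

    pairs = []
    for texts in by_url.values():
        # Drop exact duplicates (e.g. retweets) while preserving order.
        unique = list(dict.fromkeys(texts))
        pairs.extend(combinations(unique, 2))
    return pairs
```

Given a list of tweets, `candidate_pairs` returns tuples of texts that point at the same URL; tweets whose URL appears only once yield no pair.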

arXiv.org e-Print Archive

Crossref

Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey

Author: Chen Guangyao
Gao Pengcheng
Gao Wen
Qian Guangwu
Tian Yonghong
Wang Xiao
Wang Yaowei
Wei Xiao-Yong
Publication date: 31/10/2023

With the urgent demand for generalized deep models, many pre-trained big models have been proposed, such as BERT, ViT, and GPT. Inspired by the success of these models in single domains (like computer vision and natural language processing), multi-modal pre-trained big models have also drawn more and more attention in recent years. In this work, we give a comprehensive survey of these models and hope this paper can provide new insights and help new researchers track the most cutting-edge work. Specifically, we first introduce the background of multi-modal pre-training by reviewing conventional deep learning and pre-training work in natural language processing, computer vision, and speech. Then, we introduce the task definition, key challenges, and advantages of multi-modal pre-training models (MM-PTMs), and discuss the MM-PTMs with a focus on data, objectives, network architectures, and knowledge-enhanced pre-training. After that, we introduce the downstream tasks used for the validation of large-scale MM-PTMs, including generative, classification, and regression tasks. We also give visualization and analysis of the model parameters and results on representative downstream tasks. Finally, we point out possible research directions for this topic that may benefit future work. In addition, we maintain a continuously updated paper list for large-scale pre-trained multi-modal big models: https://github.com/wangxiao5791509/MultiModal_BigModels_Survey Comment: Accepted by Machine Intelligence Research

arXiv.org e-Print Archive