Search CORE

6 research outputs found

Text Retrieval with Multi-Stage Re-Ranking Models

Author: Imaichi Osamu
Sasazawa Yuichi
Sogawa Yasuhiro
Yokote Kenichi
Publication venue
Publication date: 14/11/2023
Field of study

The text retrieval is the task of retrieving similar documents to a search query, and it is important to improve retrieval accuracy while maintaining a certain level of retrieval speed. Existing studies have reported accuracy improvements using language models, but many of these do not take into account the reduction in search speed that comes with increased performance. In this study, we propose three-stage re-ranking model using model ensembles or larger language models to improve search accuracy while minimizing the search delay. We ranked the documents by BM25 and language models, and then re-ranks by a model ensemble or a larger language model for documents with high similarity to the query. In our experiments, we train the MiniLM language model on the MS-MARCO dataset and evaluate it in a zero-shot setting. Our proposed method achieves higher retrieval accuracy while reducing the retrieval speed decay

arXiv.org e-Print Archive

Controlling keywords and their positions in text generation

Author: Imaichi Osamu
Morishita Terufumi
Ozaki Hiroaki
Sasazawa Yuichi
Sogawa Yasuhiro
Publication venue
Publication date: 19/04/2023
Field of study

One of the challenges in text generation is to control generation as intended by a user. Previous studies have proposed to specify the keywords that should be included in the generated text. However, this is insufficient to generate text which reflect the user intent. For example, placing the important keyword beginning of the text would helps attract the reader's attention, but existing methods do not enable such flexible control. In this paper, we tackle a novel task of controlling not only keywords but also the position of each keyword in the text generation. To this end, we show that a method using special tokens can control the relative position of keywords. Experimental results on summarization and story generation tasks show that the proposed method can control keywords and their positions. We also demonstrate that controlling the keyword positions can generate summary texts that are closer to the user's intent than baseline. We release our code

arXiv.org e-Print Archive

属性情報を追加した事前学習済みモデルのファインチューニング

Author: Okazaki Naoaki
Sasazawa Yuichi
岡崎直観
笹沢裕一
Publication venue
Publication date: 29/05/2021
Field of study

Institutional Repositories DataBase (IRDB)

WER99 at the NTCIR-15 QA Lab-PoliInfo-2 Classification Task

Author: Okazaki Naoaki
Sasazawa Yuichi
岡崎直観
笹沢裕一
Publication venue
Publication date: 29/05/2021
Field of study

Institutional Repositories DataBase (IRDB)

対話型質問応答の省略補完

Author: Okazaki Naoaki
Sasazawa Yuichi
Takase Sho
岡崎直観
笹沢裕一
高瀬翔
Publication venue
Publication date: 29/05/2020
Field of study

Institutional Repositories DataBase (IRDB)

Neural Question Generation using Interrogative Phrases

Author: Okazaki Naoaki
Sasazawa Yuichi
Takase Sho
岡崎直観
笹沢裕一
高瀬翔
Publication venue
Publication date: 29/05/2020
Field of study

Institutional Repositories DataBase (IRDB)