Search CORE

277 research outputs found

Enumeration of Extractive Oracle Summaries

Author: Hirao Tsutomu
Nagata Masaaki
Nishino Masaaki
Suzuki Jun
Publication venue
Publication date: 01/01/2017
Field of study

To analyze the limitations and the future directions of the extractive summarization paradigm, this paper proposes an Integer Linear Programming (ILP) formulation to obtain extractive oracle summaries in terms of ROUGE-N. We also propose an algorithm that enumerates all of the oracle summaries for a set of reference summaries to exploit F-measures that evaluate which system summaries contain how many sentences that are extracted as an oracle summary. Our experimental results obtained from Document Understanding Conference (DUC) corpora demonstrated the following: (1) room still exists to improve the performance of extractive summarization; (2) the F-measures derived from the enumerated oracle summaries have significantly stronger correlations with human judgment than those derived from single oracle summaries.Comment: 12 page

arXiv.org e-Print Archive

Crossref

Utilizing Features of Verbs in Statistical Zero Pronoun Resolution for Japanese Speech

Author: Nagata Masaaki
Yoshida Sen
Publication venue: City University of Hong Kong
Publication date: 01/01/2009
Field of study

PACLIC 23 / City University of Hong Kong / 3-5 December 200

Waseda University Repository

BaseNP Supersense Tagging for Japanese Texts

Author: Nagata Masaaki
Taira Hirotoshi
Yoshida Sen
Publication venue: City University of Hong Kong
Publication date: 01/01/2009
Field of study

PACLIC 23 / City University of Hong Kong / 3-5 December 200

Waseda University Repository

Parathyroid Diseases and Animal Models

Author: Imanishi Yasuo
Inaba Masaaki
Nagata Yuki
Publication venue: Frontiers Research Foundation
Publication date: 01/01/2012
Field of study

Circulating calcium and phosphate are tightly regulated by three hormones: the active form of vitamin D (1,25-dihydroxyvitamin D), fibroblast growth factor (FGF)-23, and parathyroid hormone (PTH). PTH acts to stimulate a rapid increment in serum calcium and has a crucial role in calcium homeostasis. Major target organs of PTH are kidney and bone. The oversecretion of the hormone results in hypercalcemia, caused by increased intestinal calcium absorption, reduced renal calcium clearance, and mobilization of calcium from bone in primary hyperparathyroidism. In chronic kidney disease, secondary hyperparathyroidism of uremia is observed in its early stages, and this finally develops into the autonomous secretion of PTH during maintenance hemodialysis. Receptors in parathyroid cells, such as the calcium-sensing receptor, vitamin D receptor, and FGF receptor (FGFR)-Klotho complex have crucial roles in the regulation of PTH secretion. Genes such as Cyclin D1, RET, MEN1, HRPT2, and CDKN1B have been identified in parathyroid diseases. Genetically engineered animals with these receptors and the associated genes have provided us with valuable information on the patho-physiology of parathyroid diseases. The application of these animal models is significant for the development of new therapies

Crossref

Directory of Open Access Journals

PubMed Central

Frontiers - Publisher Connector

WSPAlign: Word Alignment Pre-training via Large-Scale Weakly Supervised Span Prediction

Author: Nagata Masaaki
Tsuruoka Yoshimasa
Wu Qiyu
Publication venue
Publication date: 08/06/2023
Field of study

Most existing word alignment methods rely on manual alignment datasets or parallel corpora, which limits their usefulness. Here, to mitigate the dependence on manual data, we broaden the source of supervision by relaxing the requirement for correct, fully-aligned, and parallel sentences. Specifically, we make noisy, partially aligned, and non-parallel paragraphs. We then use such a large-scale weakly-supervised dataset for word alignment pre-training via span prediction. Extensive experiments with various settings empirically demonstrate that our approach, which is named WSPAlign, is an effective and scalable way to pre-train word aligners without manual data. When fine-tuned on standard benchmarks, WSPAlign has set a new state-of-the-art by improving upon the best-supervised baseline by 3.3~6.1 points in F1 and 1.5~6.1 points in AER. Furthermore, WSPAlign also achieves competitive performance compared with the corresponding baselines in few-shot, zero-shot and cross-lingual tests, which demonstrates that WSPAlign is potentially more practical for low-resource languages than existing methods.Comment: To appear at ACL 202

arXiv.org e-Print Archive

Extending Word-Level Quality Estimation for Post-Editing Assistance

Author: Nagata Masaaki
Utsuro Takehito
Wei Yizhen
Publication venue
Publication date: 22/09/2022
Field of study

We define a novel concept called extended word alignment in order to improve post-editing assistance efficiency. Based on extended word alignment, we further propose a novel task called refined word-level QE that outputs refined tags and word-level correspondences. Compared to original word-level QE, the new task is able to directly point out editing operations, thus improves efficiency. To extract extended word alignment, we adopt a supervised method based on mBERT. To solve refined word-level QE, we firstly predict original QE tags by training a regression model for sequence tagging based on mBERT and XLM-R. Then, we refine original word tags with extended word alignment. In addition, we extract source-gap correspondences, meanwhile, obtaining gap tags. Experiments on two language pairs show the feasibility of our method and give us inspirations for further improvement

arXiv.org e-Print Archive