14 research outputs found
Annotation-Scheme Reconstruction for "Fake News" and Japanese Fake News Dataset
Fake news provokes many societal problems; therefore, there has been
extensive research on fake news detection tasks to counter it. Many fake news
datasets were constructed as resources to facilitate this task. Contemporary
research focuses almost exclusively on the factuality aspect of the news.
However, this aspect alone is insufficient to explain "fake news," which is a
complex phenomenon that involves a wide range of issues. To fully understand
the nature of each instance of fake news, it is important to observe it from
various perspectives, such as the intention of the false news disseminator, the
harmfulness of the news to our society, and the target of the news. We propose
a novel annotation scheme with fine-grained labeling based on detailed
investigations of existing fake news datasets to capture these various aspects
of fake news. Using the annotation scheme, we construct and publish the first
Japanese fake news dataset. The annotation scheme is expected to provide an
in-depth understanding of fake news. We plan to build datasets for both
Japanese and other languages using our scheme. Our Japanese dataset is
published at https://hkefka385.github.io/dataset/fakenews-japanese/.Comment: 13th International Conference on Language Resources and Evaluation
(LREC), 202
Effect of mobile phase composition on retention factor in supercritical fluid chromatography
In supercritical fluid chromatography with a mixture of carbon dioxide and a modifier as the mobile phase, the relationship between retention factor k and modifier mole fraction x was derived as 1/kmix=xCO2/k0CO2 + xmod/k0mod, where kmix, k0CO2, and k0mod are the retention factors for the mixture solvent, pure CO2 and pure modifier, respectively, and xCO2 and xmod are the mole fractions of CO2 and a modifier, respectively. Since k0CO2 and k0mod are often difficult to determine due to too large of a value for k0CO2 and an invalid value of k0mod for the liquid phase, both values were estimated experimentally from the two retention factors available at the lowest and highest modifier mole fractions at each temperature and pressure. The equation was effective for the retention factors of the R- and S-forms of racemic transstilbene oxide measured in the present study by supercritical fluid chromatography using a modifier such as methanol, ethanol or acetonitrile. Moreover, the equation was also valid for the retention data of various enantioselective separations as well as achiral separations reported in the literature
Arukikata Travelogue Dataset with Geographic Entity Mention, Coreference, and Link Annotation
Geoparsing is a fundamental technique for analyzing geo-entity information in
text. We focus on document-level geoparsing, which considers geographic
relatedness among geo-entity mentions, and presents a Japanese travelogue
dataset designed for evaluating document-level geoparsing systems. Our dataset
comprises 200 travelogue documents with rich geo-entity information: 12,171
mentions, 6,339 coreference clusters, and 2,551 geo-entities linked to
geo-database entries
Search for dark matter produced in association with bottom or top quarks in √s = 13 TeV pp collisions with the ATLAS detector
A search for weakly interacting massive particle dark matter produced in association with bottom or top quarks is presented. Final states containing third-generation quarks and miss- ing transverse momentum are considered. The analysis uses 36.1 fb−1 of proton–proton collision data recorded by the ATLAS experiment at √s = 13 TeV in 2015 and 2016. No significant excess of events above the estimated backgrounds is observed. The results are in- terpreted in the framework of simplified models of spin-0 dark-matter mediators. For colour- neutral spin-0 mediators produced in association with top quarks and decaying into a pair of dark-matter particles, mediator masses below 50 GeV are excluded assuming a dark-matter candidate mass of 1 GeV and unitary couplings. For scalar and pseudoscalar mediators produced in association with bottom quarks, the search sets limits on the production cross- section of 300 times the predicted rate for mediators with masses between 10 and 50 GeV and assuming a dark-matter mass of 1 GeV and unitary coupling. Constraints on colour- charged scalar simplified models are also presented. Assuming a dark-matter particle mass of 35 GeV, mediator particles with mass below 1.1 TeV are excluded for couplings yielding a dark-matter relic density consistent with measurements
Annotation-Scheme Reconstruction for “Fake News” and Japanese Fake News Dataset
Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), Marseille, 20-25 June 2022Fake news provokes many societal problems; therefore, there has been extensive research on fake news detection tasks to counter it. Many fake news datasets were constructed as resources to facilitate this task. Contemporary research focuses almost exclusively on the factuality aspect of the news. However, this aspect alone is insufficient to explain “fake news,” which is a complex phenomenon that involves a wide range of issues. To fully understand the nature of each instance of fake news, it is important to observe it from various perspectives, such as the intention of the false news disseminator, the
harmfulness of the news to our society, and the target of the news. We propose a novel annotation scheme with fine-grained labeling based on detailed investigations of existing fake news datasets to capture these various aspects of fake news. Using the annotation scheme, we construct and publish the first Japanese fake news dataset. The annotation scheme is expected to provide an in-depth understanding of fake news. We plan to build datasets for both Japanese and other languages using our scheme. Our Japanese dataset is published at https://hkefka385.github.io/dataset/fakenews-japanese/
Surveillance of early stage COVID-19 clusters using search query logs and mobile device-based location information
Two clusters of the coronavirus disease 2019 (COVID-19) were confirmed in Hokkaido, Japan, in February 2020. To identify these clusters, this study employed web search query logs of multiple devices and user location information from location-aware mobile devices. We anonymously identified users who used a web search engine (i.e., Yahoo! JAPAN) to search for COVID-19 or its symptoms. We regarded them as web searchers who were suspicious of their own COVID-19 infection (WSSCI). We extracted the location of WSSCI via a mobile operating system application and compared the spatio-temporal distribution of WSSCI with the actual location of the two known clusters. In the early stage of cluster development, we confirmed several WSSCI. Our approach was accurate in this stage and became biased after a public announcement of the cluster development. When other cluster-related resources, such as detailed population statistics, are not available, the proposed metric can capture hints of emerging clusters