Search CORE

1,251 research outputs found

AutoSense Model for Word Sense Induction

Author: Amplayo Reinald Kim
Hwang Seung-won
Song Min
Publication venue
Publication date: 22/11/2018
Field of study

Word sense induction (WSI), or the task of automatically discovering multiple senses or meanings of a word, has three main challenges: domain adaptability, novel sense detection, and sense granularity flexibility. While current latent variable models are known to solve the first two challenges, they are not flexible to different word sense granularities, which differ very much among words, from aardvark with one sense, to play with over 50 senses. Current models either require hyperparameter tuning or nonparametric induction of the number of senses, which we find both to be ineffective. Thus, we aim to eliminate these requirements and solve the sense granularity problem by proposing AutoSense, a latent variable model based on two observations: (1) senses are represented as a distribution over topics, and (2) senses generate pairings between the target word and its neighboring word. These observations alleviate the problem by (a) throwing garbage senses and (b) additionally inducing fine-grained word senses. Results show great improvements over the state-of-the-art models on popular WSI datasets. We also show that AutoSense is able to learn the appropriate sense granularity of a word. Finally, we apply AutoSense to the unsupervised author name disambiguation task where the sense granularity problem is more evident and show that AutoSense is evidently better than competing models. We share our data and code here: https://github.com/rktamplayo/AutoSense.Comment: AAAI 201

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Evidentiality-aware Retrieval for Overcoming Abstractiveness in Open-Domain Question Answering

Author: Hwang Seung-won
Jang Myungha
Lee Dahyun
Lee Dongha
Lee Kyungjae
Song Yongho
Yeo Jinyeong
Publication venue
Publication date: 01/02/2024
Field of study

The long-standing goal of dense retrievers in abtractive open-domain question answering (ODQA) tasks is to learn to capture evidence passages among relevant passages for any given query, such that the reader produce factually correct outputs from evidence passages. One of the key challenge is the insufficient amount of training data with the supervision of the answerability of the passages. Recent studies rely on iterative pipelines to annotate answerability using signals from the reader, but their high computational costs hamper practical applications. In this paper, we instead focus on a data-centric approach and propose Evidentiality-Aware Dense Passage Retrieval (EADPR), which leverages synthetic distractor samples to learn to discriminate evidence passages from distractors. We conduct extensive experiments to validate the effectiveness of our proposed method on multiple abstractive ODQA tasks.Comment: Findings of EACL 202

arXiv.org e-Print Archive

Myotonic Dystrophy Type 1 Presenting as Male Infertility

Author: Brewster
Brook
Davies
Dong Won Kim
Ji Woong Park
Ji Yun Jeong
Klesert
Kolettis
Mahadevan
Mifsud
Pan
Sarkar
Seung Ryul Lee
Seung Whan Doo
Won Jae Yang
Woong Bin Kim
Yun Seob Song
Publication venue: The Korean Urological Association
Publication date: 01/01/2012
Field of study

Myotonic dystrophy 1 (DM1) is a multi-system disorder characterized by endocrine defects that include testicular and tubular atrophy, oligospermia and azoospermia, and increased follicle-stimulating hormone levels. We describe a rare case of DM1 presenting as infertility in a 29-year-old man

Crossref

PubMed Central

A High-Yield Fabrication Process for Silicon Neural Probes

Author: Kim Jin Won
Kim Sung June
Oh Seung Jae
Song Jong Keun
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/02/2006
Field of study

There is a great need for silicon microelectrodes that can simultaneously monitor the activity of many neurons in the brain. However, one of the existing processes for fabricating silicon microelectrodes—reactive- ion etching in combination with anisotropic KOH etching—breaks down at the wet-etching step for device release. Here we describe amodified wet-etching sidewall-protection technique for the high-yield fabrication of well-defined silicon probe structures, using a Teflon® shield and low-pressure chemical vapor deposition (LPCVD) silicon nitride. In the proposed method, a micro-tab holds each individual probe to the central scaffold, allowing uniform anisotropicKOHetching. Using this approach, we obtained a well-defined probe structure without device loss during the wet-etching process. This simple method yielded more accurate fabrication and an improved mechanical profile.This work was supported in part by the Korean Science and Foundation (KOSEF) through the Nano-Bioelectronics and Systems Research Center, Seoul National Universit

SNU Open Repository and Archive

Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback

Author: Chae Hyungjoo
Hwang Seung-won
Kang Dongjin
Kwon Taeyoon
Moon Seungjun
Ong Kai Tzu-iunn
Song Yongho
Yeo Jinyoung
Publication venue
Publication date: 13/11/2023
Field of study

Code editing is an essential step towards reliable program synthesis to automatically correct critical errors generated from code LLMs. Recent studies have demonstrated that closed-source LLMs (i.e., ChatGPT and GPT-4) are capable of generating corrective feedback to edit erroneous inputs. However, it remains challenging for open-source code LLMs to generate feedback for code editing, since these models tend to adhere to the superficial formats of feedback and provide feedback with misleading information. Hence, the focus of our work is to leverage open-source code LLMs to generate helpful feedback with correct guidance for code editing. To this end, we present Coffee, a collected dataset specifically designed for code fixing with feedback. Using this dataset, we construct CoffeePots, a framework for COde Fixing with FEEdback via Preference-Optimized Tuning and Selection. The proposed framework aims to automatically generate helpful feedback for code editing while minimizing the potential risk of superficial feedback. The combination of Coffee and CoffeePots marks a significant advancement, achieving state-of-the-art performance on HumanEvalFix benchmark. Codes and model checkpoints are publicly available at https://github.com/Lune-Blue/COFFEE.Comment: Work in progres

arXiv.org e-Print Archive

Challenges in Diagnosing Narcolepsy and Idiopathic Hypersomnia

Author: Ji Hyun Song
Seung-Chul Hong
Tae-Won Kim
Young-Chan Kim
Publication venue: 'Korean Society of Sleep Medicine'
Publication date: 01/03/2023
Field of study

Narcolepsy and idiopathic hypersomnia are central disorders of hypersomnolence accompanied by excessive daytime sleepiness, which are not caused by nocturnal sleep disturbance, sleep deficiency, or circadian rhythm sleep disorders. Several studies have questioned the repeatability of the Multiple Sleep Latency Test (MSLT) in type 2 narcolepsy (NT2) patients. After two or more repeated MSLTs, the diagnosis of type 1 narcolepsy (NT1) is maintained in more than 90% of cases, while only half of the NT2 patients retain their original diagnosis. The diagnosis of NT2 may shift to idiopathic hypersomnia based on the MSLT results, making the differential diagnosis of NT2 and idiopathic hypersomnia particularly challenging. Therefore, this study suggests the need for new tests in addition to the MSLT for diagnostic consistency in NT2 and idiopathic hypersomnia

Directory of Open Access Journals