Search CORE

101 research outputs found

Enhancing Low-resource Fine-grained Named Entity Recognition by Leveraging Coarse-grained Datasets

Author: Jung Woohwan
Lee Su Ah
Oh Seokjin
Publication venue
Publication date: 13/11/2023
Field of study

Named Entity Recognition (NER) frequently suffers from the problem of insufficient labeled data, particularly in fine-grained NER scenarios. Although

K

-shot learning techniques can be applied, their performance tends to saturate when the number of annotations exceeds several tens of labels. To overcome this problem, we utilize existing coarse-grained datasets that offer a large number of annotations. A straightforward approach to address this problem is pre-finetuning, which employs coarse-grained data for representation learning. However, it cannot directly utilize the relationships between fine-grained and coarse-grained entities, although a fine-grained entity type is likely to be a subcategory of a coarse-grained entity type. We propose a fine-grained NER model with a Fine-to-Coarse(F2C) mapping matrix to leverage the hierarchical structure explicitly. In addition, we present an inconsistency filtering method to eliminate coarse-grained entities that are inconsistent with fine-grained entity types to avoid performance degradation. Our experimental results show that our method outperforms both

K

-shot learning and supervised learning methods when dealing with a small number of fine-grained annotations.Comment: Accepted to EMNLP 202

arXiv.org e-Print Archive

Data Augmentation for Neural Machine Translation using Generative Language Model

Author: Jung Woohwan
Lee Su Ah
Oh Seokjin
Publication venue
Publication date: 13/11/2023
Field of study

Despite the rapid growth in model architecture, the scarcity of large parallel corpora remains the main bottleneck in Neural Machine Translation. Data augmentation is a technique that enhances the performance of data-hungry models by generating synthetic data instead of collecting new ones. We explore prompt-based data augmentation approaches that leverage large-scale language models such as ChatGPT. To create a synthetic parallel corpus, we compare 3 methods using different prompts. We employ two assessment metrics to measure the diversity of the generated synthetic data. This approach requires no further model training cost, which is mandatory in other augmentation methods like back-translation. The proposed method improves the unaugmented baseline by 0.68 BLEU score

arXiv.org e-Print Archive

Optical manipulation of a dielectric particle along polygonal closed-loop geometries within a single water droplet

Author: Dholakia Kishan
Hong Seongjin
Kim Seokjin
Lee Hyeonwoo
Lee Yong Soo
Oh Kyunghwan
Park Junbum
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/06/2021
Field of study

This work was supported by the National Research Foundation of Korea (NRF) Grant funded by the Korea government (MSIT) (No. 2019R1A2C2011293) and the UK Engineering and Physical Sciences Research Council (Grant EP/P030017/1).We report a new method to optically manipulate a single dielectric particle along closed-loop polygonal trajectories by crossing a suite of all-fiber Bessel-like beams within a single water droplet. Exploiting optical radiation pressure, this method demonstrates the circulation of a single polystyrene bead in both a triangular and a rectangle geometry enabling the trapped particle to undergo multiple circulations successfully. The crossing of the Bessel-like beams creates polygonal corners where the trapped particles successfully make abrupt turns with acute angles, which is a novel capability in microfluidics. This offers an optofluidic paradigm for particle transport overcoming turbulences in conventional microfluidic chips.Publisher PDFPeer reviewe

Directory of Open Access Journals

PubMed Central

University of St. Andrews - Pure

St Andrews Research Repository

Carrot and stick?: Impact of a low-stakes school accountability program on student achievement

Author: Chiang
De Witte
Figlio
Hanushek
Imbens
Kyunghee Kim
Lee
Neal
Neal
Rockoff
Rouse
Seokjin Woo
Soohyung Lee
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015
Field of study

A key concern in the design of education policies relates to the structure of incentives in accountability systems. This paper examines a school accountability program that provides financial support to low-performing schools but has no direct punishment scheme for recipients who do not exhibit improvement. Although the program does not include high-stakes consequences, our estimates indicate that the program reduced the share of underperforming students by 18 percent. This paper's results suggest that to improve student achievement, a school accountability program does not need to set high-stakes consequences that potentially induce unwanted strategic behaviors on the part of school workers

Crossref

EconStor (ZBW Kiel)

Identification of the early and late responder genes during the generation of induced pluripotent stem cells from mouse fibroblasts

Author: Choe Moon Kyung
Ham Seokjin
Hong Chang-Pyo
Kim Hyo-Soo
Kwon Yoo-Wook
Lee Choon-Soo
Park Jihwan
ROH TAE YOUNG
Seo Seonghye
Shin So-I
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/02/2019
Field of study

111Ysciescopu

포항공과대학교

Prefoldin 6 mediates longevity response from heat shock factor 1 to FOXO in C-elegans

Author: AN SEON WOO
Ao-Lin Hsu
Chang Man Ha
Eun-Seok Choi
Eunju Kim
Haeshim Baek
Heehwa G. Son
JANG SUNG KEY
Keunhee Seo
LEE SEUNG JAE
LEE YUJIN
Mihwa Seo
ROH TAE YOUNG
SANGSOON PARK
Seokjin Ham
Youngjae Ryu
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 01/12/2018
Field of study

Heat shock factor 1 (HSF-1) and forkhead box O (FOXO) are key transcription factors that protect cells from various stresses. In Caenorhabditis elegans, HSF-1 and FOXO together promote a long life span when insulin/IGF-1 signaling (IIS) is reduced. However, it remains poorly understood how HSF-1 and FOXO cooperate to confer IIS-mediated longevity. Here, we show that prefoldin 6 (PFD-6), a component of the molecular chaperone prefoldin-like complex, relays longevity response from HSF-1 to FOXO under reduced IIS. We found that PFD-6 was specifically required for reduced IIS-mediated longevity by acting in the intestine and hypodermis. We showed that HSF-1 increased the levels of PFD-6 proteins, which in turn directly bound FOXO and enhanced its transcriptional activity. Our work suggests that the prefoldin-like chaperone complex mediates longevity response from HSF-1 to FOXO to increase the life span in animals with reduced IIS.11Ysciescopu

IBS Publications Repository

포항공과대학교

DGIST Library Institutional Repository

CD82/KAI1 Maintains the Dormancy of Long-Term Hematopoietic Stem Cells through Interaction with DARC- Expressing Macrophages

Author: Chang-Hwan Yoon
Cheong-Whan Chae
Ho Lee
Hwan Lee
Hyo-Soo Kim
Jae-Il Choi
Jeehoon Kang
Ji-Yeon Yun
Jin Hur
Jin-A Kang
Jong Kwan Jun
Kyungjin Boo
Pniel Nham
ROH TAE YOUNG
Sang Eun Lee
Seokjin Ham
Sung Hee Baek
Tae-Won Kim
Publication venue: 'Elsevier BV'
Publication date
Field of study

Hematopoiesis is regulated by crosstalk between long-term repopulating hematopoietic stem cells (LT-HSCs) and supporting niche cells in the bone marrow (BM). Here, we examine the role of CD82/ KAI1 in niche-mediated LT-HSC maintenance. We found that CD82/ KAI1 is expressed predominantly on LT-HSCs and rarely on other hematopoietic stem-progenitor cells (HSPCs). In Cd82 +/-/+/- mice, LTHSCs were selectively lost as they exited from quiescence and differentiated. Mechanistically, CD82based TGF-b1/ Smad3 signaling leads to induction of CDK inhibitors and cell-cycle inhibition. The CD82 binding partner DARC/ CD234 is expressed on macrophages and stabilizes CD82 on LT-HSCs, promoting their quiescence. When DARC + BMmacrophages were ablated, the level of surface CD82 on LT-HSCs decreased, leading to cell-cycle entry, proliferation, and differentiation. A similar interaction appears to be relevant for human HSPCs. Thus, CD82 is a functional surface marker of LT-HSCs that maintains quiescence through interaction with DARC-expressing macrophages in the BM stem cell niche.113525Ysciescopu

포항공과대학교

Using standardized patients versus video cases for representing clinical problems in problem-based learning

Author: Barrows
Bo Young Yoon
Byoung Doo Rhee
Hyerin Roh
Ikseon Choi
Jong-Tae Lee
Seokjin Choi
Tae-Hee Kim
Publication venue: 'Korean Society of Medical Education'
Publication date
Field of study

Crossref

Bioinformatics services for analyzing massive genomic datasets

Author: Cho Youngbum
Ham Seokjin
Han Jiyeon
Jang Insoon
Jeong Seongmun
Kang Byunghee
Kim Jae-Yoon
Kim Jihyun F.
Kim Kyoung Hyoun
Kim Lian
Kim Namshin
Kim Pan-Gyu
Kim Seon-Young
Ko Gunhwan
Lee Byungwook
Lee Ho-Yeon
Lee Sanghyuk
Lee Seung-Won
Nam Dougu
Roh Tae-Young
Shin Sunguk
Yu Namhee
Publication venue: 'Korea Genome Organization'
Publication date: 01/03/2020
Field of study

The explosive growth of next-generation sequencing data has resulted in ultra-large-scale datasets and ensuing computational problems. In Korea, the amount of genomic data has been increasing rapidly in the recent years. Leveraging these big data requires researchers to use large-scale computational resources and analysis pipelines. A promising solution for addressing this computational challenge is cloud computing, where CPUs, memory, storage, and programs are accessible in the form of virtual machines. Here, we present a cloud computing-based system, Bio-Express, that provides user-friendly, cost-effective analysis of massive genomic datasets. Bio-Express is loaded with predefined multi-omics data analysis pipelines, which are divided into genome, transcriptome, epigenome, and metagenome pipelines. Users can employ predefined pipelines or create a new pipeline for analyzing their own omics data. We also developed several web-based services for facilitating down-stream analysis of genome data. Bio-Express web service is freely available at https://www. bioexpress.re.kr/. ?? 2020, Korea Genome Organization

ScholarWorks@UNIST

Recommended from our members

Routing algorithms for field-programmable gate arrays

Author: Lee Seokjin
Publication venue
Publication date: 01/01/2003
Field of study

textField-Programmable Gate Arrays (FPGAs) have been one of the most popular devices for system prototyping, logic emulation, and reconfigurable computing. Their user-programmable prefabricated logic modules and routing structures provide low manufacturing cost and fast time-to-market implementation solutions to the users. However, the routing delay due to their inherent routing structure has been one of the biggest bottlenecks of their speed performance. As the VLSI fabrication feature size is shrunk to deep submicron dimension in modern technology, the portion taken up by routing in both of area and timing grows even more significantly. In this dissertation, we address issues on routing algorithms to optimize area and timing of an FPGA system. We present a new timing-driven routing algorithm for FPGAs. The algorithm finds a routing solution with minimum critical path delay for a given placed netlist using the Lagrangian relaxation technique. Lagrangian multipliers used to relax timing constraints are updated by subgradient method over iterations. Incorporated into the cost function, these multipliers guide the router to construct routing trees for the nets. Experimental results on benchmark circuits show that our approach outperforms the state-of-the-art VPR router. The routing channels of an FPGA consist of wire segments of various types, which provide the tradeoff between performance and routability. To fully exploit the potential of the routing architectures with various wire types, it is beneficial to perform appropriate assignment of wire types to routes for nets. We present a wire-type assignment algorithm that is based on iterative applications of mincost max-flow technique to simultaneously route many nets. At each stage of the network flow computation, we have guaranteed optimal result in terms of routability and total delay cost. Experimental results show that our algorithm can route more nets with smaller total delay. We also present a congestion-driven detailed routing algorithm. Using the min-cost flow approach, our algorithm routes all the nets connected to a common logic module simultaneously. At each stage of the min-cost flow computation, we guarantee optimal routing result for the nets connected to a logic block in terms of routability and total delay cost. To achieve better overall results, we adopt an iterative refinement scheme based on the Lagrangian relaxation technique. We compared the routing results with those from VPR router, and the results show that our router uses less or equal number of routing tracks with smaller critical path delay as well as total routing delay.Electrical and Computer Engineerin

Texas ScholarWorks