Search CORE

6 research outputs found

ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought

Author: Cao Ruisheng
Chen Lu
Xu Hongshen
Yu Kai
Zhang Hanchong
Publication venue
Publication date: 26/10/2023
Field of study

Recently Large Language Models (LLMs) have been proven to have strong abilities in various domains and tasks. We study the problem of prompt designing in the text-to-SQL task and attempt to improve the LLMs' reasoning ability when generating SQL queries. Besides the trivial few-shot in-context learning setting, we design our chain-of-thought (CoT) prompt with a similar method to schema linking. We provide a method named ACT-SQL to automatically generate auto-CoT exemplars and thus the whole process doesn't need manual labeling. Our approach is cost-saving since we only use the LLMs' API call once when generating one SQL query. Furthermore, we extend our in-context learning method to the multi-turn text-to-SQL task. The experiment results show that the LLMs' performance can benefit from our ACT-SQL approach. Our approach achieves SOTA performance on the Spider dev set among existing in-context learning approaches

arXiv.org e-Print Archive

ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL

Author: Cao Ruisheng
Chen Lu
Li Jieyu
Ma Da
Xu Hongshen
Yu Kai
Zhang Hanchong
Publication venue
Publication date: 28/10/2023
Field of study

Text-to-SQL aims to generate an executable SQL program given the user utterance and the corresponding database schema. To ensure the well-formedness of output SQLs, one prominent approach adopts a grammar-based recurrent decoder to produce the equivalent SQL abstract syntax tree (AST). However, previous methods mainly utilize an RNN-series decoder, which 1) is time-consuming and inefficient and 2) introduces very few structure priors. In this work, we propose an AST structure-aware Transformer decoder (ASTormer) to replace traditional RNN cells. The structural knowledge, such as node types and positions in the tree, is seamlessly incorporated into the decoder via both absolute and relative position embeddings. Besides, the proposed framework is compatible with different traversing orders even considering adaptive node selection. Extensive experiments on five text-to-SQL benchmarks demonstrate the effectiveness and efficiency of our structured decoder compared to competitive baselines

arXiv.org e-Print Archive

A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames

Author: Cao Ruisheng
Chen Lu
Jiang Sheng
Xu Hongshen
Yu Kai
Zhang Hanchong
Zhu Su
Publication venue
Publication date: 28/02/2024
Field of study

Previous work on spoken language understanding (SLU) mainly focuses on single-intent settings, where each input utterance merely contains one user intent. This configuration significantly limits the surface form of user utterances and the capacity of output semantics. In this work, we first propose a Multi-Intent dataset which is collected from a realistic in-Vehicle dialogue System, called MIVS. The target semantic frame is organized in a 3-layer hierarchical structure to tackle the alignment and assignment problems in multi-intent cases. Accordingly, we devise a BiRGAT model to encode the hierarchy of ontology items, the backbone of which is a dual relational graph attention network. Coupled with the 3-way pointer-generator decoder, our method outperforms traditional sequence labeling and classification-based schemes by a large margin

arXiv.org e-Print Archive

On the Structural Generalization in Text-to-SQL

Author: Cao Ruisheng
Chen Lu
Chen Zhi
Li Jieyu
Xu Hongshen
Yu Kai
Zhang Hanchong
Zhu Su
Publication venue
Publication date: 21/01/2023
Field of study

Exploring the generalization of a text-to-SQL parser is essential for a system to automatically adapt the real-world databases. Previous works provided investigations focusing on lexical diversity, including the influence of the synonym and perturbations in both natural language questions and databases. However, research on the structure variety of database schema~(DS) is deficient. Specifically, confronted with the same input question, the target SQL is probably represented in different ways when the DS comes to a different structure. In this work, we provide in-deep discussions about the structural generalization of text-to-SQL tasks. We observe that current datasets are too templated to study structural generalization. To collect eligible test data, we propose a framework to generate novel text-to-SQL data via automatic and synchronous (DS, SQL) pair altering. In the experiments, significant performance reduction when evaluating well-trained text-to-SQL models on the synthetic samples demonstrates the limitation of current research regarding structural generalization. According to comprehensive analysis, we suggest the practical reason is the overfitting of (NL, SQL) patterns.Comment: The experiment results of T5 and T5-Picard in Table 5 and Table 6 are not correct because we made mistakes in the evaluation code

arXiv.org e-Print Archive

MicroRNA screening identifies circulating microRNAs as potential biomarkers for osteosarcoma

Author: Bandres
Bian
Boeri
Brase
Broadhead
Chen
Cookson
Dai
Di Leva
Díaz
Esposito
Fan
Fox
Friedman
Hameed
HANCHONG ZHANG
Harting
Hiroki
HONG-BIN GUO
Hu
Huang
HUI LI
Hummel
JIE BU
Jones
Kim
Kloosterman
Kobayashi
Kosaka
Krishnan
KUN ZHANG
Lawrie
Lee
Li
Li
Li
Li
LI-HONG LIU
Liu
Liu
Liu
Ma
Menéndez
Mitchell
Morimura
Ng
Ratert
Redova
Sand
Schrauder
Shen
Si
Sita-Lumsden
Soifer
TAO XIAO
Ueda
Valladares-Ayerbes
Wang
Wang
Weeden
Xu
Xu
Yang
YURONG OUYANG
Zeng
Zhang
Zhao
Zhao
Zhi
Zhou
Publication venue: 'Spandidos Publications'
Publication date
Field of study

Crossref

The Challenges and the Promise of Molecular Targeted Therapy in Malignant Gliomas

Author: Aguilera
Al-Nedawi
Arbab
Auffinger
Babic
Bai
Batchelor
Batchelor
Becher
Blesa
Board
Boland
Brandes
Cabanas
Cancer Genome Atlas Research Network
Carracedo
Caruso
Cen
Cerniglia
Chakravarti
Chamberlain
Chang
Chang
Chaponis
Chen
Chinnaiyan
Chinot
Clement
Cloughesy
Cohen
Da Fu
de Groot
de la Pena
Demirci
Den
Desjardins
di Tomaso
Dixit
Dong
Drappatz
El Habr
Ellis
Fan
Fields
Franceschi
Friday
Friedman
Furnari
Gajadhar
Galanis
Galanis
Gan
Gatson
Geoerger
Gerstner
Giglio
Gilbert
Gong
Habeck
Hanchong Xu
Hardee
Hatanpaa
Hegi
Hong
Hongxiang Wang
Huang
Huang
Iwamoto
Iwamoto
Jansen
Jo
Johns
Juxiang Chen
Kalani
Karamboulas
Kesavabhotla
Khasraw
Kim
Kinsella
Krakstad
Kreisl
Kreisl
Kreisl
Kubicek
Kuczynski
Lai
Landis-Piwowar
Larsen
Lee
Lehky
Li
Li
Liu
Lo
Lo
Lombardi
LoRusso
Louis
Lustig
Lv
Maruotti
Marx
Masui
Mercer
Michaud
Millet
Mizoguchi
Mizushina
Momota
Morris
Nabors
Navis
Nawroth
Nazarenko
New
Neyns
Nghiemphu
Nicholas
Ohgaki
Ohka
Omuro
Onishi
Onishi
Pan
Patrick
Paul
Peereboom
Pelloski
Phuphanich
Pitter
Plate
Prasad
Reardon
Reardon
Reardon
Reardon
Reardon
Reardon
Reardon
Reardon
Reardon
Reardon
Reardon
Reardon
Reardon
Ribas
Rich
Riva
Ruiz
Sarkaria
Sarkaria
Scaringi
Scott
Shabason
Sharma
Singh
Smith
Snuderl
Stopschinski
Stupp
Stupp
Stupp
Stupp
Tabatabai
Takahashi-Yanaga
Takano
Tanaka
Tao Xu
Tate
Thiessen
Trembath
Uhm
Van Meir
Vlachostergios
Wachsberger
Wang
Wang
Wang
Weiler
Weller
Wen
Wen
Wheeler
Yakes
Yan
Yang
Yang
Yang
Yin
Ying Jiang
Yong Yan
Yung
Yung
Zhang
Zhang
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref