Beyond Text-to-SQL for IoT Defense: A Comprehensive Framework for
  Querying and Classifying IoT Threats

Choo, Kim-Kwang Raymond; Das, Jayanta Kumar; Dietrich, Glenn; Ebadi, Nima; George, Jerris; Ghandiparsi, Rambod; Haley, Hannah; Humphreys, Rachael; Linares, Billy; Pavlich, Ryan; Rios, Anthony; Slavin, Rocky; Tan, Adrian; Tarbell, Richard

Beyond Text-to-SQL for IoT Defense: A Comprehensive Framework for Querying and Classifying IoT Threats

Authors: Kim-Kwang Raymond Choo
Jayanta Kumar Das
Glenn Dietrich
Nima Ebadi
Jerris George
Rambod Ghandiparsi
Hannah Haley
Rachael Humphreys
Billy Linares
Ryan Pavlich
Anthony Rios
Rocky Slavin
Adrian Tan
Richard Tarbell
Publication date: 25 June 2024
Publisher

Abstract

Recognizing the promise of natural language interfaces to databases, prior studies have emphasized the development of text-to-SQL systems. While substantial progress has been made in this field, existing research has concentrated on generating SQL statements from text queries. The broader challenge, however, lies in inferring new information about the returned data. Our research makes two major contributions to address this gap. First, we introduce a novel Internet-of-Things (IoT) text-to-SQL dataset comprising 10,985 text-SQL pairs and 239,398 rows of network traffic activity. The dataset contains additional query types limited in prior text-to-SQL datasets, notably temporal-related queries. Our dataset is sourced from a smart building's IoT ecosystem exploring sensor read and network traffic data. Second, our dataset allows two-stage processing, where the returned data (network traffic) from a generated SQL can be categorized as malicious or not. Our results show that joint training to query and infer information about the data can improve overall text-to-SQL performance, nearly matching substantially larger models. We also show that current large language models (e.g., GPT3.5) struggle to infer new information about returned data, thus our dataset provides a novel test bed for integrating complex domain-specific reasoning into LLMs

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2406.17574

Last time updated on 04/01/2025