
    Benchmark for Security Testing on Embedded Systems

    With the growing popularity of the Internet of Things (IoT), embedded devices are increasingly integrated into our daily lives, making their security a vital issue to address. Attacks such as stack smashing, code injection, data corruption, and Return-Oriented Programming (ROP) remain a threat to embedded systems, yet no benchmark currently exists for comparing the methods being developed to defend against them. In this work, a benchmark is presented that is aimed at testing the security of new techniques that defend against these common attacks. Two programs are developed that exhibit three key properties a benchmark needs: they are realistic embedded applications, they have complex control flow, and they are deterministic. The first application is a pin lock system and the second is a compression data logger. A complexity evaluation revealed that the pin lock system contains 171 functions and a control-flow graph of 190 nodes and 252 edges, while the compression data logger contains 192 functions and a control-flow graph of 1,357 nodes and 2,123 edges. The benchmark will be extended in the future with more applications spanning a wider range of complexity.
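    As a rough illustration of the control-flow-graph metrics quoted above, the sketch below computes node and edge counts (plus McCabe cyclomatic complexity, which those counts determine) for a toy CFG. It assumes the CFG is available as a networkx DiGraph; the function name and the toy graph are illustrative, not artifacts from the benchmark.

```python
import networkx as nx

def cfg_complexity(cfg: nx.DiGraph) -> dict:
    """Summarize a control-flow graph by the metrics the benchmark reports."""
    n_nodes = cfg.number_of_nodes()
    n_edges = cfg.number_of_edges()
    # McCabe cyclomatic complexity for a connected CFG: E - N + 2.
    return {"nodes": n_nodes, "edges": n_edges,
            "cyclomatic": n_edges - n_nodes + 2}

# Toy example: a diamond-shaped CFG (one branch, one merge point).
g = nx.DiGraph([("entry", "then"), ("entry", "else"),
                ("then", "exit"), ("else", "exit")])
print(cfg_complexity(g))  # {'nodes': 4, 'edges': 4, 'cyclomatic': 2}
```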

    Cross-Modal Data Programming Enables Rapid Medical Machine Learning

    Labeling training datasets has become a key barrier to building medical machine learning models. One strategy is to generate training labels programmatically, for example by applying natural language processing pipelines to text reports associated with imaging studies. We propose cross-modal data programming, which generalizes this intuitive strategy in a theoretically grounded way that enables simpler, clinician-driven input, reduces required labeling time, and improves with additional unlabeled data. In this approach, clinicians generate training labels for models defined over a target modality (e.g., images or time series) by writing rules over an auxiliary modality (e.g., text reports). The resulting technical challenge consists of estimating the accuracies and correlations of these rules; we extend a recent unsupervised generative modeling technique to handle this cross-modal setting in a provably consistent way. Across four applications in radiography, computed tomography, and electroencephalography, and using only several hours of clinician time, our approach matches or exceeds the efficacy of physician-months of hand-labeling with statistical significance, demonstrating a fundamentally faster and more flexible way of building machine learning models in medicine.
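    The core mechanic, writing rules over text reports that then label the paired images, can be sketched as simple labeling functions in the data programming style. The rules, label values, and example report below are hypothetical, and the naive majority vote stands in for the paper's unsupervised estimation of rule accuracies and correlations.

```python
# Label values follow the data programming convention of allowing abstention.
ABSTAIN, NEGATIVE, POSITIVE = -1, 0, 1

def lf_mentions_fracture(report: str) -> int:
    """Hypothetical rule over the text report (the auxiliary modality)."""
    return POSITIVE if "fracture" in report.lower() else ABSTAIN

def lf_explicit_negation(report: str) -> int:
    return NEGATIVE if "no acute abnormality" in report.lower() else ABSTAIN

LFS = [lf_mentions_fracture, lf_explicit_negation]

def weak_label(report: str) -> int:
    """Naive majority vote; the paper instead models rule accuracies/correlations."""
    votes = [v for lf in LFS if (v := lf(report)) != ABSTAIN]
    return max(set(votes), key=votes.count) if votes else ABSTAIN

report = "Impression: nondisplaced fracture of the distal radius."
print(weak_label(report))  # 1 -> training label for the paired image or time series
```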

    Spatiotemporal Modeling of Multivariate Signals With Graph Neural Networks and Structured State Space Models

    Multivariate signals are prevalent in various domains, such as healthcare, transportation systems, and space sciences. Modeling spatiotemporal dependencies in multivariate signals is challenging due to (1) long-range temporal dependencies and (2) complex spatial correlations between sensors. To address these challenges, we propose representing multivariate signals as graphs and introduce GraphS4mer, a general graph neural network (GNN) architecture that captures both spatial and temporal dependencies in multivariate signals. Specifically, (1) we leverage the Structured State Space model (S4), a state-of-the-art sequence model, to capture long-range temporal dependencies, and (2) we propose a graph structure learning layer in GraphS4mer to learn dynamically evolving graph structures in the data. We evaluate our proposed model on three distinct tasks and show that GraphS4mer consistently improves over existing models: (1) seizure detection from electroencephalography signals, outperforming a previous GNN with self-supervised pretraining by 3.1 points in AUROC; (2) sleep staging from polysomnography signals, improving macro-F1 by 4.1 points over existing sleep staging models; and (3) traffic forecasting, reducing MAE by 8.8% compared to existing GNNs and by 1.4% compared to Transformer-based models.
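    The sketch below illustrates the two ingredients in isolation: a layer that learns a soft adjacency matrix from per-node embeddings, and a per-node temporal encoder. A GRU stands in for the S4 layer, and all module names and dimensions are assumptions chosen for illustration rather than the GraphS4mer architecture itself.

```python
import torch
import torch.nn as nn

class GraphStructureLearner(nn.Module):
    """Learns a soft adjacency matrix from per-node embeddings."""
    def __init__(self, dim: int):
        super().__init__()
        self.proj = nn.Linear(dim, dim)

    def forward(self, h):                        # h: (nodes, dim)
        z = self.proj(h)
        scores = z @ z.t() / z.shape[-1] ** 0.5  # scaled pairwise affinities
        return torch.softmax(scores, dim=-1)     # row-normalized adjacency

class SpatioTemporalBlock(nn.Module):
    def __init__(self, in_dim: int, hid: int):
        super().__init__()
        self.temporal = nn.GRU(in_dim, hid, batch_first=True)  # stand-in for S4
        self.gsl = GraphStructureLearner(hid)

    def forward(self, x):                        # x: (nodes, time, in_dim)
        h, _ = self.temporal(x)                  # per-node temporal encoding
        h = h[:, -1, :]                          # last hidden state per node
        adj = self.gsl(h)                        # dynamically learned graph
        return adj @ h                           # one round of message passing

block = SpatioTemporalBlock(in_dim=3, hid=16)
signals = torch.randn(19, 200, 3)                # 19 EEG channels, 200 time steps
print(block(signals).shape)                      # torch.Size([19, 16])
```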

    Domino: Discovering Systematic Errors with Cross-Modal Embeddings

    Machine learning models that achieve high overall accuracy often make systematic errors on important subsets (or slices) of data. Identifying underperforming slices is particularly challenging when working with high-dimensional inputs (e.g., images, audio), where important slices are often unlabeled. To address this issue, recent studies have proposed automated slice discovery methods (SDMs), which leverage learned model representations to mine input data for slices on which a model performs poorly. To be useful to a practitioner, these methods must identify slices that are both underperforming and coherent (i.e., united by a human-understandable concept). However, no quantitative evaluation framework currently exists for rigorously assessing SDMs with respect to these criteria. Additionally, prior qualitative evaluations have shown that SDMs often identify slices that are incoherent. In this work, we address these challenges by first designing a principled evaluation framework that enables a quantitative comparison of SDMs across 1,235 slice discovery settings in three input domains (natural images, medical images, and time-series data). Then, motivated by the recent development of powerful cross-modal representation learning approaches, we present Domino, an SDM that leverages cross-modal embeddings and a novel error-aware mixture model to discover and describe coherent slices. We find that Domino accurately identifies 36% of the 1,235 slices in our framework, a 12-percentage-point improvement over prior methods. Further, Domino is the first SDM that can provide natural language descriptions of identified slices, correctly generating the exact name of the slice in 35% of settings. (ICLR 2022, oral presentation.)
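    A crude way to see the error-aware idea: cluster examples in an embedding space that has been augmented with the label and the model's prediction, then rank clusters by error rate. The plain Gaussian mixture below is a stand-in for Domino's error-aware mixture model, and the synthetic embeddings and weighting are invented for illustration.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
emb = rng.normal(size=(500, 8))     # stand-in for cross-modal embeddings
emb[:60] += 2.0                     # an embedding-coherent region...
y = rng.integers(0, 2, size=500)    # true labels
y_hat = y.copy()
y_hat[:60] = 1 - y_hat[:60]         # ...where the model errs systematically

ERR_WEIGHT = 3.0                    # emphasize label/prediction disagreement
feats = np.column_stack([emb, ERR_WEIGHT * y, ERR_WEIGHT * y_hat])

gm = GaussianMixture(n_components=10, random_state=0).fit(feats)
slices = gm.predict(feats)

# Rank candidate slices by error rate; the top ones are "underperforming".
rates = [(y[slices == k] != y_hat[slices == k]).mean() if (slices == k).any() else 0.0
         for k in range(10)]
for k in np.argsort(rates)[::-1][:3]:
    mask = slices == k
    print(f"slice {k}: n={mask.sum()}, error rate={rates[k]:.2f}")
```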

    Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video Processing

    Labeled data is a critical resource for training and evaluating machine learning models. However, many real-life datasets are only partially labeled. We propose a semi-supervised machine learning training strategy to improve event detection performance on sequential data, such as video recordings, when only sparse labels are available, such as event start times without their corresponding end times. Our method uses noisy guesses of the events' end times to train event detection models. Depending on how conservative these guesses are, mislabeled false positives may be introduced into the training set (i.e., negative sequences mislabeled as positives). We further propose a mathematical model for estimating how many inaccurate labels a model is exposed to, based on how noisy the end-time guesses are. Finally, we show that neural networks can improve their detection performance by leveraging more training data with less conservative approximations, despite the higher proportion of incorrect labels. We adapt sequential versions of MNIST and CIFAR-10 to empirically evaluate our method, and find that our risk-tolerant strategy outperforms conservative estimates by 12 points of mean average precision for MNIST and 3.5 points for CIFAR. We then leverage the proposed training strategy to tackle a real-life application, processing continuous video recordings of epilepsy patients to improve seizure detection, and show that our method outperforms baseline labeling methods by 10 points of average precision.
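    The trade-off at the heart of the method can be simulated directly: marking a fixed-length window after each known start time as positive yields more positive training frames as the guessed duration grows, at the cost of mislabeling frames past the true (unknown) end. The synthetic event stream below is illustrative, and the paper's mathematical model of the mislabeled fraction is replaced here by direct measurement.

```python
import numpy as np

rng = np.random.default_rng(1)
n_frames = 10_000
starts = np.sort(rng.choice(n_frames - 200, size=20, replace=False))
true_durations = rng.integers(30, 120, size=20)   # unknown at training time

truth = np.zeros(n_frames, dtype=bool)
for s, d in zip(starts, true_durations):
    truth[s:s + d] = True

def guessed_labels(guess: int) -> np.ndarray:
    """Mark `guess` frames after each known start time as positive."""
    labels = np.zeros(n_frames, dtype=bool)
    for s in starts:
        labels[s:s + guess] = True
    return labels

for guess in (30, 75, 150):                       # conservative -> aggressive
    lab = guessed_labels(guess)
    pos, false_pos = lab.sum(), (lab & ~truth).sum()
    print(f"guess={guess:3d}: positive frames={pos:5d}, mislabeled={false_pos / pos:.2%}")
```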

    EPMA position paper in cancer: current overview and future perspectives


    Automatic Testing Tool for Analog Circuits

    Methodology and algorithms for analog circuit testing -- Automatic sensitivity computation method -- Test vector generation for parametric faults -- Test vector generation for catastrophic faults -- Test vector compaction -- Test point insertion

    Closing the Gap Between Analog and Digital

    This paper presents a highly effective method for parallel hard fault simulation and test specification development. The proposed method formulates fault simulation as the problem of estimating a fault's value from the distance between the output parameter distributions of the fault-free and faulty circuits. We demonstrate the effectiveness and practicality of the method with results on several designs. This approach, extended with parametric fault testing, has been implemented as an automated tool set for IC testing.
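    A minimal sketch of the distribution-distance idea, assuming a toy RC low-pass in place of a real design: Monte Carlo sampling under process variation yields output-parameter distributions for the fault-free and faulty circuits, and their separation scores the fault. The circuit, the bridging fault, and the sigma-separation score are illustrative choices, not the paper's exact formulation.

```python
import numpy as np

rng = np.random.default_rng(2)

def cutoff_hz(r_ohm, c_farad):
    """Output parameter: -3 dB cutoff of an RC low-pass, f = 1 / (2*pi*R*C)."""
    return 1.0 / (2.0 * np.pi * r_ohm * c_farad)

def sample_outputs(r_nom, c_nom, n=5000, sigma=0.05):
    """Monte Carlo over process variation on both components."""
    r = r_nom * (1 + sigma * rng.standard_normal(n))
    c = c_nom * (1 + sigma * rng.standard_normal(n))
    return cutoff_hz(r, c)

good = sample_outputs(r_nom=1e3, c_nom=1e-6)      # fault-free circuit
faulty = sample_outputs(r_nom=0.5e3, c_nom=1e-6)  # bridging fault halves R (illustrative)

# Score the fault by the separation of the two output distributions.
mu0, s0 = good.mean(), good.std()
mu1, s1 = faulty.mean(), faulty.std()
separation = abs(mu1 - mu0) / np.sqrt(0.5 * (s0**2 + s1**2))
print(f"fault-free {mu0:.0f} Hz vs faulty {mu1:.0f} Hz: separation {separation:.1f} sigma")
```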

    Parametric fault simulation and test vector generation

    Process variation has long been the major cause of failure in analog circuits, where small deviations in component values cause large deviations in the measured output parameters. This paper presents a new approach for parametric fault simulation and test vector generation. The proposed approach uses process information and the sensitivities of the circuit's principal components to generate statistical models of the fault-free and faulty circuit. The resulting information is then used as a measure to quantify the testability of the circuit. This approach, extended with hard fault testing, has been implemented as an automated tool set for IC testing called FaultMaxx and TestMaxx.
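    A sketch of that flow under stated assumptions: finite-difference sensitivities propagate per-component process variances into a statistical model of the fault-free output, and a parametric fault is judged testable if it shifts the output beyond the fault-free 3-sigma spread. The resistive divider and the 3-sigma limit are illustrative choices, not FaultMaxx/TestMaxx behavior.

```python
import numpy as np

def vout(params):
    """Toy circuit: resistive divider, Vout = Vin * R2 / (R1 + R2) with Vin = 5 V."""
    r1, r2 = params
    return 5.0 * r2 / (r1 + r2)

nominal = np.array([1e3, 2e3])     # R1, R2 nominal values (ohms)
sigma = 0.03 * nominal             # 3% process variation per component

# First-order sensitivities dVout/dRi via central differences.
h = 1.0  # 1-ohm perturbation
sens = np.array([
    (vout(nominal + h * np.eye(2)[i]) - vout(nominal - h * np.eye(2)[i])) / (2 * h)
    for i in range(2)
])

mu = vout(nominal)
out_sigma = np.sqrt(np.sum((sens * sigma) ** 2))  # fault-free output spread

# Parametric fault: R2 drifts 20% low. Testable if the output shifts
# beyond the 3-sigma limits of the fault-free statistical model.
faulty_mu = vout(nominal * np.array([1.0, 0.8]))
print(f"nominal Vout={mu:.3f} V, spread={out_sigma:.3f} V, "
      f"faulty Vout={faulty_mu:.3f} V, detectable={abs(faulty_mu - mu) > 3 * out_sigma}")
```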