Search CORE

225 research outputs found

ControlFlag: A Self-supervised Idiosyncratic Pattern Detection System for Software Control Structures

Author: Gottschlich Justin E
Hasabnis Niranjan
Publication venue: ScholarlyCommons
Publication date: 01/01/2020
Field of study

Software debugging has been shown to utilize upwards of 50% of developers’ time. Machine programming, the field concerned with the automation of software (and hardware) development, has recently made progress in both research and production-quality automated debugging systems. In this paper, we present ControlFlag, a system that detects possible idiosyncratic violations in software control structures. ControlFlag also suggests possible corrections in the event a true error is detected. A novelty of ControlFlag is that it is entirely self-supervised; that is, it requires no labels to learn about the potential idiosyncratic programming pattern violations. In addition to presenting ControlFlag’s design, we also provide an abbreviated experimental evaluation

arXiv.org e-Print Archive

ScholarlyCommons@Penn

Software Language Comprehension using a Program-Derived Semantics Graph

Author: Gottschlich Justin E
Iyer Roshni G
Sun Yizhou
Wang Wei
Publication venue: ScholarlyCommons
Publication date: 01/01/2020
Field of study

Traditional code transformation structures, such as abstract syntax trees (ASTs), conteXtual flow graphs (XFGs), and more generally, compiler intermediate representations (IRs), may have limitations in extracting higher-order semantics from code. While work has already begun on higher-order semantics lifting (e.g., Aroma’s simplified parse tree (SPT), verified lifting’s lambda calculi, and Halide’s intentional domain specific language (DSL)), research in this area is still immature. To continue to advance this research, we present the program-derived semantics graph (PSG), a new graphical structure to capture semantics of code. The PSG is designed to provide a single structure for capturing program semantics at multiple levels of abstraction. The PSG may be in a class of emerging structural representations that cannot be built from a traditional set of predefined rules and instead must be learned. In this paper, we describe the PSG and its fundamental structural differences compared to state-of-the-art structures. Although our exploration into the PSG is in its infancy, our early results and architectural analysis indicate it is a promising new research direction to automatically extract program semantics

arXiv.org e-Print Archive

ScholarlyCommons@Penn

Hieracium hypochoeroides subsp. montis-scuderii (Asteraceae), a new endemic subspecies from Sicily (Italy)

Author: Cristaudo A.
DI GRISTINA E.
Galesi R.
Gottschlich G.
Raimondo F.
Publication venue: 'Fondazione Pro Herbario Mediterraneo'
Publication date: 01/01/2013
Field of study

Hieracium hypochoeroides subsp. montis-scuderii, a new subspecies endemic to Sicily, is described and illustrated. It is only known from the carbonate cliffs of Monte Scuderi (Peloritani Mountains, NE-Sicily). Informations on its ecology and taxonomic relationships are provided

Archivio istituzionale della ricerca - Università di Palermo

Precision and Recall for Time Series

Author: Alam Mejbah
Gottschlich Justin E
Lee Tae J
Tatbul Nesime
Zdonik Stan
Publication venue: ScholarlyCommons
Publication date: 01/01/2018
Field of study

Classical anomaly detection is principally concerned with point-based anomalies, those anomalies that occur at a single point in time. Yet, many real-world anomalies are range-based, meaning they occur over a period of time. Motivated by this observation, we present a new mathematical model to evaluate the accuracy of time series classification algorithms. Our model expands the well-known Precision and Recall metrics to measure ranges, while simultaneously enabling customization support for domain-specific preferences

arXiv.org e-Print Archive

ScholarlyCommons@Penn

Greenhouse: A Zero-Positive Machine Learning System for Time-Series Anomaly Detection

Author: Gottschlich Justin E
Lee Tae J
Metcalf Eric
Tatbul Nesime
Zdonik Stan
Publication venue: ScholarlyCommons
Publication date: 01/01/2018
Field of study

This short paper describes our ongoing research on Greenhouse - a zero-positive machine learning system for time-series anomaly detection

arXiv.org e-Print Archive

ScholarlyCommons@Penn

Precision and Recall for Range-Based Anomaly Detection

Author: Gottschlich Justin E
Lee Tae J
Metcalf Eric
Tatbul Nesime
Zdonik Stan
Publication venue: ScholarlyCommons
Publication date: 01/01/2018
Field of study

Classical anomaly detection is principally concerned with point- based anomalies, anomalies that occur at a single data point. In this paper, we present a new mathematical model to express range- based anomalies, anomalies that occur over a range (or period) of time

arXiv.org e-Print Archive

ScholarlyCommons@Penn

Toward Scalable Verification for Safety-Critical Deep Networks

Author: Barrett Clark
Gottschlich Justin E
Julian Kyle
Katz Guy
Kochenderfer Mykel J
Kuper Lindsey
Publication venue: ScholarlyCommons
Publication date: 01/01/2018
Field of study

The increasing use of deep neural networks for safety-critical applications, such as autonomous driving and flight control, raises concerns about their safety and reliability. Formal verification can address these concerns by guaranteeing that a deep learning system operates as intended, but the state of the art is limited to small systems. In this work-in-progress report we give an overview of our work on mitigating this difficulty, by pursuing two complementary directions: devising scalable verification techniques, and identifying design choices that result in deep learning systems that are more amenable to verification

arXiv.org e-Print Archive

ScholarlyCommons@Penn

A Zero-Positive Learning Approach for Diagnosing Software Performance Regressions

Author: Alam Mejbah
Gottschlich Justin E
Mattson Timothy
Muzahid Abdullah
Tatbul Nesime
Turek Javier S
Publication venue: ScholarlyCommons
Publication date: 01/01/2019
Field of study

The field of machine programming (MP), the automation of the development of software, is making notable research advances. This is, in part, due to the emergence of a wide range of novel techniques in machine learning. In this paper, we apply MP to the automation of software performance regression testing. A performance regression is a software performance degradation caused by a code change. We present AutoPerf–a novel approach to automate regression testing that utilizes three core techniques:(i) zero-positive learning,(ii) autoencoders, and (iii) hardware telemetry. We demonstrate AutoPerf’s generality and efficacy against 3 types of performance regressions across 10 real performance bugs in 7 benchmark and open-source programs. On average, AutoPerf exhibits 4% profiling overhead and accurately diagnoses more performance bugs than prior state-of-the-art approaches. Thus far, AutoPerf has produced no false negatives

arXiv.org e-Print Archive

ScholarlyCommons@Penn

MISIM: A Novel Code Similarity System

Author: Dubey Pradeep
Gottschlich Justin E
Hasabnis Niranjan
Kraska Tim
Marcus Ryan
Mattson Timothy
Petersen Paul
Sarkar Vivek
Tatbul Nesime
Tithi Jesmin J
Venkat Anand
Ye Fangke
Zhou Shengtian
Publication venue: ScholarlyCommons
Publication date: 01/06/2020
Field of study

Code similarity systems are integral to a range of applications from code recommendation to automated software defect correction. We argue that code similarity is now a first-order problem that must be solved. To begin to address this, we present machine Inferred Code Similarity (MISIM), a novel end-to-end code similarity system that consists of two core components. First, MISIM uses a novel context-aware semantic structure, which is designed to aid in lifting semantic meaning from code syntax. Second, MISIM provides a neural-based code similarity scoring algorithm, which can be implemented with various neural network architectures with learned parameters. We compare MISIM to three state-of-the-art code similarity systems: (i) code2vec, (ii) Neural Code Comprehension, and (iii) Aroma. In our experimental evaluation across 328,155 programs (over 18 million lines of code), MISIM has 1.5x to 43.4x better accuracy than all three systems

ScholarlyCommons@Penn

- Notulae to the Italian native vascular flora: 10.

Author: Bagella S
Barberis G
Bartolucci F
Briozzo I
Calbi M
Caria Mc
Cavallaro V
Chianese G
Cibei C
Conti F
Dagnino D
Domina G
Esposito A
Forte L
Galasso G
Giacanelli V
Gottschlich G
Lattanzi E
Longo D
Mei G
Merli M
Nepi C
Orsenigo S
Pau Gb
Pazienza G
Peccenini S
Pisanu S
Rivieccio G
Roma-Marzio F
Scafidi F
Selvi F
Stinca A
Turcato C
Publication venue: 'Pensoft Publishers'
Publication date: 01/01/2020
Field of study

Florence Research