Search CORE

112 research outputs found

AutoML from Software Engineering Perspective: Landscapes and Challenges

Author: Chen Zhenpeng
Wang Chao
Zhou Minghui
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2023
Field of study

Machine learning (ML) has been widely adopted in modern software, but the manual configuration of ML (e.g., hyper-parameter configuration) poses a significant challenge to software developers. Therefore, automated ML (AutoML), which seeks the optimal configuration of ML automatically, has received increasing attention from the software engineering community. However, to date, there is no comprehensive understanding of how AutoML is used by developers and what challenges developers encounter in using AutoML for software development. To fill this knowledge gap, we conduct the first study on understanding the use and challenges of AutoML from software developers’ perspective. We collect and analyze 1,554 AutoML downstream repositories, 769 AutoML-related Stack Overflow questions, and 1,437 relevant GitHub issues. The results suggest the increasing popularity of AutoML in a wide range of topics, but also the lack of relevant expertise. We manually identify specific challenges faced by developers for AutoML-enabled software. Based on the results, we derive a series of implications for AutoML framework selection, framework development, and research

UCL Discovery

HC3 Plus: A Semantic-Invariant Human ChatGPT Comparison Corpus

Author: Hu Songlin
Ma Guangyuan
Su Zhenpeng
Wu Xing
Zhou Wei
Publication venue
Publication date: 06/09/2023
Field of study

ChatGPT has gained significant interest due to its impressive performance, but people are increasingly concerned about its potential risks, particularly around the detection of AI-generated content (AIGC), which is often difficult for untrained humans to identify. Current datasets utilized for detecting ChatGPT-generated text primarily center around question-answering, yet they tend to disregard tasks that possess semantic-invariant properties, such as summarization, translation, and paraphrasing. Our primary studies demonstrate that detecting model-generated text on semantic-invariant tasks is more difficult. To fill this gap, we introduce a more extensive and comprehensive dataset that considers more types of tasks than previous work, including semantic-invariant tasks. In addition, the model after a large number of task instruction fine-tuning shows a strong powerful performance. Owing to its previous success, we further instruct fine-tuning Tk-instruct and built a more powerful detection system. Experimental results show that our proposed detector outperforms the previous state-of-the-art RoBERTa-based detector

arXiv.org e-Print Archive

Adonis: Practical and Efficient Control Flow Recovery through OS-Level Traces

Author: Chen Jiali
Chen Zhenpeng
Li Ding
Li Shaofei
Liu Xuanzhe
Yang Chengxu
Zhou Yuhan
Publication venue
Publication date: 18/06/2023
Field of study

Control flow recovery is critical to promise the software quality, especially for large-scale software in production environment. However, the efficiency of most current control flow recovery techniques is compromised due to their runtime overheads along with deployment and development costs. To tackle this problem, we propose a novel solution, Adonis, which harnesses OS-level traces, such as dynamic library calls and system call traces, to efficiently and safely recover control flows in practice. Adonis operates in two steps: it first identifies the call-sites of trace entries, then it executes a pair-wise symbolic execution to recover valid execution paths. This technique has several advantages. First, Adonis does not require the insertion of any probes into existing applications, thereby minimizing runtime cost. Second, given that OS-level traces are hardware-independent, Adonis can be implemented across various hardware configurations without the need for hardware-specific engineering efforts, thus reducing deployment cost. Third, as Adonis is fully automated and does not depend on manually created logs, it circumvents additional development cost. We conducted an evaluation of Adonis on representative desktop applications and real-world IoT applications. Adonis can faithfully recover the control flow with 86.8% recall and 81.7% precision. Compared to the state-of-the-art log-based approach, Adonis can not only cover all the execution paths recovered, but also recover 74.9% of statements that cannot be covered. In addition, the runtime cost of Adonis is 18.3× lower than the instrument-based approach; the analysis time and storage cost (indicative of the deployment cost) of Adonis is 50× smaller and 443× smaller than the hardware-based approach, respectively. To facilitate future replication and extension of this work, we have made the code and data publicly available

UCL Discovery

InfoEntropy Loss to Mitigate Bias of Learning Difficulties for Generative Language Models

Author: Bai Xue
Chen Hui
Ding Guiguang
Hu Songlin
Lin Zijia
Su Zhenpeng
Wu Xing
Zhou Wei
Publication venue
Publication date: 10/11/2023
Field of study

Generative language models are usually pretrained on large text corpus via predicting the next token (i.e., sub-word/word/phrase) given the previous ones. Recent works have demonstrated the impressive performance of large generative language models on downstream tasks. However, existing generative language models generally neglect an inherent challenge in text corpus during training, i.e., the imbalance between frequent tokens and infrequent ones. It can lead a language model to be dominated by common and easy-to-learn tokens, thereby overlooking the infrequent and difficult-to-learn ones. To alleviate that, we propose an Information Entropy Loss (InfoEntropy Loss) function. During training, it can dynamically assess the learning difficulty of a to-be-learned token, according to the information entropy of the corresponding predicted probability distribution over the vocabulary. Then it scales the training loss adaptively, trying to lead the model to focus more on the difficult-to-learn tokens. On the Pile dataset, we train generative language models at different scales of 468M, 1.2B, and 6.7B parameters. Experiments reveal that models incorporating the proposed InfoEntropy Loss can gain consistent performance improvement on downstream benchmarks

arXiv.org e-Print Archive

Efficient Silicon Metasurfaces for Visible Light

Author: Fang Hanlin
Krauss Thomas F.
Li Juntao
Li Kezheng
Liu Jin
Reardon Christopher P.
Stellinga Daan
Su Rongbin
Wang Xuehua
Yao Beimeng
Zhou Lidan
Zhou Zhenpeng
Publication venue: 'American Chemical Society (ACS)'
Publication date: 20/09/2016
Field of study

Dielectric metasurfaces require high refractive index contrast materials for optimum performance. This requirement imposes a severe restraint; either devices have been demonstrated at wavelengths of 700 nm and above using high-index semiconductors such as silicon, or they use lower index dielectric materials such as TiO2 or Si3N4 and operate in the visible wavelength regime. Here, we show that the high refractive index of silicon can be exploited at wavelengths as short as 532 nm by demonstrating a crystalline silicon metasurface with a transmission efficiency of 71% at this wavelength and a diffraction efficiency of 95% into the desired diffraction order. The metasurfaces consist of a graded array of silicon posts arranged in a square lattice on a quartz substrate. We show full 2π phase control, and we experimentally demonstrate polarization-independent beam deflection at 532 nm wavelength. Our results open a new way for realizing efficient metasurfaces based on silicon for the technologically all-important display applications

arXiv.org e-Print Archive

Crossref

White Rose Research Online

FigShare

Correlation-driven eightfold magnetic anisotropy in a two-dimensional oxide monolayer.

Author: Arenholz Elke
Cao Hui
Chen Hanghui
Cui Zhangzhang
Dong Yongqi
Gilbert Dustin A
Grutter Alexander J
Guo Jinghua
Hu Zhenpeng
Kirby Brian J
Liu Yi-Sheng
Lu Yalin
Ma Jiaji
Shafer Padraic
Wang Jingyuan
Xia Jing
Zhai Xiaofang
Zhou Hua
Publication venue: eScholarship, University of California
Publication date: 01/04/2020
Field of study

Engineering magnetic anisotropy in two-dimensional systems has enormous scientific and technological implications. The uniaxial anisotropy universally exhibited by two-dimensional magnets has only two stable spin directions, demanding 180° spin switching between states. We demonstrate a previously unobserved eightfold anisotropy in magnetic SrRuO3 monolayers by inducing a spin reorientation in (SrRuO3)1/(SrTiO3) N superlattices, in which the magnetic easy axis of Ru spins is transformed from uniaxial 〈001〉 direction (N < 3) to eightfold 〈111〉 directions (N ≥ 3). This eightfold anisotropy enables 71° and 109° spin switching in SrRuO3 monolayers, analogous to 71° and 109° polarization switching in ferroelectric BiFeO3. First-principle calculations reveal that increasing the SrTiO3 layer thickness induces an emergent correlation-driven orbital ordering, tuning spin-orbit interactions and reorienting the SrRuO3 monolayer easy axis. Our work demonstrates that correlation effects can be exploited to substantially change spin-orbit interactions, stabilizing unprecedented properties in two-dimensional magnets and opening rich opportunities for low-power, multistate device applications

arXiv.org e-Print Archive

eScholarship - University of California