Search CORE

166 research outputs found

A General Static Binary Rewriting Framework for WebAssembly

Author: Cao Shangtong
Guo Yao
He Ningyu
Wang Haoyu
Publication venue
Publication date: 02/05/2023
Field of study

Binary rewriting is a widely adopted technique in software analysis. WebAssembly (Wasm), as an emerging bytecode format, has attracted great attention from our community. Unfortunately, there is no general-purpose binary rewriting framework for Wasm, and existing effort on Wasm binary modification is error-prone and tedious. In this paper, we present BREWasm, the first general purpose static binary rewriting framework for Wasm, which has addressed inherent challenges of Wasm rewriting including high complicated binary structure, strict static syntax verification, and coupling among sections. We perform extensive evaluation on diverse Wasm applications to show the efficiency, correctness and effectiveness of BREWasm. We further show the promising direction of implementing a diverse set of binary rewriting tasks based on BREWasm in an effortless and user-friendly manner

arXiv.org e-Print Archive

Turning a CLIP Model into a Scene Text Spotter

Author: Bai Xiang
Cao Haoyu
Liu Yuliang
Sun Xing
Yu Wenwen
Zhu Xingkui
Publication venue
Publication date: 20/08/2023
Field of study

We exploit the potential of the large-scale Contrastive Language-Image Pretraining (CLIP) model to enhance scene text detection and spotting tasks, transforming it into a robust backbone, FastTCM-CR50. This backbone utilizes visual prompt learning and cross-attention in CLIP to extract image and text-based prior knowledge. Using predefined and learnable prompts, FastTCM-CR50 introduces an instance-language matching process to enhance the synergy between image and text embeddings, thereby refining text regions. Our Bimodal Similarity Matching (BSM) module facilitates dynamic language prompt generation, enabling offline computations and improving performance. FastTCM-CR50 offers several advantages: 1) It can enhance existing text detectors and spotters, improving performance by an average of 1.7% and 1.5%, respectively. 2) It outperforms the previous TCM-CR50 backbone, yielding an average improvement of 0.2% and 0.56% in text detection and spotting tasks, along with a 48.5% increase in inference speed. 3) It showcases robust few-shot training capabilities. Utilizing only 10% of the supervised data, FastTCM-CR50 improves performance by an average of 26.5% and 5.5% for text detection and spotting tasks, respectively. 4) It consistently enhances performance on out-of-distribution text detection and spotting datasets, particularly the NightTime-ArT subset from ICDAR2019-ArT and the DOTA dataset for oriented object detection. The code is available at https://github.com/wenwenyu/TCM.Comment: arXiv admin note: text overlap with arXiv:2302.1433

arXiv.org e-Print Archive

Influence of the Martian crustal magnetic fields on the Mars-solar wind interaction and plasma transport

Author: Guokan Li
Haoyu Lu
Haoyu Lu
Jinbin Cao
Jinbin Cao
Shibang Li
Yihui Song
Yun Li
Yun Li
Publication venue: 'Frontiers Media SA'
Publication date: 01/04/2023
Field of study

The plasma transport process is important for the ionosphere of Mars, which controls the structure of the ionosphere above an altitude of 200 km. Plasma transport from the dayside ionosphere is crucial for producing the nightside ionosphere on Mars. The alteration in dayside plasma transport in the presence of crustal fields may influence the distribution of Martian ionospheric plasma and plasma escape in the magnetotail. This study employed a three-dimensional multispecies magnetohydrodynamic (MHD) model to simulate Mars-solar wind interactions. We show the magnetic field distribution and plasma velocity variation on the Martian day-side. The results indicate that the ion transport from low- to high-solar-zenith-angle areas in the south is inhibited by crustal fields, leading to a reduction in the ion number density and a thinner ionosphere near the southern terminator. Many heavy ions remain in the southern dayside ionosphere rather than moving to the nightside. In addition, the maximum reduction in the tailward flux of the planetary ions calculated by the MHD simulation is more than 50% at the southern terminator, indicating an inhibitory effect of the crustal fields on day-to-night transport. These effects may lead to a reduction in ion number density in the southern nightside ionosphere. Finally, we demonstrate a decrease in the global heavy-ion loss rate

Directory of Open Access Journals

Machine Learning Prediction of Glass Transition Temperature of Conjugated Polymers From Chemical Structure

Author: Alesadi Amirhadi
Cao Zhiqiang
Gu Xiaodan
Li Zhaofan
Xia Wenjie
Zhang Song
Zhao Haoyu
Publication venue: The Aquila Digital Community
Publication date: 15/06/2022
Field of study

Predicting the glass transition temperature (Tg) is of critical importance as it governs the thermomechanical performance of conjugated polymers (CPs). Here, we report a predictive modeling framework to predict Tg of CPs through the integration of machine learning (ML), molecular dynamics (MD) simulations, and experiments. With 154 Tg data collected, an ML model is developed by taking simplified “geometry” of six chemical building blocks as molecular features, where side-chain fraction, isolated rings, fused rings, and bridged rings features are identified as the dominant ones for Tg. MD simulations further unravel the fundamental roles of those chemical building blocks in dynamical heterogeneity and local mobility of CPs at a molecular level. The developed ML model is demonstrated for its capability of predicting Tg of several new high-performance solar cell materials to a good approximation. The established predictive framework facilitates the design and prediction of Tg of complex CPs, paving the way for addressing device stability issues that have hampered the field from developing stable organic electronics

Aquila Digital Community

Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration

Author: Bao Changcun
Cao Haoyu
Chen Huang
Jiang Deqiang
Liu Chaohu
Liu Hao
Liu Yinsong
Sun Xing
Yin Kun
Publication venue
Publication date: 03/09/2023
Field of study

We propose a novel end-to-end document understanding model called SeRum (SElective Region Understanding Model) for extracting meaningful information from document images, including document analysis, retrieval, and office automation. Unlike state-of-the-art approaches that rely on multi-stage technical schemes and are computationally expensive, SeRum converts document image understanding and recognition tasks into a local decoding process of the visual tokens of interest, using a content-aware token merge module. This mechanism enables the model to pay more attention to regions of interest generated by the query decoder, improving the model's effectiveness and speeding up the decoding speed of the generative scheme. We also designed several pre-training tasks to enhance the understanding and local awareness of the model. Experimental results demonstrate that SeRum achieves state-of-the-art performance on document understanding tasks and competitive results on text spotting tasks. SeRum represents a substantial advancement towards enabling efficient and effective end-to-end document understanding.Comment: Accepted to ICCV 2023 main conferenc

arXiv.org e-Print Archive

Characterizing and Detecting WebAssembly Runtime Bugs

Author: Chen Zhenpeng
Dongliang Mu
Gang Huang
Haoyu Wang
Shangtong Cao
Xiapu Luo
Xuanzhe Liu
Yixuan Zhang
Yun Ma
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 30/09/2023
Field of study

WebAssembly (abbreviated WASM) has emerged as a promising language of the Web and also been used for a wide spectrum of software applications such as mobile applications and desktop applications. These applications, named as WASM applications, commonly run in WASM runtimes. Bugs in WASM runtimes are frequently reported by developers and cause the crash of WASM applications. However, these bugs have not been well studied. To fill in the knowledge gap, we present a systematic study to characterize and detect bugs in WASM runtimes. We first harvest a dataset of 311 real-world bugs from hundreds of related posts on GitHub. Based on the collected high-quality bug reports, we distill 31 bug categories of WASM runtimes and summarize their common fix strategies. Furthermore, we develop a pattern-based bug detection framework to automatically detect bugs in WASM runtimes. We apply the detection framework to seven popular WASM runtimes and successfully uncover 60 bugs that have never been reported previously, among which 13 have been confirmed and 9 have been fixed by runtime developers

UCL Discovery