Search CORE

204 research outputs found

Improved Algorithms for Online Rent Minimization Problem Under Unit-Size Jobs

Author: Sun Enze
Yang Zonghan
Zhang Yuhao
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 31st Annual European Symposium on Algorithms (ESA 2023)
Publication date: 01/01/2023
Field of study

We consider the Online Rent Minimization problem, where online jobs with release times, deadlines, and processing times must be scheduled on machines that can be rented for a fixed length period of T. The objective is to minimize the number of machine rents. This problem generalizes the Online Machine Minimization problem where machines can be rented for an infinite period, and both problems have an asymptotically optimal competitive ratio of O(log(p_max/p_min)) for general processing times, where p_max and p_min are the maximum and minimum processing times respectively. However, for small values of p_max/p_min, a better competitive ratio can be achieved by assuming unit-size jobs. Under this assumption, Devanur et al. (2014) gave an optimal e-competitive algorithm for Online Machine Minimization, and Chen and Zhang (2022) gave a (3e+7) ? 15.16-competitive algorithm for Online Rent Minimization. In this paper, we significantly improve the competitive ratio of the Online Rent Minimization problem under unit size to 6, by using a clean oracle-based online algorithm framework

Dagstuhl Research Online Publication Server

A Closer Look at the Adversarial Robustness of Deep Equilibrium Models

Author: Liu Yang
Pang Tianyu
Yang Zonghan
Publication venue
Publication date: 02/06/2023
Field of study

Deep equilibrium models (DEQs) refrain from the traditional layer-stacking paradigm and turn to find the fixed point of a single layer. DEQs have achieved promising performance on different applications with featured memory efficiency. At the same time, the adversarial vulnerability of DEQs raises concerns. Several works propose to certify robustness for monotone DEQs. However, limited efforts are devoted to studying empirical robustness for general DEQs. To this end, we observe that an adversarially trained DEQ requires more forward steps to arrive at the equilibrium state, or even violates its fixed-point structure. Besides, the forward and backward tracks of DEQs are misaligned due to the black-box solvers. These facts cause gradient obfuscation when applying the ready-made attacks to evaluate or adversarially train DEQs. Given this, we develop approaches to estimate the intermediate gradients of DEQs and integrate them into the attacking pipelines. Our approaches facilitate fully white-box evaluations and lead to effective adversarial defense for DEQs. Extensive experiments on CIFAR-10 validate the adversarial robustness of DEQs competitive with deep networks of similar sizes.Comment: Accepted at NeurIPS 2022. Our code is available at https://github.com/minicheshire/DEQ-White-Box-Robustnes

arXiv.org e-Print Archive

Spatial-Temporal Data Modeling with Graph Neural Networks

Author: Wu Zonghan
Publication venue
Publication date: 01/01/2021
Field of study

University of Technology Sydney. Faculty of Engineering and Information Technology.Spatial-temporal graph modeling is an important task to analyze the spatial relations and temporal trends of components in a system. It aims to model the dynamic node-level inputs by assuming inter-dependency between connected nodes. A basic assumption behind spatial-temporal graph modeling is that a node's future information is conditioned on its historical information as well as its neighbors' historical information. Therefore how to capture spatial and temporal dependencies simultaneously becomes a primary challenge. Current studies on spatial-temporal graph modeling face four major shortcomings: 1) Most graph neural networks only focus on the low frequency band of graph signals; 2) Current studies assume the graph structure of data reflects the genuine dependency relationships among nodes; 3) Existing studies on spatial-temporal graph neural networks are not applicable to pure multivariate time series data due to the absence of a predefined graph and lack of a general framework; 4) Existing approaches either model spatial-temporal dependencies locally or model spatial correlations and temporal correlations separately. The aim of this thesis is to study spatial-temporal data from the perspective of deep learning on graphs. I have studied the research objective in deep depth with four research questions: (1) How to coordinate the low, middle, and high frequency band of graph signals in graph convolution networks. (2) How to model spatial-temporal graph data effectively and efficiently; (3) How to handle spatial dependencies when a graph is totally missing, incomplete or inaccurate in spatial-temporal graph modeling; (4) In contrast to traditional spatial-temporal graph neural networks that handle spatial dependencies and temporal dependencies in separate, how to unify space and time as a whole in message passing. To address the aforementioned four research problems, I proposed four algorithms or models that can achieve satisfactory results. Specifically, I proposed an Automatic Graph Convolutional Network to learn graph frequency bands for graph convolution filters automatically; I introduced an efficient and effective framework that integrates diffusion graph convolution and dilated temporal convolution to capture spatial-temporal dependencies simultaneously. I developed a novel joint-learning algorithm that can capture spatial-temporal dependencies and learn latent graph structures at the same time; I designed a unified graph neural network that captures the inner spatial-temporal dependencies without compromising space-time integrity. To validate the proposed methods, I have conducted experiments on real-world datasets with a range of tasks including node classification, graph classification, and spatial-temporal graph forecasting. Experimental results demonstrate the effectiveness of the proposed methods

OPUS - University of Technology Sydney

XFlow: Benchmarking Flow Behaviors over Graphs

Author: Chen Zhiqian
Zhang Zijian
Zhang Zonghan
Publication venue
Publication date: 07/08/2023
Field of study

The occurrence of diffusion on a graph is a prevalent and significant phenomenon, as evidenced by the spread of rumors, influenza-like viruses, smart grid failures, and similar events. Comprehending the behaviors of flow is a formidable task, due to the intricate interplay between the distribution of seeds that initiate flow propagation, the propagation model, and the topology of the graph. The study of networks encompasses a diverse range of academic disciplines, including mathematics, physics, social science, and computer science. This interdisciplinary nature of network research is characterized by a high degree of specialization and compartmentalization, and the cooperation facilitated by them is inadequate. From a machine learning standpoint, there is a deficiency in a cohesive platform for assessing algorithms across various domains. One of the primary obstacles to current research in this field is the absence of a comprehensive curated benchmark suite to study the flow behaviors under network scenarios. To address this disparity, we propose the implementation of a novel benchmark suite that encompasses a variety of tasks, baseline models, graph datasets, and evaluation tools. In addition, we present a comprehensive analytical framework that offers a generalized approach to numerous flow-related tasks across diverse domains, serving as a blueprint and roadmap. Drawing upon the outcomes of our empirical investigation, we analyze the advantages and disadvantages of current foundational models, and we underscore potential avenues for further study. The datasets, code, and baseline models have been made available for the public at: https://github.com/XGraphing/XFlo

arXiv.org e-Print Archive

Improving Adversarial Robustness of DEQs with Explicit Regulations Along the Neural Dynamics

Author: Li Peng
Liu Yang
Pang Tianyu
Yang Zonghan
Publication venue
Publication date: 02/06/2023
Field of study

Deep equilibrium (DEQ) models replace the multiple-layer stacking of conventional deep networks with a fixed-point iteration of a single-layer transformation. Having been demonstrated to be competitive in a variety of real-world scenarios, the adversarial robustness of general DEQs becomes increasingly crucial for their reliable deployment. Existing works improve the robustness of general DEQ models with the widely-used adversarial training (AT) framework, but they fail to exploit the structural uniquenesses of DEQ models. To this end, we interpret DEQs through the lens of neural dynamics and find that AT under-regulates intermediate states. Besides, the intermediate states typically provide predictions with a high prediction entropy. Informed by the correlation between the entropy of dynamical systems and their stability properties, we propose reducing prediction entropy by progressively updating inputs along the neural dynamics. During AT, we also utilize random intermediate states to compute the loss function. Our methods regulate the neural dynamics of DEQ models in this manner. Extensive experiments demonstrate that our methods substantially increase the robustness of DEQ models and even outperform the strong deep network baselines.Comment: Accepted at ICML 2023. Our code is available at https://github.com/minicheshire/DEQ-Regulating-Neural-Dynamic

arXiv.org e-Print Archive

Synergistic toughening of hard, nacre-mimetic MoSi2 coatings by self-assembled hierarchical structure

Author: Munroe Paul
Xie Zonghan
Xu Jiang
Zhao Xiaoli
Publication venue: Edith Cowan University, Research Online, Perth, Western Australia
Publication date: 01/01/2014
Field of study

Like many other intermetallic materials, MoSi2 coatings are typically hard, but prone to catastrophic failure due to their low toughness at ambient temperature. In this paper, a self-assembled hierarchical structure that closely resembles that of nacre (i.e., mother of pearl) was developed in a MoSi2 -based coating through a simple, yet cost-effective, depostion technique. The newly formed coating is tough and can withstand multiple indentations at high loads. Key design features responsible for this remarkable outcome were identified. They include a functionally graded multilayer featuring elastic modulus oscillation, varying sublayer thickness and a columnar structure that are able to attenuate stress concentrations; interlocking boundaries between adjacent sublayers that improve the bonding and arrest the cracks; a transitional layer that bridges the coating and substrate and facilitates load transfer. Moreover, the contributions of six important structural characteristics to damage resistance are quantified using finite elemnet analysis and in an additive manner (i.e., from low- to high-level complexity). The in-situ toughened coating is envisaged to enhance the mechanical performance and extend the lifespan of metal components used in safety-critical applications

Crossref

Adelaide Research & Scholarship

PubMed Central

Research Online @ ECU