Search CORE

88 research outputs found

The convergence of PM2.5 concentration in Chinese cities: a distribution dynamic approach

Author: He Qiuqin
Lin Weiran
Yu Haijing
Publication venue: Taylor and Francis Group and Juraj Dobrila University of Pula, Faculty of economics and tourism Dr. Mijo Mirković
Publication date: 01/08/2021
Field of study

To fill the gap in the research on the convergence trend of air pollutants since 2013 in China and overcome the Galton fallacy caused by the parametric regression method, this study examines the convergence trend of the annual average concentration of fine particulate matter 2.5 (PM2.5) in China’s prefecture-level cities after 2013 using a distribution dynamic approach. The winter PM2.5 pollution in Chinese cities is severe. Hence, the convergence of the average winter PM2.5 concentration of prefecturelevel cities is also explored in this study. The results show that during 2015–2019, the annual average PM2.5 concentration level improved significantly. However, the average PM2.5 winter concentration level in 2015–2018 did not significantly decrease, with some cities showing severe pollution levels. The annual average PM2.5 of China’s prefecture-level cities exhibit club convergence, while the PM2.5 concentration in winter exhibits ‘unikurtosis’. In the long run, the annual average PM2.5 clusters around two levels, at approximately 35 lg/m3 and 60 lg/m3 , while the average PM2.5 in winter is concentrated at 100 lg/m3 . In the long run, in the central region, PM2.5 pollution is more severe than in northern and southern areas, regardless of the annual or winter average PM2.5 concentration

Directory of Open Access Journals

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

AutoEval-Video: An Automatic Benchmark for Assessing Large Vision Language Models in Open-Ended Video Question Answering

Author: Chen Xiuyuan
Huang Weiran
Lin Yuan
Zhang Yuchen
Publication venue
Publication date: 24/11/2023
Field of study

We propose a novel and challenging benchmark, AutoEval-Video, to comprehensively evaluate large vision-language models in open-ended video question answering. The comprehensiveness of AutoEval-Video is demonstrated in two aspects: 1) AutoEval-Video constructs open-ended video-questions across 9 skill dimensions, addressing capabilities of perception, comprehension, and generation. 2) AutoEval-Video contains newly collected videos that cover over 40 distinct themes. To efficiently evaluate responses to the open-ended questions, we employ an LLM-based evaluation approach, but instead of merely providing a reference answer, we annotate unique evaluation rules for every single instance (video-question pair). To maximize the robustness of these rules, we develop a novel adversarial annotation mechanism. By using instance-specific rules as prompt, GPT-4, as an automatic evaluator, can achieve a stable evaluation accuracy of around 97.0\%, comparable to the 94.9\% - 97.5\% accuracy of a human evaluator. Furthermore, we assess the performance of eight large vision-language models on AutoEval-Video. Among them, GPT-4V(ision) significantly outperforms other models, achieving an accuracy of 32.2\%. However, there is still substantial room for improvement compared to human accuracy of 72.8\%. By conducting an extensive case study, we uncover several drawbacks of GPT-4V, such as limited temporal and dynamic comprehension, and overly general responses. Code is available at \href{https://github.com/Xiuyuan-Chen/AutoEval-Video}{\color{magenta}https://github.com/Xiuyuan-Chen/AutoEval-Video}

arXiv.org e-Print Archive

Nonparametric Discrete Choice Experiments with Machine Learning Guided Adaptive Design

Author: Gao Ruijiang
Lin Weiran
Shugan Steven M.
Yin Mingzhang
Publication venue
Publication date: 18/10/2023
Field of study

Designing products to meet consumers' preferences is essential for a business's success. We propose the Gradient-based Survey (GBS), a discrete choice experiment for multiattribute product design. The experiment elicits consumer preferences through a sequence of paired comparisons for partial profiles. GBS adaptively constructs paired comparison questions based on the respondents' previous choices. Unlike the traditional random utility maximization paradigm, GBS is robust to model misspecification by not requiring a parametric utility model. Cross-pollinating the machine learning and experiment design, GBS is scalable to products with hundreds of attributes and can design personalized products for heterogeneous consumers. We demonstrate the advantage of GBS in accuracy and sample efficiency compared to the existing parametric and nonparametric methods in simulations

arXiv.org e-Print Archive

Group-based Robustness: A General Framework for Customized Robustness in the Real World

Author: Bauer Lujo
Eyal Neo
Lin Weiran
Lucas Keane
Reiter Michael K.
Sharif Mahmood
Publication venue
Publication date: 28/06/2023
Field of study

Machine-learning models are known to be vulnerable to evasion attacks that perturb model inputs to induce misclassifications. In this work, we identify real-world scenarios where the true threat cannot be assessed accurately by existing attacks. Specifically, we find that conventional metrics measuring targeted and untargeted robustness do not appropriately reflect a model's ability to withstand attacks from one set of source classes to another set of target classes. To address the shortcomings of existing methods, we formally define a new metric, termed group-based robustness, that complements existing metrics and is better-suited for evaluating model performance in certain attack scenarios. We show empirically that group-based robustness allows us to distinguish between models' vulnerability against specific threat models in situations where traditional robustness metrics do not apply. Moreover, to measure group-based robustness efficiently and accurately, we 1) propose two loss functions and 2) identify three new attack strategies. We show empirically that with comparable success rates, finding evasive samples using our new loss functions saves computation by a factor as large as the number of targeted classes, and finding evasive samples using our new attack strategies saves time by up to 99\% compared to brute-force search methods. Finally, we propose a defense method that increases group-based robustness by up to 3.52

\times

arXiv.org e-Print Archive

Primary lung tumor segmentation from PET–CT volumes with spatial–topological constraint

Author: Cui Hui
Eberl Stefan
Feng Dagan
Fulham Michael
Lin Weiran
Wang Xiuying
Zhou Jianlong
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/07/2015
Field of study

Sydney eScholarship

A Framework for Real-Time Service-Oriented Architecture

Author: Lin Kwei-Jay
Nie Weiran
Panahi Mark
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/07/2009
Field of study

Service-oriented architectures (SOA), though widely accepted in a variety of industries, must be enhanced to support real-time activities in order to gain even greater adoption.We present RT-Llama, a novel architecture for real-time SOA to support predictability in business processes. Based on a user-specified process and deadline, our architecture, containing global resource management and business process composition components, can reserve resources in advance for each service in the process to ensure it meets its end-to-end deadline. This is facilitated by also creating a real-time enterprise middleware that manages utilization of local resources by using efficient data structures and handles service requests via reserved CPU bandwidth. We demonstrate that RT-Llama’s reservation components are both efficient and adaptable to dynamic real-time environments

Crossref

eScholarship - University of California

Driving behavior-guided battery health monitoring for electric vehicles using machine learning

Author: Jiang Nanhua
Jiang Weiran
Khoo Edwin
Lin Jing
Ren Yao
Song Ziyou
Zhang Jiawei
Publication venue
Publication date: 25/09/2023
Field of study

An accurate estimation of the state of health (SOH) of batteries is critical to ensuring the safe and reliable operation of electric vehicles (EVs). Feature-based machine learning methods have exhibited enormous potential for rapidly and precisely monitoring battery health status. However, simultaneously using various health indicators (HIs) may weaken estimation performance due to feature redundancy. Furthermore, ignoring real-world driving behaviors can lead to inaccurate estimation results as some features are rarely accessible in practical scenarios. To address these issues, we proposed a feature-based machine learning pipeline for reliable battery health monitoring, enabled by evaluating the acquisition probability of features under real-world driving conditions. We first summarized and analyzed various individual HIs with mechanism-related interpretations, which provide insightful guidance on how these features relate to battery degradation modes. Moreover, all features were carefully evaluated and screened based on estimation accuracy and correlation analysis on three public battery degradation datasets. Finally, the scenario-based feature fusion and acquisition probability-based practicality evaluation method construct a useful tool for feature extraction with consideration of driving behaviors. This work highlights the importance of balancing the performance and practicality of HIs during the development of feature-based battery health monitoring algorithms

arXiv.org e-Print Archive

Efficient Private Multiset ID Protocols

Author: Bolin Ding
Cong Zhang
Dongdai Lin
Weiran Liu
Publication venue: International Association for Cryptologic Research (IACR)
Publication date: 24/06/2023
Field of study

Private-ID (PID) protocol enables two parties, each holding a private set of items, to privately compute a set of random universal identifiers (UID) corresponding to the records in the union of their sets, where each party additionally learns which UIDs correspond to which items in its set but not if they belong to the intersection or not. PID is very useful in the privacy computation of databases query, e.g. inner join and join for compute. Known PID protocols all assume the input of both parties is a set. In the case of join, a more common scenario is that one party\u27s primary key (unique) needs to join the other party\u27s foreign key (duplicate). How to construct an efficient Private Multiset ID (PMID) protocol to support the above \emph{key-foreign key join} remains open. We resolve this problem by constructing efficient PMID protocols from Oblivious PRF, Private Set Union, and a newly introduced primitive called Deterministic-Value Oblivious Programmable PRF (dv-OPPRF). We also propose some PMID applications, including Private Inner Join, Private Full Join, and Private Join for Compute. We implement our PMID protocols and state-of-the-art PID protocols as performance baselines. The experiments show that the performances of our PMID are almost the same as the state-of-the-art PIDs when we set the multiplicity

U_x = U_y = 1

. Our PMID protocols scale well when either

U_x > 1

U_y > 1

. The performances also correctly reflect excessive data expansion when both

U_x, U_y > 1

for the more general \emph{cross join} case

Cryptology ePrint Archive

Linear Private Set Union from Multi-Query Reverse Private Membership Test

Author: Cong Zhang
Dongdai Lin
Min Zhang
Weiran Liu
Yu Chen
Publication venue: International Association for Cryptologic Research (IACR)
Publication date: 02/12/2022
Field of study

Private set union (PSU) protocol enables two parties, each holding a set, to compute the union of their sets without revealing anything else to either party. So far, there are two known approaches for constructing PSU protocols. The first mainly depends on additively homomorphic encryption (AHE), which is generally inefficient since it needs to perform a non-constant number of homomorphic computations on each item. The second is mainly based on oblivious transfer and symmetric-key operations, which is recently proposed by Kolesnikov et al. (ASIACRYPT 2019). It features good practical performance, which is several orders of magnitude faster than the first one. However, neither of these two approaches is optimal in the sense that their computation and communication complexity are not both

O(n)

, where

n

is the size of the set. Therefore, the problem of constructing the optimal PSU protocol remains open. In this work, we resolve this open problem by proposing a generic framework of PSU from oblivious transfer and a newly introduced protocol called multi-query reverse private membership test (mq-RPMT). We present two generic constructions of mq-RPMT. The first is based on symmetric-key encryption and general 2PC techniques. The second is based on re-randomizable public-key encryption. Both constructions lead to PSU with linear computation and communication complexity. We implement our two PSU protocols and compare them with the state-of-the-art PSU. Experiments show that our PKE-based protocol has the lowest communication of all schemes, which is

3.7-14.8\times

lower depending on set size. The running time of our PSU scheme is

1.2-12\times

faster than that of state-of-the-art depending on network environments

Cryptology ePrint Archive

Encode and Permute that Database! Single-Server Private Information Retrieval with Constant Online Time, Communication, and Client-Side Storage

Author: Aiping Liang
Cong Zhang
Dongdai Lin
Lei Zhang
Liqiang Peng
Shuaishuai Li
Weiran Liu
Xinwei Gao
Yuan Hong
Publication venue: International Association for Cryptologic Research (IACR)
Publication date: 24/10/2023
Field of study

Private Information Retrieval (PIR) facilitates the retrieval of database entries by a client from a remote server without revealing which specific entry is being queried. The preprocessing model has emerged as a significant technique for constructing efficient PIR systems, allowing parties to execute a one-time, query-independent offline phase, and then a fast online retrieval phase. In particular, Corrigan-Gibbs and Kogan (EUROCRYPT 2020) presented a new framework for constructing PIR with sublinear online time. Nevertheless, their protocol is deemed impractical in the single-server setting due to the heavy use of Fully Homomorphic Encryption (FHE). More recently, two state-of-the-art (SOTA) single-server PIR protocols (Zhou et al., S&P 2024 and Mughees-Ren, ePrint 2023) have eliminated FHE, at the price of linear offline communication. However, the client-side storage is still relatively large (

\tilde{O}(\sqrt{n})

), which poses challenges to practical deployment, especially when the client has limited computation and storage capabilities. To address such limitation, we propose a novel PIR protocol Pai, which only requires constant online time, communication, and client-side storage. The price we pay is only a

1

5\times

increase in offline communication, which would be acceptable since it is a one-time cost.Building upon our Pai, we also present a Symmetric KPIR (KSPIR) PaiKSPIR and a Chargeable KSPIR (CKSPIR) PaiCKSPIR. These two variants of PIR offer enhanced functionalities while maintaining computational complexities similar to the original Pai. In addition to providing rigorous theoretical proofs of correctness and privacy for Pai, we have undertaken comprehensive protocol implementations and conducted extensive experiments to validate their high efficiency. Our empirical findings demonstrate that our protocols achieve notably higher online efficiency than SOTA protocols, e.g., Pai exhibits

8.8

91.8\times

better online communication cost and

2.5

8.8\times

better online time. Given the superior online time and storage, our protocol is well-suited for practical deployment

Cryptology ePrint Archive