Search CORE

90 research outputs found

Efficient Deep Reinforcement Learning via Adaptive Policy Transfer

Author: Cheng Yingfeng
Fan Changjie
Hao Jianye
Hu Yujing
Liu Wulong
Meng Zhaopeng
Peng Jiajie
Wang Weixun
Wang Zhaodong
Yang Tianpei
Zhang Zongzhang
Publication venue
Publication date: 25/05/2020
Field of study

Transfer Learning (TL) has shown great potential to accelerate Reinforcement Learning (RL) by leveraging prior knowledge from past learned policies of relevant tasks. Existing transfer approaches either explicitly computes the similarity between tasks or select appropriate source policies to provide guided explorations for the target task. However, how to directly optimize the target policy by alternatively utilizing knowledge from appropriate source policies without explicitly measuring the similarity is currently missing. In this paper, we propose a novel Policy Transfer Framework (PTF) to accelerate RL by taking advantage of this idea. Our framework learns when and which source policy is the best to reuse for the target policy and when to terminate it by modeling multi-policy transfer as the option learning problem. PTF can be easily combined with existing deep RL approaches. Experimental results show it significantly accelerates the learning process and surpasses state-of-the-art policy transfer methods in terms of learning efficiency and final performance in both discrete and continuous action spaces.Comment: Accepted by IJCAI'202

arXiv.org e-Print Archive

Crossref

Aegis: A Lightning Fast Privacy-preserving Machine Learning Platform against Malicious Adversaries

Author: Bingsheng Zhang
Kui Ren
Lichun Li
Tianpei Lu
Publication venue: International Association for Cryptologic Research (IACR)
Publication date: 18/01/2024
Field of study

Privacy-preserving machine learning (PPML) techniques have gained significant popularity in the past years. Those protocols have been widely adopted in many real-world security-sensitive machine learning scenarios, e.g., medical care and finance. In this work, we introduce

\mathsf{Aegis}

~-- a high-performance PPML platform built on top of a maliciously secure 3-PC framework over ring

\mathbb{Z}_{2^\ell}

. In particular, we propose a novel 2-round secure comparison (a.k.a., sign bit extraction) protocol in the preprocessing model. The communication of its semi-honest version is only 25% of the state-of-the-art (SOTA) constant-round semi-honest comparison protocol by Zhou et al.(S&P 2023); both communication and round complexity of its malicious version are approximately 50% of the SOTA (BLAZE) by Patra and Suresh (NDSS 2020), for

\ell=64

. Moreover, the communication of our maliciously secure inner product protocol is merely

3\ell

bits, reducing 50% from the SOTA (Swift) by Koti et al. (USENIX 2021). Finally, the resulting ReLU and MaxPool PPML protocols outperform the SOTA by

4\times

in the semi-honest setting and

10\times

in the malicious setting, respectively

Cryptology ePrint Archive

Placement and Routing in 3D Integrated Circuits

Author: B. Goplen
C. Ababei
H. Mogal
K. Bazargan
S. Sapatnekar
Tianpei Zhang
Yan Feng
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Multi-party Private Function Evaluation for RAM

Author: Bingsheng Zhang
Keyu Ji
Kui Ren
Tianpei Lu
Publication venue: International Association for Cryptologic Research (IACR)
Publication date: 29/01/2023
Field of study

Private function evaluation (PFE) is a special type of MPC protocols that, in addition to the input privacy, can preserve the function privacy. In this work, we propose a PFE scheme for RAM. In particular, we first design an efficient 4-server distributed ORAM scheme with amortized communication

O(\log n)

per access (both reading and writing). We then simulate a RISC RAM machine over the MPC platform, hiding (i) the memory access pattern, (ii) the machine state (including registers, program counter, condition flag, etc.), and (iii) the executed instructions. Our scheme can naturally support a simplified TinyRAM instruction set; if a public RAM program

P

with given inputs

x

needs to execute

z

instruction cycles, our PFE scheme is able to securely evaluate

P(x)

on private

P

and

x

within

5z+1

online rounds. We prototype and benchmark our system for set intersection, binary search, quicksort, and heapsort algorithms. For instance, to obliviously perform the binary search algorithm on a

2^{10}

array takes

5.81s

with function privacy

Cryptology ePrint Archive

UC Secure Private Branching Program and Decision Tree Evaluation

Author: Bingsheng Zhang
Keyu Ji
Kui Ren
Lichun Li
Tianpei Lu
Publication venue: International Association for Cryptologic Research (IACR)
Publication date: 01/11/2022
Field of study

Branching program (BP) is a DAG-based non-uniform computational model for L/poly class. It has been widely used in formal verification, logic synthesis, and data analysis. As a special BP, a decision tree is a popular machine learning classifier for its effectiveness and simplicity. In this work, we propose a UC-secure efficient 3-party computation platform for outsourced branching program and/or decision tree evaluation. We construct a constant-round protocol and a linear-round protocol. In particular, the overall (online + offline) communication cost of our linear-round protocol is

O(d(\ell + \log m+\log n))

and its round complexity is

2d-1

, where

m

is the DAG size,

n

is the number of features,

\ell

is the feature length, and

d

is the longest path length. To enable efficient oblivious hopping among the DAG nodes, we propose a lightweight

1

-out-of-

N

shared OT protocol with logarithmic communication in both online and offline phase. This partial result may be of independent interest to some other cryptographic protocols. Our benchmark shows, compared with the state-of-the-arts, the proposed constant-round protocol is up to 10X faster in the WAN setting, while the proposed linear-round protocol is up to 15X faster in the LAN setting

Cryptology ePrint Archive

Simultaneous shield and buffer insertion for crosstalk noise reduction in global routing

Author: Sachin S Sapatnekar
Tianpei Zhang
Publication venue
Publication date: 01/01/2007
Field of study

Abstract We present a method for incorporating crosstalk reduction criteri

CiteSeerX

Buffering global interconnects in structured ASIC design

Author: Sachin S. Sapatnekar
Tianpei Zhang
Publication venue
Publication date: 01/01/2005
Field of study

Structured ASICs present an attractive alternative to reducing design costs and turnaround times in nanometer designs. As with conventional ASICs, such designs require global wires to be buffered. However, via-programmable designs must prefabricate and preplace buffers in the layout. This paper proposes a novel and accurate statistical estimation technique for distributing prefabricated buffers through a layout. It employs Rent’s rule to estimate the buffer distribution required for the layout, so that an appropriate structured ASIC may be selected for the design. Experimental results show that the buffer distribution estimation is accurate and economic, and that a uniform buffer distribution can maintain a high degree of regularity in design and shows a good timing performance, comparable with nonuniform buffer distribution. Key words: Structured ASIC, Rent’s rule, buffer insertion, interconnect, physical design

CiteSeerX

Crossref