Search CORE

31 research outputs found

Efficient Approximation Algorithms for Multi-Antennae Largest Weight Data Retrieval

Author: Guo Longkun
Shen Hong
Zhu Wenxing
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

In a mobile network, wireless data broadcast over

m

channels (frequencies) is a powerful means for distributed dissemination of data to clients who access the channels through multi-antennae equipped on their mobile devices. The

\delta

-antennae largest weight data retrieval (

\delta

ALWDR) problem is to compute a schedule for downloading a subset of data items that has a maximum total weight using

\delta

antennae in a given time interval. In this paper, we propose a ratio

1-\frac{1}{e}-\epsilon

approximation algorithm for the

\delta

-antennae largest weight data retrieval (

\delta

ALWDR) problem that has the same ratio as the known result but a significantly improved time complexity of

O(2^{\frac{1}{\epsilon}}\frac{1}{\epsilon}m^{7}T^{3.5}L)

from

O(\epsilon^{3.5}m^{\frac{3.5}{\epsilon}}T^{3.5}L)

when

\delta=1

\cite{lu2014data}. To our knowledge, our algorithm represents the first ratio

1-\frac{1}{e}-\epsilon

approximation solution to

\delta

ALWDR for the general case of arbitrary

\delta

. To achieve this, we first give a ratio

1-\frac{1}{e}

algorithm for the

\gamma

-separated

\delta

ALWDR (

\delta

\gamma

LWDR) with runtime

O(m^{7}T^{3.5}L)

, under the assumption that every data item appears at most once in each segment of

\delta

\gamma

LWDR, for any input of maximum length

L

m

channels in

T

time slots. Then, we show that we can retain the same ratio for

\delta

\gamma

LWDR without this assumption at the cost of increased time complexity to

O(2^{\gamma}m^{7}T^{3.5}L)

. This result immediately yields an approximation solution of same ratio and time complexity for

\delta

ALWDR, presenting a significant improvement of the known time complexity of ratio

1-\frac{1}{e}-\epsilon

approximation to the problem

arXiv.org e-Print Archive

Adelaide Research & Scholarship

Efficient algorithm for the k-means problem with Must-Link and Cannot-Link constraints

Author: Guo Longkun
Jia Chaoqi
Liao Kewen
Lu Zhigang
Publication venue: Tsinghua University
Publication date: 01/01/2023
Field of study

Constrained clustering, such as k -means with instance-level Must-Link (ML) and Cannot-Link (CL) auxiliary information as the constraints, has been extensively studied recently, due to its broad applications in data science and AI. Despite some heuristic approaches, there has not been any algorithm providing a non-trivial approximation ratio to the constrained k -means problem. To address this issue, we propose an algorithm with a provable approximation ratio of O(logk) when only ML constraints are considered. We also empirically evaluate the performance of our algorithm on real-world datasets having artificial ML and disjoint CL constraints. The experimental results show that our algorithm outperforms the existing greedy-based heuristic methods in clustering accuracy

ACU Research Bank

Convergence and Correctness of Belief Propagation for Weighted Min-Max Flow

Author: Dai Guowei
Guo Longkun
Gutin Gregory
Zhang Xiaoyan
Zhang Zanbo
Publication venue
Publication date: 01/01/2022
Field of study

Royal Holloway - Pure

Acceleration for Timing-Aware Gate-Level Logic Simulation with One-Pass GPU Parallelism

Author: Fang Weijie
Fu Yanggeng
Gao Jiaquan
Guo Longkun
Gutin Gregory
Zhang Xiaoyan
Publication venue
Publication date: 26/04/2023
Field of study

Witnessing the advancing scale and complexity of chip design and benefiting from high-performance computation technologies, the simulation of Very Large Scale Integration (VLSI) Circuits imposes an increasing requirement for acceleration through parallel computing with GPU devices. However, the conventional parallel strategies do not fully align with modern GPU abilities, leading to new challenges in the parallelism of VLSI simulation when using GPU, despite some previous successful demonstrations of significant acceleration. In this paper, we propose a novel approach to accelerate 4-value logic timing-aware gate-level logic simulation using waveform-based GPU parallelism. Our approach utilizes a new strategy that can effectively handle the dependency between tasks during the parallelism, reducing the synchronization requirement between CPU and GPU when parallelizing the simulation on combinational circuits. This approach requires only one round of data transfer and hence achieves one-pass parallelism. Moreover, to overcome the difficulty within the adoption of our strategy in GPU devices, we design a series of data structures and tune them to dynamically allocate and store new-generated output with uncertain scale. Finally, experiments are carried out on industrial-scale open-source benchmarks to demonstrate the performance gain of our approach compared to several state-of-the-art baselines

arXiv.org e-Print Archive

Improved approximation algorithms for computing $k$ k disjoint paths subject to two constraints

Author: Longkun Guo
Hong Shen
Kewen Liao
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2013
Field of study

Crossref

Ezid

On the shallow-light Steiner tree problem

Author: Guo Longkun
Liao Kewen
Shen Hong
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2014
Field of study

Let G = (V, E) be a given graph with nonnegative integral edge cost and delay, S ⊆ V be a terminal set and r ∈ S be the selected root. The shallow-light Steiner tree (SLST) problem is to compute a minimum cost tree spanning the terminals of S, such that the delay between r and every other terminal is bounded by a given delay constraint D ∈ ℤ 0 + . It is known that the SLST problem is NP-hard and unless NP ⊆ DTIME(n log log n ) there exists no approximation algorithm with ratio (1, γ log2 n) for some fixed γ > 0 [12]. Nevertheless, under the same assumption it admits no approximation ratio better than (1, γ log 2 n) for some fixed γ > 0 even when D = 2 [2]. This paper first gives an exact algorithm with time complexity O(3 t nD + 2 t n 2 D 2 + n 3 D 3 ), where n and t are the numbers of vertices and terminals of the given graph respectively. This is a pseudo polynomial time parameterized algorithm with respect to the parameterization “number of terminals”. Later, this algorithm is improved to a parameterized approximation algorithm with a time complexity O(3 t n 2 /∈ + 2 t n 4 /∈ 2 + n 6 /∈ 3 ) and a bifactor approximation ratio (1 + ∈, 1). That is, for any small real number ∈ > 0, the algorithm computes a Steiner tree with delay and cost bounded by (1 + ∈)D and the optimum cost respectively

Crossref

ACU Research Bank

Adelaide Research & Scholarship