Minimum Throughput Maximization for Multi-UAV Enabled WPCN: A Deep Reinforcement Learning Method

Abstract

This paper investigates joint unmanned aerial vehicle (UAV) trajectory planning and time resource allocation for minimum throughput maximization in a multiple UAV-enabled wireless powered communication network (WPCN). In particular, the UAVs perform as base stations (BS) to broadcast energy signals in the downlink to charge IoT devices, while the IoT devices send their independent information in the uplink by utilizing the collected energy. The formulated throughput optimization problem which involves joint optimization of 3D path design and channel resource assignment with the constraint of flight speed of UAVs and uplink transmit power of IoT devices, is not convex and thus is extremely difficult to solve directly. We take advantage of the multi-agent deep Q learning (DQL) strategy and propose a novel algorithm to tackle this problem. Simulation results indicate that the proposed DQL-based algorithm significantly improve performance gain in terms of minimum throughput maximization compared with the conventional WPCN scheme

    Similar works