Keywords :
communications for machine learning; Estimation; goal-oriented communications; Measurement; Multi-agent systems; multi-agent systems; Quantization (signal); reinforcement learning; Semantic communications; Semantics; Task analysis; task-effective data compression; Topology; Centralized controllers; Communication for machine learning; Goal-oriented communication; Machine-learning; Reinforcement learnings; Semantic communication; State aggregation; Task-effective data compression; Signal Processing; Information Systems; Hardware and Architecture; Computer Science Applications; Computer Networks and Communications
Abstract :
[en] We consider a task-effective quantization problem that arises when multiple agents are controlled via a centralized controller (CC). While agents have to communicate their observations to the CC for decision-making, the bit-budgeted communications of agent-CC links may limit the task-effectiveness of the system which is measured by the system’s average sum of stage costs/rewards. As a result, each agent should compress/quantize its observation such that the average sum of stage costs/rewards of the control task is minimally impacted. We address the problem of maximizing the average sum of stage rewards by proposing two different Action-Based State Aggregation (ABSA) algorithms that carry out the indirect and joint design of control and communication policies in the multi-agent system. While the applicability of ABSA-1 is limited to single-agent systems, it provides an analytical framework that acts as a stepping stone to the design of ABSA-2. ABSA-2 carries out the joint design of control and communication for a multi-agent system. We evaluate the algorithms -with average return as the performance metric -using numerical experiments performed to solve a multi-agent geometric consensus problem. The numerical results are concluded by introducing a new metric that measures the effectiveness of communications in a multi-agent system.
Name of the research project :
U-AGR-7288 - C22/IS/17220888/RUTINE (01/09/2023 - 31/08/2026) - VU Thang Xuan
Scopus citations®
without self-citations
8