Q-Learning-Based Time-Critical Data Aggregation Scheduling in IoT

Vo, Van-Vi, Nguyen, Tien-Dung, Le, Duc-Tai, Choo, Hyunseung

Nov-25-2025–arXiv.org Artificial Intelligence

Time-critical data aggregation in Internet of Things (IoT) networks demands efficient, collision-free scheduling to minimize latency for applications like smart cities and industrial automation. Traditional heuristic methods, with two-phase tree construction and scheduling, often suffer from high computational overhead and suboptimal delays due to their static nature. To address this, we propose a novel Q-learning framework that unifies aggregation tree construction and scheduling, modeling the process as a Markov Decision Process (MDP) with hashed states for scalability. By leveraging a reward function that promotes large, interference-free batch transmissions, our approach dynamically learns optimal scheduling policies. Simulations on static networks with up to 300 nodes demonstrate up to 10.87% lower latency compared to a state-of-the-art heuristic algorithm, highlighting its robustness for delay-sensitive IoT applications. This framework enables timely insights in IoT environments, paving the way for scalable, low-latency data aggregation.

machine learning, node, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

Nov-25-2025

arXiv.org PDF

Add feedback

Country:
- Asia
  - South Korea > Gyeonggi-do
    - Suwon (0.04)
  - Vietnam > Hanoi
    - Hanoi (0.04)

Genre:
- Research Report (0.40)

Industry:
- Energy (0.46)
- Information Technology (0.35)
- Telecommunications (0.47)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.35)
    - Reinforcement Learning (1.00)
  - Internet of Things (1.00)