Towards Trustworthy and Fresh Data Delivery in 6G IoT: A DRL-aided Cognitive NOMA and Backscatter FrameworkShow others and affiliations
2026 (English)In: IEEE Internet of Things Journal, ISSN 2327-4662, Vol. 13, no 5, p. 8092-8107Article in journal (Refereed) Published
Abstract [en]
The proliferation of large-scale Internet-of-things (IoT) deployments and the emergence of 6G wireless technologies have created a pressing need for intelligent, energy-aware, and low-latency communication frameworks. In this work, we propose a novel two-phase reinforcement learning (RL)-based architecture designed to minimize the age of information (AoI) in 6G-enabled IoT networks. Our approach integrates (i) a deep deterministic policy gradient (DDPG)-driven backscatter-assisted cognitive radio non-orthogonal multiple access (CR-NOMA) scheme in the uplink, and (ii) a lightweight Q-learning-based power-domain NOMA (PD-NOMA) strategy for the downlink. In the uplink, energy harvesting (EH) sensors employ deep RL to jointly optimize backscatter reflection coefficients and transmission scheduling over shared spectrum using CR-NOMA. This enables energy-efficient communication and reduced AoI under dynamic energy and channel conditions. In the downlink, the edge node serves multiple IoT users simultaneously using PD-NOMA, where a Q-learning agent intelligently decides whether to transmit fresh or cached data to each user based on battery levels, channel quality, and information freshness. Both phases are modeled as Markov decision processes (MDPs), allowing agents to learn independently and converge toward optimal policies that balance information freshness, spectral efficiency (SE), and energy constraints. Extensive simulations demonstrate that the proposed framework effectively reduces AoI across both phases, with consistent convergence even under varying sensor densities and EH conditions. Moreover, by relying on explainable and verifiable learning mechanisms, our model addresses emerging concerns around reliability and trustworthiness in artificial intelligence (AI)-driven 6G-IoT systems. This framework represents a step toward scalable, adaptive, and responsible AI integration for future mission-critical IoT applications.
Place, publisher, year, edition, pages
IEEE conference proceedings, 2026. Vol. 13, no 5, p. 8092-8107
Keywords [en]
Age Of Information (aoi), Cognitive Radio Non-orthogonal Multiple Access (cr-noma), Deep Deterministic Policy Gradient (ddpg), Internet-of-things (iot), Markov Decision Processes (mdps), Power-domain Noma (pd-noma), Reinforcement Learning (rl), Spectral Efficiency (se) And Artificial Intelligence (ai), Backscattering, Behavioral Research, Cognitive Systems, Deep Learning, Energy Efficiency, Energy Harvesting, Intelligent Agents, Internet Of Things, Markov Processes, Memory Architecture, Mobile Telecommunication Systems, Network Architecture, Power Management, Radio Systems, Reinforcement Learning, Trusted Computing, Age Of Information, Cognitive Radio Non-orthogonal Multiple Access, Deep Deterministic Policy Gradient, Deterministics, Internet-of-thing, Markov Decision Process, Markov Decision Processes, Multiple Access, Non-orthogonal, Policy Gradient, Power-domain Noma, Powerdomains, Reinforcement Learnings, Spectral Efficiencies, Spectral Efficiency And Artificial Intelligence, Cognitive Radio
National Category
Communication Systems
Identifiers
URN: urn:nbn:se:miun:diva-55624DOI: 10.1109/JIOT.2025.3611868ISI: 001696600200009Scopus ID: 2-s2.0-105016718705OAI: oai:DiVA.org:miun-55624DiVA, id: diva2:2002438
2025-09-302025-09-302026-03-09Bibliographically approved