| Carbon footprint monitoring can provide data support for power carbon emission reduction by deploying large-scale sensing devices to accurately monitor and track carbon emission data from sources,network,and loads.In the process of massive sensing devices access to base station(BS),the existing uplink access management techniques are not suitable.Fast uplink grant possesses the advantages of reducing signaling overhead and access conflicts,which can well meet the communication access needs of carbon monitoring.However,large-scale carbon footprint monitoring devices access still faces many challenges,such as incomplete global state information(GSI)and differentiated quality of service(QoS)requirements.The joint optimization of long-term priority constraint and device access management still faces several challenges.First,considering the prohibitive signaling overhead and privacy concerns,it is difficult to obtain the perfect GSI of the entire network.Then,carbon footprint monitoring involves various services with differentiated priorities.Without fine-granularity access management,it is difficult to simultaneously guarantee requirements of low-priority and high-priority services.Finally,access management is not convex due to the coupling between the long-term service priority constraint and the short-term access delay minimization.Existing convex optimization and integer programming-based approaches are difficult to be applied directly.In this paper,we formulate the maximum access queuing delay minimization problem under the long-term service priority constraint and the short-term access management constraint.Lyapunov optimization is leveraged to decouple the long-term service priority constraint and short-term access management optimization,and decompose the long-term stochastic optimization problem into a series of short-term deterministic problems.Then,a priority-aware deep Q-network(DQN)-based fast uplink grant access management(PDAC)algorithm is proposed to achieve intelligent access management with differentiated service priority requirements.PD AC utilizes DQN to handle nonconvex high-dimensional optimization problem with service priority constraint to achieve intelligent access management and priority awareness.Simulation results demonstrate that PDAC outperforms the existing algorithms in access queuing delay,buffer queue backlog,and priority deficit fluctuation. |