Abstract: Reinforcement learning (RL) has been increasingly adopted in IoT systems for tasks such as resource allocation and control. However, in privacy-critical and resource-constrained environments ...