论文标题

用硬期限的边缘推理系统的动态压缩比选择

Dynamic Compression Ratio Selection for Edge Inference Systems with Hard Deadlines

论文作者

Huang, Xiufeng, Zhou, Sheng

论文摘要

在物联网(IoT)设备上实施机器学习算法已成为新兴应用程序(例如自动驾驶,环境监视)至关重要的。但是,计算能力和能耗的局限性使得在物联网设备上运行复杂的机器学习算法变得困难,尤其是在存在延迟截止日期时。一种解决方案是将计算密集型任务卸载到边缘服务器。但是,原始数据的无线上传时间很耗时,可能导致截止日期违规。为了降低通信成本,可以利用有损耗的数据压缩来进行推理任务,但可能带来更多错误的推理结果。在本文中,我们为带有硬期限的边缘推理系统提出了动态压缩比选择方案。关键的想法是平衡沟通成本和推理准确性之间的权衡。通过使用剩余的截止日期预算进行排队任务的剩余截止预算,通过动态选择最佳压缩率,可以在有限的通信资源下及时完成更多任务。此外,提出了通过错误的推断重新传播任务数据较少压缩数据的信息,以提高准确性性能。虽然通常很难知道推理的正确性,但我们使用不确定性来估计推理的信心,并基于此,共同优化了信息增强和压缩比选择。最后,考虑到无线传输错误,我们进一步设计了一个重传方案,以减少由于数据包损失而导致的性能降解。仿真结果表明,在不同的截止日期和任务到达率下,提出的方案的性能。

Implementing machine learning algorithms on Internet of things (IoT) devices has become essential for emerging applications, such as autonomous driving, environment monitoring. But the limitations of computation capability and energy consumption make it difficult to run complex machine learning algorithms on IoT devices, especially when latency deadline exists. One solution is to offload the computation intensive tasks to the edge server. However, the wireless uploading of the raw data is time consuming and may lead to deadline violation. To reduce the communication cost, lossy data compression can be exploited for inference tasks, but may bring more erroneous inference results. In this paper, we propose a dynamic compression ratio selection scheme for edge inference system with hard deadlines. The key idea is to balance the tradeoff between communication cost and inference accuracy. By dynamically selecting the optimal compression ratio with the remaining deadline budgets for queued tasks, more tasks can be timely completed with correct inference under limited communication resources. Furthermore, information augmentation that retransmits less compressed data of task with erroneous inference, is proposed to enhance the accuracy performance. While it is often hard to know the correctness of inference, we use uncertainty to estimate the confidence of the inference, and based on that, jointly optimize the information augmentation and compression ratio selection. Lastly, considering the wireless transmission errors, we further design a retransmission scheme to reduce performance degradation due to packet losses. Simulation results show the performance of the proposed schemes under different deadlines and task arrival rates.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源