Abstract: Collaborative inference among Internet-of-Things (IoT) devices can reduce deep neural network (DNN) inference latency by exploiting the devices' communication and computing resources.