PyTorch: keep the GPU busy

We all know it’s important to use GPU resources efficiently, especially during inference. One easy and highly effective way to achieve this is to reorder some of your inference logic to exploit PyTorch’s asynchronous GPU operations. This becomes especially...