Further references: - https://github.com/Microsoft/onnxruntime/issues/133 - https://github.com/Microsoft/onnxruntime/tree/master/onnxruntime/test/perftest - https://cloudblogs.microsoft.com/opensource/2020/01/21/microsoft-onnx-open-source-optimizations-transformer-inference-gpu-cpu/