News
ONNX provides tools and libraries that enable you to apply compression techniques, such as pruning, quantization, and knowledge distillation, to your models. By compressing the models with ONNX, you ...
Microsoft Research AI today said it plans to open-source an optimized version of Google’s popular BERT natural language model designed to work with the ONNX Runtime inference engine. Microsoft ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results