The Wayback Machine - https://web.archive.org/web/20200809210536/https://github.com/topics/inference-optimization
Here are 11 public repositories matching this topic...
High-efficiency floating-point neural network inference operators for mobile, server, and Web
The Tensor Algebra SuperOptimizer for Deep Learning
Batch normalization fusion for PyTorch
Updated Apr 6, 2020 (Python)
Optimize the layer structure of a Keras model to reduce computation time
Updated Jul 18, 2020 (Python)
A set of tools that make it easier to work with TensorRT and ONNX Runtime. This repo is designed for YOLOv3.
Updated Dec 31, 2019 (Python)
Modified inference engine for quantized convolution using product quantization
ncnn is a high-performance neural network inference framework optimized for the mobile platform
PyTorch Mobile: Android examples of usage in applications
Updated Oct 10, 2019 (Java)
PyTorch Mobile: iOS examples
Updated Oct 10, 2019 (Swift)
A simple tool that applies structure-level optimizations (e.g. Quantization) to a TensorFlow model
Updated Aug 13, 2018 (Python)
MIVisionX Python Inference Analyzer using pre-trained ONNX/NNEF/Caffe models to analyze and summarize images
Updated Dec 10, 2019 (Python)