inference-optimization

Here are 20 public repositories matching this topic...

google / XNNPACK

High-efficiency floating-point neural network inference operators for mobile, server, and Web

cpu neural-network inference multithreading simd matrix-multiplication neural-networks convolutional-neural-networks convolutional-neural-network inference-optimization mobile-inference

Updated Mar 28, 2023
C

jiazhihao / TASO

Star

The Tensor Algebra SuperOptimizer for Deep Learning

deep-neural-networks deep-learning inference-optimization

Updated Jan 26, 2023
C++

alibaba / BladeDISC

Star

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

machine-learning deep-learning neural-network compiler tensorflow pytorch inference-optimization mlir

Updated Mar 28, 2023
C++

Oulu-IMEDS / pytorch_bn_fusion

Star

Batch normalization fusion for PyTorch

deep-neural-networks deep-learning pytorch batch-normalization inference-optimization

Updated Apr 6, 2020
Python

mit-han-lab / inter-operator-scheduler

Star

[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration

acceleration cnn parallelism inference-optimization

Updated Apr 27, 2022
C++

ZFTurbo / Keras-inference-time-optimizer

Star

Optimize layers structure of Keras model to reduce computation time

keras inference-optimization

Updated Jul 18, 2020
Python

graphsignal / graphsignal-python

Star

Graphsignal Python agent

python debugging machine-learning monitoring deep-learning tensorflow inference pytorch artificial-intelligence openai jax inference-optimization huggingface onnxruntime langchain

Updated Mar 26, 2023
Python

Rapternmn / PyTorch-Onnx-Tensorrt

Star

A set of tool which would make your life easier with Tensorrt and Onnxruntime. This Repo is designed for YoloV3

pytorch darknet tensorrt onnx onnx-torch yolov3 inference-optimization onnxruntime

Updated Dec 31, 2019
Python

lmaxwell / Armednn

Star

cross-platform modular neural network inference library, small and efficient

neural-network eigen lstm inference-engine eigen3 inference-optimization conv1d

Updated Apr 27, 2022
C++

Bisonai / ncnn

Star

Modified inference engine for quantized convolution using product quantization

quantization product-quantization edge-machine-learning inference-optimization mobile-deep-learning inference-acceleration

Updated Jul 1, 2022
C++

sjlee25 / batch-partitioning

Star

Batch Partitioning for Multi-PE Inference with TVM (2020)

deep-learning data-parallelism tvm inference-optimization dl-optimization dl-compiler

Updated Dec 17, 2022
Python

zhliuworks / Fast-MobileNetV2

Star

🤖️ Optimized CUDA Kernels for Fast MobileNetV2 Inference

cuda-kernels mobilenet-v2 inference-optimization

Updated Dec 28, 2021
Cuda

effrosyni-papanastasiou / constrained-em

Star

A constrained expectation-maximization algorithm for feasible graph inference.

network-inference expectation-maximization feasibility expectation-maximisation-algorithm inference-optimization

Updated Jun 10, 2021
Jupyter Notebook

cedrickchee / pytorch-mobile-ios

Star

PyTorch Mobile: iOS examples

machine-learning ios-app inference-optimization libtorch edge-ai pytorch-mobile

Updated Oct 10, 2019
Swift

aalbaali / LieBatch

Star

Batch estimation on Lie groups

lie-groups state-estimation g2o inference-optimization batch-optimization

Updated Dec 17, 2021
MATLAB

piotrostr / infer-trt

Star

Interface for TensorRT engines inference along with an example of YOLOv4 engine being used.

deep-learning object-detection tensorrt inference-optimization

Updated May 7, 2022
Python

kiritigowda / mivisionx-inference-analyzer

Star

MIVisionX Python Inference Analyzer uses pre-trained ONNX/NNEF/Caffe models to analyze inference results and summarize individual image results

Updated Nov 17, 2020
Python

ieee820 / ncnn

Star

ncnn is a high-performance neural network inference framework optimized for the mobile platform

arm-neon mobile-networks ncnn inference-optimization

Updated May 29, 2019
C++

cedrickchee / pytorch-mobile-android

Star

PyTorch Mobile: Android examples of usage in applications

machine-learning android-app inference-optimization libtorch edge-ai pytorch-mobile

Updated Oct 10, 2019
Java

goshaQ / inference-optimizer

Star

A simple tool that applies structure-level optimizations (e.g. Quantization) to a TensorFlow model

tensorflow tensorflow-models inference-optimization

Updated Aug 13, 2018
Python

Improve this page

Add a description, image, and links to the inference-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the inference-optimization topic, visit your repo's landing page and select "manage topics."

Learn more

Feb	MAR	Apr
	28
2022	2023	2024