Issues · NVIDIA/Model-Optimizer · GitHub

[RFC] Model Optimizer - Product Roadmap
#146 · omrialmog opened on Mar 6, 2025
7

Labels Milestones New issue

NVFP4 ONNX export fails when rotate=True (FWHT trace + post-processor)

onnx.quantization

torch.quantization

#1424

· Clemxxx opened

on May 10, 2026

Add VLM components to _default_disabled_quantizer_cfg?

feature request

#1396

· harmya opened

on May 6, 2026

Do you have plans to support Qwen Image Edit?

feature request

#1366

· lzcchl opened

on Apr 29, 2026

[Feature Request] DeepSeek-V4-Flash & DeepSeek-V4-Pro NVFP4 checkpoint

feature request

#1346

· erhwenkuo opened

on Apr 27, 2026

Logical conflict between data loading and collation.

#1319

· wenqibiao opened

on Apr 22, 2026

[Feature Request] Kimi-K2.6 NVFP4 checkpoint and corresponding EAGLE3 draft model for offline speculative decoding

feature request

#1308

· dododream opened

on Apr 21, 2026

Attention-only LoRA fine-tuning on NVFP4-quantized 122B MoE — community implementation + preprint

#1294

· ddreeselogs opened

on Apr 19, 2026

GLM5.1 nvfp4 checkpoint

feature request

#1283

· functionstackx opened

on Apr 16, 2026

What’s the Correct Way to Quantize Qwen3.5 (MoE/Dense) to NVFP4?

#1255

· seindum opened

on Apr 14, 2026

Pre-Quantized Checkpoints: Gemma 4 models

feature request

#1237

· rnett opened

on Apr 11, 2026

Add _QuantGemma4TextExperts plugin for fused 3D MoE expert quantization

feature request

torch.quantization

#1173

· marioiseli89 opened

on Apr 3, 2026

ONNX models generated by llm_export.py are missing some input and output nodes

#1147

· idruker-cerence opened

on Mar 31, 2026