Skip to content
Navigation Menu
Toggle navigation
Sign in
Appearance settings
Platform
AI CODE CREATION
GitHub Copilot
Write better code with AI
GitHub Spark
Build and deploy intelligent apps
GitHub Models
Manage and compare prompts
MCP Registry
New
Integrate external tools
DEVELOPER WORKFLOWS
Actions
Automate any workflow
Codespaces
Instant dev environments
Issues
Plan and track work
Code Review
Manage code changes
APPLICATION SECURITY
GitHub Advanced Security
Find and fix vulnerabilities
Code security
Secure your code as you build
Secret protection
Stop leaks before they start
EXPLORE
Why GitHub
Documentation
Blog
Changelog
Marketplace
View all features
Solutions
BY COMPANY SIZE
Enterprises
Small and medium teams
Startups
Nonprofits
BY USE CASE
App Modernization
DevSecOps
DevOps
CI/CD
View all use cases
BY INDUSTRY
Healthcare
Financial services
Manufacturing
Government
View all industries
View all solutions
Resources
EXPLORE BY TOPIC
AI
Software Development
DevOps
Security
View all topics
EXPLORE BY TYPE
Customer stories
Events & webinars
Ebooks & reports
Business insights
GitHub Skills
SUPPORT & SERVICES
Documentation
Customer support
Community forum
Trust center
Partners
View all resources
Open Source
COMMUNITY
GitHub Sponsors
Fund open source developers
PROGRAMS
Security Lab
Maintainer Community
Accelerator
GitHub Stars
Archive Program
REPOSITORIES
Topics
Trending
Collections
Enterprise
ENTERPRISE SOLUTIONS
Enterprise platform
AI-powered developer platform
AVAILABLE ADD-ONS
GitHub Advanced Security
Enterprise-grade security features
Copilot for Business
Enterprise-grade AI features
Premium Support
Enterprise-grade 24/7 support
Pricing
Search or jump to...
Search code, repositories, users, issues, pull requests...
Search syntax tips
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Sign in
Sign up
Appearance settings
Resetting focus
You signed in with another tab or window.
Reload
to refresh your session.
You signed out in another tab or window.
Reload
to refresh your session.
You switched accounts on another tab or window.
Reload
to refresh your session.
Dismiss alert
{{ message }}
NVIDIA
/
Model-Optimizer
Public
Notifications
You must be signed in to change notification settings
Fork
403
Star
2.7k
Code
Issues
55
Pull requests
166
Actions
Security and quality
0
Insights
Additional navigation options
Code
Issues
Pull requests
Actions
Security and quality
Insights
[RFC] Model Optimizer - Product Roadmap
#146 ·
omrialmog
opened
on Mar 6, 2025
7
Issues
Search Issues
is
:
issue
state
:
open
is:issue state:open
Search
Labels
Milestones
New issue
Search results
Open
Closed
NVFP4 ONNX export fails when rotate=True (FWHT trace + post-processor)
bug
Something isn't working
Something isn't working
feature
Issue in actual ModelOpt feature
Issue in actual ModelOpt feature
onnx.quantization
torch.quantization
Status: Open.
#1424
In NVIDIA/Model-Optimizer;
·
Clemxxx
opened
on May 10, 2026
Add VLM components to _default_disabled_quantizer_cfg?
feature request
New feature or request
New feature or request
Status: Open.
#1396
In NVIDIA/Model-Optimizer;
·
harmya
opened
on May 6, 2026
Do you have plans to support Qwen Image Edit?
feature request
New feature or request
New feature or request
Status: Open.
#1366
In NVIDIA/Model-Optimizer;
·
lzcchl
opened
on Apr 29, 2026
[Feature Request] DeepSeek-V4-Flash & DeepSeek-V4-Pro NVFP4 checkpoint
feature request
New feature or request
New feature or request
Status: Open.
#1346
In NVIDIA/Model-Optimizer;
·
erhwenkuo
opened
on Apr 27, 2026
Logical conflict between data loading and collation.
question
Help is is needed
Help is is needed
Status: Open.
#1319
In NVIDIA/Model-Optimizer;
·
wenqibiao
opened
on Apr 22, 2026
[Feature Request] Kimi-K2.6 NVFP4 checkpoint and corresponding EAGLE3 draft model for offline speculative decoding
feature request
New feature or request
New feature or request
Status: Open.
#1308
In NVIDIA/Model-Optimizer;
·
dododream
opened
on Apr 21, 2026
Attention-only LoRA fine-tuning on NVFP4-quantized 122B MoE — community implementation + preprint
question
Help is is needed
Help is is needed
Status: Open.
#1294
In NVIDIA/Model-Optimizer;
·
ddreeselogs
opened
on Apr 19, 2026
GLM5.1 nvfp4 checkpoint
feature request
New feature or request
New feature or request
Status: Open.
#1283
In NVIDIA/Model-Optimizer;
·
functionstackx
opened
on Apr 16, 2026
What’s the Correct Way to Quantize Qwen3.5 (MoE/Dense) to NVFP4?
question
Help is is needed
Help is is needed
Status: Open.
#1255
In NVIDIA/Model-Optimizer;
·
seindum
opened
on Apr 14, 2026
Pre-Quantized Checkpoints: Gemma 4 models
feature request
New feature or request
New feature or request
Status: Open.
#1237
In NVIDIA/Model-Optimizer;
·
rnett
opened
on Apr 11, 2026
Add _QuantGemma4TextExperts plugin for fused 3D MoE expert quantization
feature request
New feature or request
New feature or request
torch.quantization
triaged
Status: Open.
#1173
In NVIDIA/Model-Optimizer;
·
marioiseli89
opened
on Apr 3, 2026
ONNX models generated by llm_export.py are missing some input and output nodes
bug
Something isn't working
Something isn't working
triaged
Status: Open.
#1147
In NVIDIA/Model-Optimizer;
·
idruker-cerence
opened
on Mar 31, 2026
You can’t perform that action at this time.