Simple plans. Real scale.
Start free, scale as you grow. Every plan includes monthly credits you can spend on fine-tuning, evals, gateway routing, and observability.
Free
Everything you need to evaluate Inference.net and ship your first project.
What's included
- $1 monthly credit grant
- 1 deployment Active deployments
- 2 / mo Training jobs
- 500 / mo Eval samples
- 1M / mo Gateway inference requests
- 50 GB / mo Gateway inference data
- 1M / mo OTEL spans
- 50 GB / mo OTEL span data
- Pay-as-you-go serverless inference
Starter
Higher limits, monthly credit, and room to grow beyond proof-of-concept.
What's included
- 1 deployment Active deployments
- 10 / mo Training jobs
- 1K / mo Eval samples
- 10M / mo Gateway inference requests
- 500 GB / mo Gateway inference data
- 10M / mo OTEL spans
- 500 GB / mo OTEL span data
- Pay-as-you-go serverless inference
Growth
Scaled limits and committed credits for teams running production workloads.
What's included
- 1 deployment Active deployments
- 25 / mo Training jobs
- 10K / mo Eval samples
- 50M / mo Gateway inference requests
- 2.5 TB / mo Gateway inference data
- 50M / mo OTEL spans
- 2.5 TB / mo OTEL span data
- Pay-as-you-go serverless inference
Enterprise
Talk with our team.
Custom contracts, dedicated infrastructure, and a direct line to our team. Talk to our team about committed-use pricing, dedicated support, and bespoke deployment limits.
Includes
- Custom contracts and committed-use pricing
- Dedicated infrastructure and deployment limits
- Direct support channel with our team
- Custom models trained for your workload
Meet with our research team
Schedule a call with our research team to learn more about how Specialized Language Models can cut costs and improve performance.