
Modular MAX
The world's most performant AI execution engine and platform for heterogeneous compute.
Just now
Has API
PricingFreemium
Free to $49/yr
Model Quantization
Heterogeneous Hardware Inference
Kernel Fusion
Discover the strongest tools and workflows for model quantization.

The world's most performant AI execution engine and platform for heterogeneous compute.

The world's fastest deep learning inference optimizer and runtime for NVIDIA GPUs.