Michael Goin
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
updated
a model
8 days ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
new activity
8 days ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4:Fix invalid config
new activity
8 days ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4:Fix invalid config