🔨 Build Notes

  • Enable exactly one GPU back-end that matches your device (CUBLAS, METAL, OPENCL, …).

  • Very large models benefit from more VRAM, but OptiML's hybrid placement reduces how much is required.

  • Use quantization to lower memory use; on PC-class hardware it often improves speed as well.

  • Build in release mode (-DCMAKE_BUILD_TYPE=Release) for best performance.
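
To give a rough sense of why quantization lowers memory use, the sketch below estimates the space taken by model weights alone at different bit widths. It is not part of OptiML: the function name `weight_memory_gib` and the 7-billion-parameter model size are illustrative assumptions, and the estimate ignores per-block scale factors, the KV cache, and activations, so real usage will be somewhat higher.

```python
def weight_memory_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GiB.

    Assumes every weight is stored at the same bit width and ignores
    quantization metadata (scales, zero points), so treat the result
    as a lower bound.
    """
    bytes_total = n_params * bits_per_weight / 8
    return bytes_total / 2**30


if __name__ == "__main__":
    n_params = 7e9  # hypothetical 7-billion-parameter model
    for label, bits in [("FP16", 16), ("8-bit", 8), ("4-bit", 4)]:
        print(f"{label:>5}: {weight_memory_gib(n_params, bits):.1f} GiB")
```

Under these assumptions, going from 16-bit to 4-bit weights cuts the weight footprint to a quarter, which is often the difference between a model fitting in VRAM or not.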