Quantization on Text Matrix

https://155a386f.text-matrix.pages.dev/tags/quantization/
Recent content in Quantization on Text Matrix
Generator: Hugo
Language: zh-cn
Last updated: Wed, 08 Apr 2026 23:16:10 +0800

The Complete Guide to Quantization: From Principles to LLM Practice
https://155a386f.text-matrix.pages.dev/posts/tech/llm/quantization-llm-model-compression-guide/
Sun, 29 Mar 2026 23:28:00 +0800

<h1 id="quantization-量化技术完全指南从原理到-llm-实战">The Complete Guide to Quantization: From Principles to LLM Practice</h1> <blockquote> <p><strong>Target readers</strong>: developers who want to understand quantization in depth and reduce the size of large models. <strong>Core question</strong>: how can a 159GB model be compressed enough to run on a laptop while losing only 5-10% accuracy?</p>