view article Article Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face +2 Oct 16, 2025 β’ 18
view article Article Introducing AutoRound: Intelβs Advanced Quantization for LLMs and VLMs +7 Apr 29, 2025 β’ 43
view article Article Benchmarking Language Model Performance on 5th Gen Xeon at GCP +1 Dec 17, 2024 β’ 7
view article Article AMD + π€: Large Language Models Out-of-the-Box Acceleration with AMD GPU +4 Dec 5, 2023 β’ 4
view article Article Overview of natively supported quantization schemes in π€ Transformers +3 Sep 12, 2023 β’ 13
view article Article Overview of natively supported quantization schemes in π€ Transformers +3 Sep 12, 2023 β’ 13