sglang news - Search News

AMD officially releases ROCm 6.3 with new features and optimizations

SGLang, Multi-Node FFT, new Vision Libraries, Fortran Compiler, and more. AMD has officially launched ROCm 6.3, the latest ...

10h

ROCm 6.3 adds several new features including a Fortran compiler, and SGLang

ROCm 6.3 adds several new features to the open source platform, helping accelerate various workloads on Instinct GPUs such as ...

The Next Platform · 17h

AMD ROCm 6.3 Has Goodies For AI Aficionados And HPC Gurus Alike

Speeds and feeds are great, but hardware is only as useful as the software that can harness it, and, for AMD, that’s the ROCm ...

insideHPC · 1d

AMD Releases ROCm Version 6.3

AMD today announced the release of ROCm Version 6.3 open-source platform, introducing tools and optimizations for AI, ML and HPC workloads on AMD Instinct GPU accelerators. ROCm 6.3 is engineered for ...

Analytics India Magazine · 14d

The CUDA Killer

While NVIDIA’s fame rests on its GPUs, the real magic comes from CUDA, the software it can’t do without. In a recent ...

marktechpost · 17d

RAGCache: Optimizing Retrieval-Augmented Generation with Dynamic Caching

Furthermore, when compared to SGLang, a high-performance LLM serving system, RAGCache still showed substantial improvements of up to 3.5× reduction in TTFT and 1.8× enhancement in throughput. These ...

GitHub · 18d

Add model support for Phi 3.5 MoE

GitHub · 29d

GPTQ based LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

zc142365/GPTQModel-Fork ...
