Skip to content
@sgl-project

sgl-project

Pinned Loading

  1. sglang sglang Public

    SGLang is a high-performance serving framework for large language models and multimodal models.

    Python 24.3k 4.8k

  2. sgl-learning-materials sgl-learning-materials Public

    Materials for learning SGLang

    772 57

  3. ome ome Public

    Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

    Go 392 62

  4. genai-bench genai-bench Public

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    Python 276 49

  5. SpecForge SpecForge Public

    Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

    Python 726 177

  6. sglang-jax sglang-jax Public

    JAX backend for SGL

    Python 248 75

Repositories

Showing 10 of 25 repositories
  • sglang Public

    SGLang is a high-performance serving framework for large language models and multimodal models.

    sgl-project/sglang’s past year of commit activity
    Python 24,339 Apache-2.0 4,764 603 (27 issues need help) 1,769 Updated Mar 12, 2026
  • sgl-project.github.io Public

    This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang

    sgl-project/sgl-project.github.io’s past year of commit activity
    HTML 113 29 10 1 Updated Mar 12, 2026
  • sgl-docs Public
    sgl-project/sgl-docs’s past year of commit activity
    MDX 4 Apache-2.0 15 0 0 Updated Mar 12, 2026
  • ome Public

    Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

    sgl-project/ome’s past year of commit activity
    Go 392 Apache-2.0 62 32 (2 issues need help) 44 Updated Mar 11, 2026
  • genai-bench Public

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    sgl-project/genai-bench’s past year of commit activity
    Python 276 MIT 49 6 10 Updated Mar 11, 2026
  • sgl-flash-attn Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    sgl-project/sgl-flash-attn’s past year of commit activity
    Python 20 BSD-3-Clause 2,513 0 0 Updated Mar 11, 2026
  • mini-sglang Public

    A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

    sgl-project/mini-sglang’s past year of commit activity
    Python 3,677 MIT 486 8 19 Updated Mar 11, 2026
  • sgl-cookbook Public

    Cookbook of SGLang - Recipe

    sgl-project/sgl-cookbook’s past year of commit activity
    JavaScript 97 Apache-2.0 44 6 (1 issue needs help) 10 Updated Mar 11, 2026
  • sgl-kernel-npu Public

    SGLang kernel library for NPU

    sgl-project/sgl-kernel-npu’s past year of commit activity
    C++ 104 MIT 91 15 39 Updated Mar 11, 2026
  • sglang-jax Public

    JAX backend for SGL

    sgl-project/sglang-jax’s past year of commit activity
    Python 248 Apache-2.0 75 88 (8 issues need help) 31 Updated Mar 11, 2026