ilumiere

Follow

💭

I may be slow to respond.

ilumiere

💭

I may be slow to respond.

Follow

explorer

6 followers · 27 following

Pinned Loading

lmdeploy lmdeploy Public

Forked from InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python
ollama ollama Public

Forked from ollama/ollama

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Go
sglang sglang Public

Forked from sgl-project/sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Python
text-generation-inference text-generation-inference Public

Forked from huggingface/text-generation-inference

Large Language Model Text Generation Inference

Python
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
llama.cpp llama.cpp Public

Forked from ggml-org/llama.cpp

LLM inference in C/C++

C++