codes1gn

Follow

💭

I may be slow to respond.

Heng Shi codes1gn

💭

I may be slow to respond.

Follow

Puzzle Puzzle Junkie Junkie

8 followers · 16 following

Shanghai
09:15 (UTC -12:00)

Achievements

Achievements

Pinned Loading

LancerLab/choreo LancerLab/choreo Public

An ultra-accessible DSL for high-performance kernel programming

C++ 4
LancerLab/kebab LancerLab/kebab Public

Dissecting Hopper/Ampere performance with microbenchmarks and Gemm best practices.

Cuda 3
sparse-gemm-with-hopper-sptc sparse-gemm-with-hopper-sptc Public

A minimal MatMul/Gemm case for using WGMMA + Structural Sparsity in Hopper

Cuda 2
Ragdoll Ragdoll Public

Ragdoll: Reusable Abstraction Language Layer for General-purpose & Domain-Oriented Computing

Jupyter Notebook 3
Raptors Raptors Public

Rust Actor model for Parallel System

Rust
Samoyeds Samoyeds Public

Forked from LancerLab/Samoyeds

Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor Cores (EuroSys'25)

Jupyter Notebook