sift

sift is a standalone Rust CLI and hybrid information retrieval (IR) system for local document retrieval in agentic workflows. It searches raw local corpora without a long-running daemon, uses a composable search strategy architecture, and employs a Zig-style heuristic cache (file metadata is checked before any content is hashed) so that repeated queries return near-instantly.

The core idea is simple: point sift at a directory, extract text on demand, and run a layered search pipeline (Expansion, Retrieval, Fusion, Reranking). There is no external database, no daemon, and no background indexing service.

For the project background and design rationale, read the introductory post: Sift: Local Hybrid Search Without the Infrastructure Tax.

Current Contract

Single Rust Binary: No external database, daemon, or long-running service.
Pure-Rust Toolchain: No C++ dependencies, enabling easy static binary distribution.
High Performance: SIMD-accelerated scoring and memory-mapped I/O for fast retrieval on large corpora.
Default Strategy: Uses the page-index-hybrid champion preset (Lexical + Semantic).
Layered Pipeline: Query Expansion -> Retrieval -> Fusion -> Reranking.
Heuristic Incremental Caching: Uses mtime, inode, and size to bypass extraction and hashing for unchanged files.
Fully Processed Assets: Cache blobs contain text, term stats, and pre-computed dense vector embeddings, enabling search at dot-product speeds.
Inference & Reranking: Runs locally through Candle with support for both dense embeddings and advanced LLM reranking (e.g., Qwen2.5).
Supported Inputs: Text, HTML, PDF, and OOXML files (.docx, .xlsx, .pptx).
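
The caching items in this contract can be read as a tiered lookup: compare file metadata first, fall back to a content hash, and only extract on a true miss. The following is a minimal sketch of that idea, not sift's implementation. `FileStamp`, `Cache`, and `Lookup` are hypothetical names, the cache lives in memory, the "blob" is just the file text, and std's `DefaultHasher` stands in for a real content hash such as BLAKE3 so the example needs no external crates.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashMap;
use std::hash::{Hash, Hasher};

/// Cheap filesystem fingerprint (hypothetical type): mtime, inode, and size.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct FileStamp {
    pub mtime_nanos: u128,
    pub inode: u64,
    pub size: u64,
}

/// Outcome of a cache lookup for one file; each variant carries the content key.
#[derive(Debug, PartialEq, Eq)]
pub enum Lookup {
    /// Metadata matched: the blob is reused and the file is never read.
    HeuristicHit(u64),
    /// Metadata changed, but a blob for this content already exists.
    ContentHit(u64),
    /// Unseen content: a new blob was produced and stored.
    Miss(u64),
}

/// Hypothetical in-memory cache; a real one would persist blobs to disk.
#[derive(Default)]
pub struct Cache {
    manifest: HashMap<String, (FileStamp, u64)>, // path -> (stamp, content key)
    blobs: HashMap<u64, String>,                 // content key -> processed blob
}

/// Assumption: DefaultHasher is a stand-in for a cryptographic content hash.
fn content_key(bytes: &[u8]) -> u64 {
    let mut hasher = DefaultHasher::new();
    bytes.hash(&mut hasher);
    hasher.finish()
}

impl Cache {
    /// `read` is invoked only when the metadata fast path fails.
    pub fn lookup(
        &mut self,
        path: &str,
        stamp: FileStamp,
        read: impl FnOnce() -> Vec<u8>,
    ) -> Lookup {
        // Tier 1: unchanged metadata means no read and no hash.
        if let Some((old, key)) = self.manifest.get(path) {
            if *old == stamp && self.blobs.contains_key(key) {
                return Lookup::HeuristicHit(*key);
            }
        }
        // Tier 2: hash the content and consult the content-addressed store.
        let bytes = read();
        let key = content_key(&bytes);
        self.manifest.insert(path.to_string(), (stamp, key));
        if self.blobs.contains_key(&key) {
            Lookup::ContentHit(key)
        } else {
            // Tier 3: placeholder for "extract + embed + save blob".
            self.blobs
                .insert(key, String::from_utf8_lossy(&bytes).into_owned());
            Lookup::Miss(key)
        }
    }
}
```

Passing the file reader as a closure keeps the fast path honest: if the metadata matches, the closure is never called, so no bytes are read or hashed.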

Installation

Homebrew (macOS and Linux)

brew tap rupurt/homebrew-tap
brew install sift

One-liner Install (macOS and Linux)

curl --proto '=https' --tlsv1.2 -LsSf https://github.com/rupurt/sift/releases/latest/download/sift-installer.sh | sh

Manual Download

Download the latest pre-built binaries and installers for your platform from the GitHub Releases page. We provide:

Linux: .tar.gz archives plus the cross-platform shell installer
macOS: .tar.gz archives plus the cross-platform shell installer
Windows: .zip archives, .msi, and the PowerShell installer

Embedded Library

sift can also be embedded from another Rust project. The supported library contract lives at the crate root:

Sift, SiftBuilder
SearchInput, SearchOptions
Retriever, Fusion, Reranking
SearchResponse, SearchHit, ScoreConfidence

Everything under sift::internal exists to support the bundled executable, benchmarks, and repository-internal tests. It is not part of the supported embedding contract and may change without notice.

Runnable Example Consumer

examples/sift-embed is the canonical runnable embedding reference. It is a standalone Rust crate that depends on sift through the crate-root facade and exposes a minimal sift-embed CLI.

From the repo root:

just embed-build
just embed-search tests/fixtures/rich-docs "architecture decision"

The example crate accepts the same arguments when run directly through cargo:

cargo run --manifest-path examples/sift-embed/Cargo.toml -- search "hybrid search"
cargo run --manifest-path examples/sift-embed/Cargo.toml -- search tests/fixtures/rich-docs "architecture decision"

If PATH is omitted, sift-embed search "<term>" searches the current directory. See examples/sift-embed/README.md for the full runnable example notes.

Add The Dependency

Use the git repository until a versioned registry release is part of your delivery path:

[dependencies]
sift = { git = "https://github.com/rupurt/sift" }

For local development against a checked-out copy:

[dependencies]
sift = { path = "../sift" }

Minimal Embedding Example

use sift::{Retriever, Reranking, SearchInput, SearchOptions, Sift};

fn main() -> anyhow::Result<()> {
    // Build a Sift instance with the builder's defaults.
    let sift = Sift::builder().build();

    // Search ./docs with a lexical-only configuration:
    // BM25 retrieval and no reranking.
    let response = sift.search(
        SearchInput::new("./docs", "hybrid search")
            .with_options(
                SearchOptions::default()
                    .with_strategy("bm25")
                    .with_retrievers(vec![Retriever::Bm25])
                    .with_reranking(Reranking::None)
                    .with_limit(5)
                    .with_shortlist(5),
            ),
    )?;

    // Rendering is up to the embedder; here each hit is printed as "rank path".
    for hit in response.results {
        println!("{} {}", hit.rank, hit.path.display());
    }

    Ok(())
}

Embedders should consume SearchResponse directly and own their own rendering. Helpers under sift::internal are executable support code, not stable library API.

How Sift Works

At runtime, sift orchestrates a high-performance asset pipeline:

flowchart TD
  A[CLI parse] --> B[Resolve PATH + QUERY]
  B --> C[Directory Walk]
  
  subgraph Pipeline ["Optimized Asset Pipeline"]
    C1{Heuristic Hit?} -->|Yes| C2[Mapped I/O: Load Document Blob]
    C1 -->|No| C3[BLAKE3 Hash + Check CAS]
    C3 -->|Hit| C4[Update Manifest + Load Blob]
    C3 -->|Miss| C5[Extract + Embed + Save Blob]
  end

  C --> C1
  C2 --> D[Build Transient Corpus]
  C4 --> D
  C5 --> D

  subgraph Strategy ["Layered Search Strategy"]
    D --> E{Expansion}
    E -->|None| E1[Direct Query]
    E -->|HyDE| E2[LLM: Hypothetical Answer]
    E -->|SPLADE| E3[LLM: Generative Expansion]
    E -->|Classified| E4[LLM: Intent Classification]
    
    E1 --> F{Retrievers}
    E2 --> F
    E3 --> F
    E4 --> F
    
    F -->|Lexical| F1[BM25]
    F -->|Exact| F2[Phrase]
    F -->|Semantic| F3[Vector - SIMD Dot Product]
    
    subgraph QueryCache ["Query Embedding Cache"]
      F3
    end

    F1 --> G[Reciprocal Rank Fusion]
    F2 --> G
    F3 --> G
    G --> H{Reranking}
    H -->|Basic| H1[Position-Aware]
    H -->|Advanced| H2[LLM - Deep Semantic Pass]
  end

  H1 --> I[Colorized CLI Output]
  H2 --> I
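
The fusion step in the diagram is labelled Reciprocal Rank Fusion (RRF). As a rough illustration of the technique, the sketch below merges ranked lists by summing `1 / (k + rank)` per document. The function name, the use of string ids, and the constant `k = 60` in the usage note are assumptions for illustration; the source does not state which constant or data types sift uses.

```rust
use std::collections::HashMap;

/// Fuse ranked lists of document ids with Reciprocal Rank Fusion.
/// Each inner list is ordered best-first; a larger `k` flattens the
/// advantage of top-ranked documents.
pub fn reciprocal_rank_fusion(rankings: &[Vec<&str>], k: f64) -> Vec<(String, f64)> {
    let mut scores: HashMap<&str, f64> = HashMap::new();
    for ranking in rankings {
        for (idx, &doc) in ranking.iter().enumerate() {
            // Ranks are 1-based: the top hit contributes 1 / (k + 1).
            let rank = idx as f64 + 1.0;
            *scores.entry(doc).or_insert(0.0) += 1.0 / (k + rank);
        }
    }
    let mut fused: Vec<(String, f64)> = scores
        .into_iter()
        .map(|(doc, score)| (doc.to_string(), score))
        .collect();
    // Highest score first; ties are broken by id so the output is deterministic.
    fused.sort_by(|a, b| b.1.total_cmp(&a.1).then_with(|| a.0.cmp(&b.0)));
    fused
}
```

For example, fusing a lexical ranking `["a", "b", "c"]` with a semantic ranking `["b", "c", "a"]` at `k = 60.0` places `b` first, because it sits near the top of both lists, followed by `a` and then `c`. RRF only needs ranks, not scores, which is why it can combine retrievers whose raw scores are not comparable.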

Performance & Scalability

sift is optimized for speed without sacrificing its stateless UX:

Zero-Inference Search: On repeated queries, neural network inference is bypassed entirely by loading pre-computed embeddings and reusing query embeddings from the session cache.
SIMD Acceleration: Vector similarity calculations use hardware-specific SIMD instructions for a ~7x speedup over standard scalar implementations.
Mapped I/O: Document retrieval from the global cache uses mmap to minimize system call overhead and leverage OS-level paging.
Fast Path Heuristics: Filesystem metadata checks happen in microseconds, allowing sift to skip hashing for unchanged files.
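
To make the vector-similarity point concrete, here is a small dot-product sketch written in a shape that compilers can often auto-vectorize: fixed-width chunks with independent accumulators. It is not sift's kernel and makes no claim about the speedup quoted above; the function name and the lane width of 8 are assumptions, and whether SIMD instructions are actually emitted depends on the target and optimization level.

```rust
/// Dot product of two equal-length vectors using eight independent
/// accumulators, so the inner loop has no cross-iteration dependency.
pub fn dot(a: &[f32], b: &[f32]) -> f32 {
    assert_eq!(a.len(), b.len(), "vectors must have equal length");
    const LANES: usize = 8; // assumed width, e.g. eight f32 values in 256 bits

    let mut acc = [0.0f32; LANES];
    let mut a_chunks = a.chunks_exact(LANES);
    let mut b_chunks = b.chunks_exact(LANES);

    // Main loop: process full chunks lane by lane.
    for (ca, cb) in a_chunks.by_ref().zip(b_chunks.by_ref()) {
        for i in 0..LANES {
            acc[i] += ca[i] * cb[i];
        }
    }

    // Tail: elements that did not fill a whole chunk.
    let tail: f32 = a_chunks
        .remainder()
        .iter()
        .zip(b_chunks.remainder())
        .map(|(x, y)| x * y)
        .sum();

    acc.iter().sum::<f32>() + tail
}
```

When document embeddings are already stored in the cache and the query embedding is reused, ranking a corpus reduces to calling a function like this once per document vector, which is what "search at dot-product speeds" amounts to.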

Search Examples

The default strategy (champion alias page-index-hybrid) is used automatically:

sift search tests/fixtures/rich-docs "architecture decision"

Force a specific strategy or override components:

# Lexical only (no vectors)
sift search --strategy page-index "service catalog"

# Semantic only (vectors)
sift search --strategy vector "architecture"

# Custom mix
sift search --retrievers bm25,vector --reranking none "query"

Verbose Mode

Trace the pipeline and timings at different levels:

-v: Phase timings (Loading, Retrieval, etc.)
-vv: Detailed retriever timings and cache hit/miss traces.
-vvv: Granular internal scoring data.

Configuration & Customization

CONFIGURATION.md: Guide to sift.toml, available strategies, and environment variables.
EVALUATION.md: How to manage datasets and run quality/latency evaluations.
ARCHITECTURE.md: Deep dive into the hexagonal design and asset pipeline.

License

This project is licensed under the MIT License.
