Skip to content
View sipemu's full-sized avatar

Highlights

  • Pro

Block or report sipemu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sipemu/README.md

Simon Müller

Software Architect | Data Scientist | Ph.D. Mathematics

I design high-performance data systems in Rust and C++ and apply statistical and machine-learning methods to industrial planning problems. My work spans DuckDB extensions, time-series forecasting engines, and GenAI infrastructure (RAG, MCP servers, foundation model inference).

LinkedInCrates.ioDataZooDE


What I Do

  • Time-Series Forecasting -- Hierarchical, probabilistic, and intermittent-demand forecasting for supply chains, built as native DuckDB extensions.
  • Statistical Computing -- Production-grade regression, hypothesis testing, and causal inference in Rust, exposed through DuckDB and Polars.
  • GenAI & RAG Infrastructure -- Vector databases (HNSW/DiskANN), retrieval-augmented generation, and Model Context Protocol (MCP) servers for AI-assisted development.
  • Foundation Model Inference -- Pure-Rust inference engines for time-series models, targeting edge and WASM deployment without Python dependencies.
  • Enterprise Data Integration -- DuckDB extensions for SAP and API ecosystems, bridging legacy ERP systems with modern analytical workflows.
  • Inventory & Supply Chain Optimisation -- Stochastic inventory models and demand planning applications.

Featured Projects

Project Domain Highlight Stack
flapi API / AI Agents DuckDB-powered API gateway with MCP server and VS Code extension C++, DuckDB
erpl Enterprise Integration DuckDB extension bridging SAP systems via RFC C++, DuckDB
anofox-forecast Time-Series Forecasting 2,900x faster than statsmodels; DuckDB community extension C++, Rust, DuckDB
anofox-tabular Data Quality Anomaly detection, validation, and data preparation in DuckDB C++, DuckDB
polars-statistics Statistical Computing High-performance statistical testing and regression for Polars Rust, Python
Magpie GenAI / RAG Vector DB and RAG engine with HNSW, hybrid retrieval, AST-aware chunking Rust
Chronos-2 ML Inference Pure-Rust re-implementation of Amazon's Chronos-2 time-series foundation model Rust, Candle
Inventory Optimisation Supply Chain Stochastic inventory models for demand planning Rust
fdars Functional Data Analysis FDA algorithms -- depth measures, clustering, smoothing, regression Rust, R
dbt-lineage-viewer Data Engineering Fast CLI for visualising dbt model lineage Rust

flapi and erpl are DataZooDE projects.


Tech Stack

Core: Rust C++ Python R

Data & ML: DuckDB Polars Apache Arrow

Infrastructure: Docker AWS GitHub Actions


Pinned Loading

  1. DataZooDE/anofox-statistics DataZooDE/anofox-statistics Public

    A DuckDB extension for statistical regression analysis, providing OLS, Ridge, WLS, and time-series regression capabilities with full diagnostics and inference directly in SQL.

    Rust 7 1

  2. DataZooDE/anofox-forecast DataZooDE/anofox-forecast Public

    Statistical timeseries forecasting in DuckDB

    C++ 25 2

  3. anofox-statistics-rs anofox-statistics-rs Public

    Statistical tests in Rust

    Rust 3

  4. anofox-regression anofox-regression Public

    Regression analysis in Rust.

    Rust 3

  5. anofox-forecast anofox-forecast Public

    Timeseries forecasting in Rust

    Rust 3

  6. fdars fdars Public

    Functional Data Analysis in R and Rust - High-performance FDA algorithms including depth measures, metrics, clustering, smoothing, and regression

    Rust