Skip to content
View nitesht2's full-sized avatar

Highlights

  • Pro

Block or report nitesht2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
nitesht2/README.md

Hi, I'm Nitesh Thapa 👋

Data and Analytics Engineer | Data Engineer | Building production data pipelines and AI-powered analytics

At Living Spaces, I run 30+ automated workflows, build BigQuery data models, and integrate cross-platform analytics (Adobe Analytics, Placer.ai) into leadership-level insights across 40+ retail locations.

Open to Analytics Engineer, Senior Data Analyst, Data Engineer, and AI/Automation Engineer roles.

LinkedIn Twitter Email


🛠️ Tech Stack

Domain Tools
Data Engineering BigQuery, dbt, Airflow, Airbyte, Cloud Data Fusion, PostgreSQL, Alteryx
Cloud GCP (BigQuery, GCS, Cloud Data Fusion, Cloud Composer)
Languages Python, SQL, JavaScript/React
BI & Analytics Adobe Analytics, Placer.ai, Tableau
AI / LLM Claude API, OpenAI API, Gemini, Ollama, DeepSeek, LangChain, CrewAI
Automation Playwright, Scrapling, FFmpeg, MoviePy

Python SQL BigQuery dbt Airflow GCP Anthropic


📊 Featured Projects

Project What It Does Stack
gcp-secure-employee-pipeline Production GCP pipeline with PII protection (salary masking, SHA-256 hashing), daily Airflow orchestration, and Tableau dashboards Python, GCS, Cloud Data Fusion, BigQuery, Airflow, Tableau
crypto-etl-pipeline End-to-end ETL extracting live crypto data from CoinGecko, transforming with pandas, loading into PostgreSQL with Airflow scheduling Python, PostgreSQL, Airflow, pandas
bigstar-data-pipeline End-to-end ELT pipeline: Postgres → Airbyte → BigQuery → dbt with full data modeling layer Postgres, Airbyte, BigQuery, dbt
second-brain-ai Always-on AI agent that reads articles, videos, and tweets into a self-organizing Obsidian wiki you query in plain English — runs 24/7 on a VPS for ~$8–15/mo Python, Hermes Agent, DeepSeek, Obsidian
marketing-intelligence-agent Web-based AI analyst answering natural-language questions about campaigns, stores, and demographics — grounded in data, never invents numbers Node.js, Express, Claude API
yt-researcher Python CLI that fetches YouTube transcripts as clean Markdown/JSON. Zero API keys, one dependency Python, argparse

Also built: baagchal — a browser PWA of the traditional Nepali strategy board game (play it live).


💼 What I Do at Work

  • Analytics Engineering — build and maintain BigQuery data models and Alteryx workflows powering cross-functional dashboards
  • Data Analysis — surface insights from Adobe Analytics, Placer.ai foot-traffic data, and campaign spend for leadership decisions
  • Pipeline Automation — manage 30+ automated data pipelines processing daily feeds across retail operations
  • AI Proof-of-Concept — building a Claude-powered analytics tool that surfaces declining-market insights for leadership

🚀 Also Building

Beyond open-source, I run several private projects: an algorithmic trading bot (EMA-cloud strategy, paper trading via Alpaca), a multi-agent AI orchestration system (research + content + analysis agents), and AI content pipelines that have published 25+ automated videos. Happy to walk through any of them.

Pinned Loading

  1. baagchal baagchal Public

    Browser-based PWA of the traditional Nepali strategy board game (4 Tigers vs 20 Goats). Play online!

    HTML

  2. bigstar-data-pipeline bigstar-data-pipeline Public

    End-to-end ELT pipeline: Postgres → Airbyte → BigQuery → dbt

  3. crypto-etl-pipeline crypto-etl-pipeline Public

    An end-to-end ETL pipeline to extract, transform, and load cryptocurrency data using Python, PostgreSQL, and Airflow.

    Python

  4. gcp-secure-employee-pipeline gcp-secure-employee-pipeline Public

    Fully automated GCP pipeline: Python → GCS → Cloud Data Fusion (salary masking + SHA-256 password hashing) → BigQuery → Tableau | Daily Airflow orchestration

    Python

  5. marketing-intelligence-agent marketing-intelligence-agent Public

    AI-powered marketing analytics agent with real-time campaign insights and competitive intelligence

    JavaScript

  6. yt-researcher yt-researcher Public

    Python CLI tool that fetches YouTube video transcripts and outputs them as clean Markdown and JSON files. Zero API keys. One dependency.

    Python