ohsono/README.md

Hi, I'm Hochan Son

Datastore, SRE, DevOps Engineer & ML Practitioner based in Los Angeles, CA.

I build data infrastructure, ML pipelines, and distributed systems. My background spans AdTech, entertainment, and enterprise, from MySpace and Hallmark Labs to Branch.io and ADP, with graduate work at the UCLA Trustworthy AI Lab.

Areas of Focus

  • Synthetic data generation (Transformer, VAE, and diffusion models) & privacy-preserving ML
  • Moving legacy data operations to SRE & DevOps practices that scale on cloud-native infrastructure
  • Large-scale data/ML pipelines (MLflow, Kafka, LMDB, distributed training)
  • Local LLM inference & serving (CUDA, MLX, RDMA, vLLM, Ollama)
  • Large-scale database engineering (RDBMS, NoSQL, and Distributed SQL)
  • CI/CD & containerization for ML workflows (Docker, Kubernetes, GitHub Actions)

Tech Stack

  • Languages: C, Python, SQL, Go, Bash, Java
  • ML/AI: PyTorch, Diffusion, Variational Autoencoder (VAE), vLLM, MLX, MCP, Agents
  • Data: Kafka, SQLite3, LMDB, PostgreSQL, MySQL, MS SQL Server, ProxySQL, Datadog
  • Infra: Kubernetes, Docker, HPC (Distributed GPU Training), GCP, AWS
  • CI/CD: GitHub Actions, ArgoCD

Education

  • UCLA - Master of Applied Statistics and Data Science (MASDS)
  • University at Buffalo - B.S., Computer Science & Engineering

Publication

ICLR 2026 - DeLTA, accepted (poster)

Connect

LinkedIn GitHub

Pinned

  1. KPA (Public)

    Kafka-Python-Admin

    Python 1 1

  2. SentimentAnalysis (Public)

    STATS-418 Final Project: Sentiment Analysis in UCLA

    Python 1

  3. synthcity (Public)

    Forked from vanderschaarlab/synthcity

    A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.

    Python 1

  4. CI-Pathway-exercise (Public)

    2025 NCSA CI Pathway exercise project

    HTML 1

  5. stats413 (Public)

    UCLA MASDS stats413 course repo

    Jupyter Notebook

  6. vllm (Public)

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python