  • aivancity
  • France
Ganebang/README.md

Hi 👋 I'm Amala,

I’m a Data Engineer focused on designing and building robust, scalable data pipelines. I have a strong interest in cloud infrastructure, distributed systems, and the architectures that power machine learning workloads. I’m currently completing a Master’s degree in Data Engineering & Cloud Computing at aivancity Paris – School for Technology, Business & Society. The program lets me combine solid computer science fundamentals with hands‑on experience from real-world data engineering projects and training sessions.

🚀 About Me

  • 🔭 I’m currently working on: End‑to‑end data pipelines, ETL/ELT optimization, and cloud‑native architectures.
  • 🌱 I’m currently learning: Advanced orchestration, distributed compute optimization, and ML‑ready data architectures.
  • 👯 I’m looking to collaborate on: Open‑source data engineering tools and scalable data platform components.
  • 💬 Ask me about: Data engineering, cloud architecture, PySpark, Databricks, Airflow, and scalable systems.
  • 📫 How to reach me: ganekwada@gmail.com
  • 😄 Pronouns: He/Him
  • Fun fact: I enjoy transforming messy datasets into elegant, automated pipelines.

🛠️ Tech Stack & Tools

Data Engineering

Python, PySpark, Scala, SQL

Cloud & Big Data

Databricks, Azure

Orchestration

Airflow

Containers & DevOps

Docker, Terraform, Git, GitHub

Data Modeling & Transformation

dbt

Pinned

  1. Crypto-pipeline (Public)

    Python

  2. fabric-wind-power-analytics (Public)

    Git repository for the wind power analytics project workspace on Microsoft Fabric

    Python

  3. Green-AI (Public)

    Green AI: Sustainable Medical Summarization

    Jupyter Notebook

  4. realtime-api-kafka-spark-dashboard-Amala (Public)

    This project involves designing and implementing a complete real-time data processing platform fed by a stream from an HTTP API.

    Python