Skip to content
View abdalla92's full-sized avatar

Block or report abdalla92

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
abdalla92/README.md

Hi there πŸ‘‹

Welcome to my GitHub profile! I'm Abdullahi Osman, a passionate Data Engineer and Cloud Technologies Enthusiast dedicated to building scalable, efficient data solutions that drive business value.


πŸš€ About Me

  • πŸ“ Location: Toronto, Canada
  • πŸ› οΈ Profession: Data Engineer & Cloud Enthusiast
  • πŸ—οΈ Expertise: Designing and implementing robust ETL pipelines, data warehouses, and analytics platforms
  • πŸ’» Tech Stack: Python β€’ SQL β€’ Apache Spark β€’ Airflow β€’ Snowflake β€’ Oracle β€’ MongoDB β€’ Cassandra
  • πŸ€– Interests: Machine learning and AI integrations in data pipelines
  • 🌱 Philosophy: Active open-source collaborator and continuous learner

πŸ› οΈ Technical Skillset

Programming & Languages

  • Python | SQL | PL/SQL | Shell Scripting | PySpark

Data Engineering & Platforms

  • ETL/ELT | Data Warehousing | Snowflake | Apache Spark | Apache Airflow | Kafka

Databases & Storage

  • Relational: Oracle, SQL Server, Snowflake, PostgreSQL, MySQL
  • NoSQL: MongoDB, Cassandra
  • Cloud Storage: AWS S3, Azure Blob Storage

Cloud & Infrastructure

  • Platforms: Azure | AWS | Snowflake | Databricks | IBM Cloud | OCI
  • Tools: Docker | Git | Terraform | CI/CD | Ansible

Analytics & Visualization

  • Power BI | Looker Studio | Cognos Analytics | Tableau

πŸ“ˆ GitHub Statistics

Metric Count
πŸ“¦ Repositories 29+
🌟 Starred Projects 3+
πŸ‘¨β€πŸ’» Open Source Contributions Active
πŸ”¬ Technologies Explored 15+

πŸ“š Featured Projects

1. Snowflake Data Pipeline

Building scalable end-to-end data pipelines with Snowflake

  • ✨ JSON ingestion from S3 via Snowpipe
  • πŸ”„ CDC streams + incremental MERGE operations
  • 🌍 Geolocation enrichment with timezone conversions
  • Tech: Snowflake | S3 | CDC Streams | SQL

2. Machine Learning with Apache Spark

Applying ML models at scale using Apache Spark

  • πŸ€– Classification and regression pipelines
  • πŸ“Š Feature engineering and optimization
  • Tech: PySpark | MLlib | Python

3. BI Dashboards

Visual analytics on car sales data

  • πŸ“ˆ Interactive dashboards for business insights
  • 🎯 KPI tracking and trend analysis
  • Tech: Cognos Analytics | Looker Studio | SQL

4. Streaming ETL Pipeline with Kafka

Real-time data streaming and processing

  • ⚑ Event-driven architecture
  • πŸ”„ Kafka producers/consumers
  • πŸ’Ύ Incremental data loading
  • Tech: Apache Kafka | Python | SQL

πŸ“« Get in Touch

Let's collaborate on impactful data solutions! Feel free to reach out:

Popular repositories Loading

  1. Abdalla Abdalla Public

  2. Creative-programing--PeerAssessment--1- Creative-programing--PeerAssessment--1- Public

    Creative programming of Digital Media and Mobile Apps. (Peer Assessment-1)

    1

  3. E-commerce-Lakehouse-on-Microsoft-Fabric E-commerce-Lakehouse-on-Microsoft-Fabric Public

    Forked from Anshul3773/E-commerce-Lakehouse-on-Microsoft-Fabric

    An end-to-end data Lakehouse solution using Microsoft Fabric, implementing Medallion architecture for an e-commerce dataset. The project involved data ingestion, dimensional modeling, and generatin…

    Jupyter Notebook

  4. Machine-Learning-Project-Handwritten-Digit-Classification Machine-Learning-Project-Handwritten-Digit-Classification Public

    This project successfully demonstrates the application of various machine learning models to the task of handwritten digit classification

    Jupyter Notebook

  5. Python-Project-for-Data-Engineering Python-Project-for-Data-Engineering Public

    Python Project for Data Engineering: Acquiring and processing information on world's largest banks

    Python

  6. RDBMS-Database-Design-and-Implementation RDBMS-Database-Design-and-Implementation Public

    This project aims to design relational database systems for improved operational efficiencies and to make it easier for executives to make data-driven decisions.