Results-driven Data Engineer with 3+ years of experience in designing and optimizing data pipelines, ETL workflows, and data models across cloud and on-prem environments. Adept in SQL, Python, PySpark, and Azure ecosystem, with hands-on exposure to Airflow, Kafka, and NoSQL systems. Proven track record in delivering scalable data solutions, improving system performance, and supporting business intelligence initiatives. Known for taking ownership, collaborating with stakeholders, and building reusable, high-quality data assets that align with enterprise analytics goals.
Python
SQL
PySpark
Pandas
MySQL
Git
Azure Data Factory
Azure Databricks
Kafka
Airflow
Power BI
Excel
Tableau
Cognos
ETL Pipelines
Dimensional Modeling
Data Warehousing
Data Virtualization
MongoDB
Cassandra
Docker
Machine Learning Basics
Cohort Analysis