Data Engineer and Big Data Architect with 7+ years of experience designing and delivering scalable, high-performance data pipelines across AWS, GCP, and Azure. Specialized in real-time stream processing, data lake architecture, and cloud-native data engineering using tools such as AWS Glue, EMR, Redshift, Kinesis, Kafka, PySpark, and Lake Formation. Proven success in re-architecting legacy systems, optimizing performance, and delivering secure, reliable data insights across industries including Finance, Hospitality, Retail, SaaS, and Energy.
AWS Glue
EMR
Lambda
S3
Redshift
Kinesis
Apache Spark with Scala
PySpark
Kafka
Hive
Tez
NiFi
Shell Scripting
PostgreSQL
MySQL