Summary
Overview
Work History
Education
Skills
Timeline
Generic

Siddhesh Dushi

Pune

Summary

Seeking a dynamic Data Engineer role with 4+ years of experience to contribute expertise in data pipeline development, database management. Committed to driving innovation and fostering a culture of excellence in a progressive organization.

Overview

4
4
years of professional experience

Work History

Senior Software Engineer

Kantar Analytics
02.2025 - 04.2025
  • Developed and optimized complex SQL queries to extract data based on client specific requirements.
  • Loaded data from production tables to non-production environment tables to support development and testing activities.
  • Prepared SQL queries to perform daily volume checks on data ingested into the raw and gold tables.

Data Engineer

IRIS Software
03.2024 - 02.2025
  • Designed ETL pipelines using AWS Glue for processing sales data and transforming raw JSON files into parquet format.
  • Automated data ingestion from AWS S3 and implemented data transforming workflows using PySpark.
  • Managed Redshift Data Warehouse, optimizing schema, distribution keys and queries for performance.
  • Monitored data pipelines using AWS Cloud Watch and AWS SNS for failure alerts.
  • Processed and validated large-scale datasets, ensuring scalability and reliability with AWS services.
  • Partnered with BI teams to create custom queries for actionable insights and data-driven decision-making.

AWS Data Engineer

Unosis IT Solutions
01.2021 - 03.2024

Retail data mart

  • Designed data pipeline using AWS Glue to clean, transform and load raw JSON files from S3 into snowflake via Snowpipe.
  • Performed data cleaning, null handling and aggregations using PySpark, ensuring high quality, analysis-ready for retail insights.
  • Designed Snowflake tables in collaboration with analyst and optimized complex SQL queries for efficient data retrieval and reporting.
  • Maintained clear documentation of PySpark logic, SQL queries and transformation rules to support cross functional collaboration and reproducibility.

API Data Extraction

  • Orchestrated seamless TMDB API data retrieval using Python scripts and Terraform.
  • Developing AWS lambda function to process and store data in S3.
  • Optimizing performance and scalability via server less computing.
  • Documenting integration architecture for streamlined knowledge transfer.
  • Fostering data-driven decision making.

Education

Dr. Babasaheb Ambedkar Marathwada University
2021

Skills

  • Python
  • SQL (MySQL, Oracle)
  • Spark (PySpark)
  • AWS Glue
  • AWS S3
  • AWS Lambda
  • AWS Redshift
  • AWS Athena
  • AWS CloudWatch
  • AWS EventBridge
  • Snowflake
  • JIRA

Timeline

Senior Software Engineer

Kantar Analytics
02.2025 - 04.2025

Data Engineer

IRIS Software
03.2024 - 02.2025

AWS Data Engineer

Unosis IT Solutions
01.2021 - 03.2024

Dr. Babasaheb Ambedkar Marathwada University
Siddhesh Dushi