Summary
Overview
Work History
Education
Skills
Websites
Certification
Timeline
Generic
Rajesh Sutar

Rajesh Sutar

Senior Data Engineering Lead | Spark · Scala · Snowflake · AWS· Airflow
PUNE

Summary

Senior Data Engineering Lead with 12+ years of experience architecting large-scale data platforms, Spark–Scala pipelines, Snowflake workloads, and cloud-native systems on AWS. Achieved 30% AWS cost optimization and reduced a major pipeline runtime from 2 hours to 8 minutes. Strong expertise in distributed architecture, workflow automation, and leading high-performing data engineering teams.

Overview

12
12
years of professional experience
5
5
Certifications

Work History

Specialist Software Engineer Lead

Nice Interactive Solutions
01.2022 - Current
  • Architected and optimized Spark–Scala and Snowflake pipelines for high throughput and low latency.
  • Improved a major Spark pipeline from 2 hours to 8 minutes using partition pruning, caching, execution tuning, and storage optimization.
  • Achieved 30% AWS cost optimization through EMR tuning, auto-scaling strategies, and resource right-sizing.
  • Implemented Snowflake performance enhancements, including clustering key design, query tuning, and search optimization.
  • Hands-on experience with the AWS ecosystem: ECS for containerized job execution and service orchestration, EMR for large-scale Spark processing and automated cluster lifecycle, Lambda for serverless automation, metadata jobs, and workflow triggers, and a basic understanding of EKS for container orchestration and future platform migrations.
  • Led a 7-member engineering team, driving code quality, reviews, mentoring, and onboarding.
  • Owned release planning, sprint strategy, and R&D roadmap, ensuring predictable delivery, and technical governance.

Consultant – Senior Data Engineer (Team Lead)

Atos
07.2021 - 01.2022
  • Led Oracle → Cassandra migration using Spark–Scala.
  • Designed validation framework ensuring near-zero downtime.
  • Managed executors, cluster memory, and performance tuning.
  • Guided offshore team and code quality reviews.

Senior Software Development Engineer – Big Data

Emtec Technologies
06.2020 - 02.2021
  • Designed AWS-based pipelines and automated EMR lifecycle.
  • Built API Gateway + Lambda workflows.
  • Created Terraform modules for automated provisioning.

Big Data Engineer – Spark/Scala

Cognizant Technology Solutions
07.2019 - 01.2020
  • Developed Spark-based ingestion and analytics pipelines.
  • Prepared deployment documentation and handled hypercare support.

Team Lead – Big Data Developer (Spark/Scala/Java)

Synechron Technologies
05.2016 - 07.2019
  • Led Spark–Scala development team.
  • Developed Kafka + Spark Streaming pipelines.
  • Managed deployments via TeamCity + Git.

Software Engineer

Earlier Roles
09.2013 - 05.2016

Collabera (Veritas) – Associate SQA Engineer — Python automation

BalaSai Net – Technical Support Engineer — Linux server management & hosting infra

Apostle InfoTech – Sr. Java Developer — Struts/Hibernate-based J2EE apps, team lead

Education

MCA - Technology Management

Pune University
Pune
01.2013

B.Sc. IT - undefined

North Maharashtra University
01.2010

Skills

Big Data: Spark, Scala, Python, Hive, Kafka

Certification

Cloudera Certified Data Engineer (CCPDE)

Timeline

Specialist Software Engineer Lead

Nice Interactive Solutions
01.2022 - Current

Consultant – Senior Data Engineer (Team Lead)

Atos
07.2021 - 01.2022

Senior Software Development Engineer – Big Data

Emtec Technologies
06.2020 - 02.2021

Big Data Engineer – Spark/Scala

Cognizant Technology Solutions
07.2019 - 01.2020

Team Lead – Big Data Developer (Spark/Scala/Java)

Synechron Technologies
05.2016 - 07.2019

Software Engineer

Earlier Roles
09.2013 - 05.2016

B.Sc. IT - undefined

North Maharashtra University

MCA - Technology Management

Pune University
Rajesh SutarSenior Data Engineering Lead | Spark · Scala · Snowflake · AWS· Airflow