Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Nikhil Pasupuleti

Big Data Engineer
Pune

Summary

Accomplished Data Engineer with expertise in Databricks and Snowflake, driving scalable solutions at TCS. Developed metadata-driven products and robust pipelines, enhancing data integrity and storage. Proficient in PySpark, SQL, and Python. Excel in problem-solving and collaboration, leveraging Azure and AWS for impactful data orchestration and management across platforms.

Overview

1
1
Certification
2
2
years of professional experience

Work History

Databricks Data Engineer

TCS – GSK
01.2026 - Current
  • Developed a metadata data product in Databricks using a modular Python architecture, enabling scalable processing across Bronze, Silver, and Gold layers.
  • Built reusable PySpark components to support both SCD Type 2 and non-SCD data loading patterns, reducing code duplication, and improving maintainability.
  • Developed robust data transformation pipelines using Databricks, handling create, update, delete, and rename scenarios through SCD Type 2 historical tracking.
  • Created comprehensive unit tests using pytest to validate business logic, transformation rules, and metadata processing workflows, improving code quality and reliability.
  • Integrated Databricks workloads with Azure Data Factory (ADF) pipelines for automated end-to-end orchestration, scheduling, and monitoring of data processing jobs.

Snowflake Data Engineer

TCS – McDonald's Japan
06.2025 - 10.2025
  • Implemented Snowflake stored procedures to dynamically load files from AWS S3 and Azure ADLS based on metadata-driven configurations stored in configuration tables.
  • Built error handling, audit logging, and email notifications for load failures.
  • Used Snowpark (Python) and DataComPy to perform automated data validation between staged files and tables.

Databricks Data Engineer

TCS - Mcdonald's Japan
01.2025 - 05.2025
  • Developed and maintained full and incremental data export pipelines from Databricks to Google Cloud Storage (GCS).
  • Implemented monitoring and validation mechanisms to detect export failures and data discrepancies.

Databricks Developer

TCS - NTT Data
01.2024 - 12.2024
  • Worked on migrating SAS-based data pipelines to Databricks for improved performance and maintainability.
  • Developed and optimized PySpark scripts for data transformation and ETL workloads.
  • Fixed bugs in the existing codebase.

Education

B.tech - CSE

JNTUH
07.2023

Skills

Proficient in Databricks and Snowflake

Experience with Azure Data Factory (ADF)

Proficient in PySpark, SQL, Python, and Delta Lake

Data build tool(DBT) expertise

Certification

Databricks Data Engineer associate

Timeline

Databricks Data Engineer

TCS – GSK
01.2026 - Current

Snowflake Data Engineer

TCS – McDonald's Japan
06.2025 - 10.2025

Databricks Data Engineer

TCS - Mcdonald's Japan
01.2025 - 05.2025

Databricks Developer

TCS - NTT Data
01.2024 - 12.2024

B.tech - CSE

JNTUH
Nikhil PasupuletiBig Data Engineer