Summary
Overview
Work History
Education
Skills
Accomplishments
Timeline
Generic
Arshil Shaikh

Arshil Shaikh

Cloud Data Engineer
Kalyan

Summary

Results-driven GCP Data Engineer with 5 years of experience designing, developing, and optimizing cloud-based data solutions. Expertise in building scalable ETL pipelines, data migration, and analytics platforms leveraging the full Google Cloud Platform ecosystem, including BigQuery, Dataflow, Dataproc, Cloud Composer, and Looker. Skilled in BigQuery administration, query optimization, and cost management to ensure high-performance and efficient resource utilization. Proven success in migrating complex on-prem and multi-cloud workloads to GCP, enabling faster insights, improved data quality, and business agility within Agile environments. Adept at collaborating with cross-functional teams to deliver actionable insights through data visualization and reporting.

Overview

5
5
years of professional experience

Work History

Senior App Developer - GCP Data engineer

Telus Digital, India
05.2024 - Current
  • Led migration of complex on-prem SAS and SQL Server pipelines to Google Cloud Platform, ensuring seamless transition without business disruption.
  • Extracted and processed large datasets from Oracle and SQL Server using PySpark on Dataproc Serverless, optimizing ingestion and transformation processes.
  • Converted legacy SAS workflows into BigQuery stored procedures, implementing multi-step transformations (CLASSIFY, RGU_SUMMARY, STAGE, FINAL) to deliver accurate segmentation, product indicators, and eligibility flags.
  • Designed and implemented a Business Address Universe platform in BigQuery, replacing legacy systems and improving marketing segmentation accuracy by 20%.
  • Developed and deployed Looker dashboards for executive and campaign reporting, creating calculated fields, dynamic filters, and optimized queries to enhance data visualization and decision-making.
  • Automated campaign pipelines, including Silent Roamers EDM migration, with Cloud Workflows, Cloud Scheduler, and GCS-to-SFMC SFTP integration, reducing campaign execution time by 40%.
  • Implemented Infrastructure as Code with Pulumi and managed deployments via GitHub, ensuring consistent and rapid provisioning of GCP resources.
  • Collaborated with BI analysts, marketing teams, and business stakeholders to validate datasets and ensure 99.9% data accuracy across platforms.

Senior Software Engineer - Data

Persistent System Limited, Pune
06.2022 - 05.2024
  • Led migration of enterprise datasets from on-prem Oracle to BigQuery, leveraging Dataflow with JDBC connectors for efficient high-volume ingestion.
  • Translated legacy Informatica mappings into BigQuery SQL transformations, ensuring seamless data integration and alignment with warehouse standards.
  • Managed BigQuery job administration, including monitoring, resource utilization control, query optimization, and proactive termination of inefficient jobs, reducing cost by X%.
  • Migrated AWS Glue ETL pipelines to GCP Cloud Composer (Airflow), redesigning workflows for improved reliability and reduced execution time.
  • Collaborated with cross-functional teams to validate migrated datasets and maintain high data quality during platform transition.

Engineer I

Datametica Solution Pvt. Ltd., Pune
10.2020 - 06.2022
  • Managed ingestion of large .PSV datasets into BigQuery, ensuring smooth, reliable, and error-free data loading.
  • Developed scalable Dataflow pipelines to extract, transform, and load data from multiple source systems, improving processing efficiency and reducing manual intervention.
  • Orchestrated end-to-end ETL workflows using Python-based Airflow DAGs (Cloud Composer) for automated and seamless pipeline execution.
  • Authored complex BigQuery SQL transformations leveraging advanced joins, window functions, and nested queries to meet diverse business requirements.
  • Optimized query performance and storage usage, reducing overall BigQuery costs while improving SLA compliance.

Education

Bachelor of Engineering (B.E) - Computer Engineering

Mumbai University
10-2020

Skills

☁ Cloud Platforms: Google Cloud Platform (GCP) BigQuery (Development & Administration) Dataproc Dataflow Cloud Composer / Airflow Cloud Workflows Cloud Scheduler

💻 Programming & Scripting: Python PySpark SQL

📊 Data Engineering & Analytics: ETL/ELT Development Data Modeling Data Migration Performance Optimization Looker Dashboard Development

🛠 Infrastructure & Version Control: Pulumi Git Bitbucket

🗄 Databases: Oracle SQL Server

Accomplishments

  • Google Cloud Certified – Professional Data Engineer
  • Google Cloud Certified – Cloud Digital Leader
  • Rising Star Award (x2) - Datametica Solution Pvt. Ltd
  • Sports Award – Datametica Solution Pvt. Ltd
  • Superstar Award – TELUS International
  • Kudos Award – TELUS International

Timeline

Senior App Developer - GCP Data engineer

Telus Digital, India
05.2024 - Current

Senior Software Engineer - Data

Persistent System Limited, Pune
06.2022 - 05.2024

Engineer I

Datametica Solution Pvt. Ltd., Pune
10.2020 - 06.2022

Bachelor of Engineering (B.E) - Computer Engineering

Mumbai University
Arshil ShaikhCloud Data Engineer