Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

OJAS BADWAIK

Pune

Summary

A certified Data Architect and Engineer with over 12 years of experience, including six years building and modernizing cloud-native, scalable data platforms and pipelines. I have designed and delivered robust ETL/ELT workflows, real-time streaming systems, and large-scale data lakes on AWS, Azure, Databricks, and Snowflake—leveraging tools like Glue, Redshift, Kafka, Spark, and Airflow. I’ve led end-to-end cloud migration and modernization projects using Infrastructure-as-Code (Terraform, CloudFormation) and CI/CD pipelines (GitHub Actions, Jenkins, AWS CodePipeline), ensuring reproducibility, security, and cost efficiency. My expertise spans relational and NoSQL databases (Oracle, PostgreSQL, MySQL, DynamoDB) with automated validation and optimized OLTP/OLAP data models. I recently integrated Generative AI and vector databases into data pipelines to enrich data and drive AI-powered insights, using RAG techniques and industry-leading cloud-native monitoring and security tools. Known for aligning technical roadmaps with business goals, I mentor teams, lead design reviews, and take full ownership from solution design through deployment to deliver resilient, future-proof systems.

Overview

13
13
years of professional experience
1
1
Certification

Work History

Key Developer

EPAM Systems
04.2024 - Current
  • Client: Novartis Institutes for Biomedical Research, Inc
  • Client Domain: Life Sciences & Healthcare
  • Project Description: Data42 is a major digital transformation project by Novartis aimed at unlocking the power of data to improve how medicines are discovered and delivered.
  • Responsibilities:
  • Built scalable ingestion pipelines for clinical, genomic, RWD and EHR data
  • Integrated datasets from trial-management systems and public-health sources
  • Developed ETL/ELT workflows in Apache Spark, Palantir Foundry, Azure Databricks and Synapse Analytics, reducing analytics setup time from 14 days to 1 day
  • Architected data lake zones in Azure Data Lake Storage Gen2 (raw → enriched → curated) and automated ETL/ELT using Synapse pipelines and Databricks.
  • Automated data transforms and validations for downstream ML/analytics
  • Implemented Great Expectations checks for schema, deduplication & anomalies
  • Managed fine-grained RBAC, lineage and ontology in Foundry Code Workbook
  • Optimized Spark jobs with partitioning, caching, AQE and cluster tuning
  • Partnered with data scientists & clinicians to deliver analysis-ready datasets
  • Enforced HIPAA/GDPR/GxP compliance with auditing, masking & encryption, enterprise-grade security with Azure AD, Key Vault, RBAC, and Azure Policy.
  • Enforced data governance with Microsoft Purview and Entra ID RBAC, ensuring end-to-end lineage, data cataloging


Associate Specialist

Synechron Technologies Pvt. LTD
07.2019 - 12.2023
  • Client: Asurion
  • Client Domain: Insurance
  • Project Description:
  • Migrating Legacy Big Data Application from On Premises to AWS Cloud
  • Creating a new platform for Data Quality Check
  • Responsibilities:
  • Migrated legacy on-prem Tibco ETL jobs to an open-source framework on AWS, enhancing scalability and reducing costs
  • Led a team of four, coordinating with business stakeholders to gather requirements and translate them into actionable Big Data solutions
  • Documented requirements and delegated tasks, ensuring clear ownership and timely delivery
  • Managed Agile ceremonies, including daily scrums, sprint planning, and sprint reviews
  • Designed and implemented data pipelines using AWS Glue, Spark (Scala), PySpark, and Pandas to ingest and process flat files
  • Developed CI/CD pipelines in AWS (CodePipeline, CodeBuild) for automated, reliable deployments
  • Executed database migrations to Amazon Aurora and RDS, performing impact analysis and defining migration strategies
  • Architected cloud-native data solutions and data models for new AWS frameworks, selecting appropriate technologies (Redshift, S3, EMR) and delivering proofs-of-concept
  • Established AWS best practices across the Asurion Europe organization, including security, cost management, and governance
  • Collaborated with release and change management teams to streamline production deployments and ensure compliance
  • Proposed and presented cloud migration strategies and technical solutions to stakeholders, fostering trust and alignment
  • Ensured data quality and reliability through rigorous validation, monitoring, and alerting
  • Championed documentation standards and knowledge sharing, improving team efficiency and onboarding
  • Maintained compliance with GDPR and industry regulations by implementing robust security controls and access policies

Senior Consultant

Tibco Technologies Pvt. Ltd
08.2016 - 07.2019
  • Client: Customer: Nielsen
  • Domain: Marketing Analytics
  • Project Description: Creating application for Collecting the Sales Data for different products and clients
  • Responsibilities:
  • Involved In project planning and designing of application
  • Understanding the business needs and functional enhancements
  • Involved in making documents like Design Document (LLD, HLD), Test scenario document
  • Interacted with onshore counterparts for discussions related to Requirement Analysis, Design, Testing
  • Designed and developed various services using TIBCO products
  • Involved in Unit and Integration testing, bug fixing and testing
  • Deployment and Maintenance of BW processes in administrator
  • Production Support of the Application Developed
  • Involved in Making the Country set up automated for Future country rollouts
  • Tasks Performed: (Management)
  • Worked in the Production Support Team to successfully make App live in the Country like China, Taiwan, HongKong into the Production right from the Initial Phase from QA
  • Guided and successfully leaded the team for the Country Enablement

Developer

Wipro Technologies
11.2012 - 08.2016
  • Client: Customer: T-Mobile
  • Client Domain: Telecommunication
  • Project Description: Integrate different services on T-Mobile Platform
  • Responsibilities:
  • Involved In project planning and designing of application
  • Understanding the business needs and functional enhancements
  • Involved in making documents like Design Document (LLD, HLD), Test scenario document
  • Interacted with onshore counterparts for discussions related to Requirement Analysis, Design, Testing
  • Designed and developed various services using TIBCO products
  • Involved in Unit and Integration testing, bug fixing and testing
  • Deployment and Maintenance of BW processes in administrator
  • Production Support of the Application Developed

Education

Bachelor of Engineering - Electronics

Rashtrasant Tukadoji Maharaj Nagpur University
Nagpur, India
01.2008

Skills

  • Big Data Technologies: HDFS, YARN, MapReduce, Apache Spark, Pyspark, Apache Airflow, Apache Kafka
  • AI Technologies: Langchain, OpenAI, Copilot, Huggingface, LangGraph
  • Orchrestration: AWS Datapipeline, Airflow
  • Cloud Technology: AWS, Azure
  • Data Platform: Databricks, Palantir
  • Database: Oracle, MySQL, Postgres, ComosDB, DynamoDB
  • Vector Database: ChromaDB, Pinecone, Oracle 23ai
  • Programming Language: Python
  • SCM Tools: SVN, Git
  • Data Modeling Tools: ERWin, Oracle SQL Developer Data Modeler
  • DevOps Tools: Jenkins, CodePipeline & Github Actions
  • Integration Tool: Tibco
  • Data Warehouse: Redshift, Snowflake, Azure Synapse

Certification

  • Amazon Web Services: AWS Certified AI Practitioner, 2025
  • Amazon Web Services: AWS Certified Data Engineer - Associate, 2025
  • Oracle: Oracle AI Vector Search Certified Professional, 2025
  • Microsoft: Microsoft Certified: Azure AI Engineer Associate (AI-102), 2025
  • Microsoft: Microsoft Certified: Azure Data Engineer Associate (DP-203), 2025
  • Databricks: Databricks Certified Generative AI Engineer Associate, 2025
  • Microsoft: Microsoft Certified: Azure AI Fundamentals (AI-900), 2025
  • Microsoft: Microsoft Certified: Azure Data Fundamentals (DP-900), 2025
  • Microsoft: Microsoft Certified: Azure Fundamentals (AZ-900), 2024
  • Snowflake: SnowPro Core Certification, 2024
  • Databricks: Databricks Certified Data Engineer Professional, 2024
  • EPAM: AI Literacy Program, 2024
  • Databricks: Databricks Certified Data Engineer Associate, 2024
  • Amazon Web Services: AWS Certified Solutions Architect - Professional, 2024
  • Amazon Web Services: AWS Certified DevOps Engineer - Professional, 2024
  • Amazon Web Services: AWS Certified SysOps Administrator - Associate, 2023
  • Amazon Web Services: AWS Certified Developer - Associate, 2023
  • Amazon Web Services: AWS Certified Solutions Architect - Associate, 2023
  • Amazon Web Services: AWS Certified Cloud Practitioner, 2023
  • Databricks: Databricks Certified Associate Developer for Apache Spark 3.0, 2023

Timeline

Key Developer

EPAM Systems
04.2024 - Current

Associate Specialist

Synechron Technologies Pvt. LTD
07.2019 - 12.2023

Senior Consultant

Tibco Technologies Pvt. Ltd
08.2016 - 07.2019

Developer

Wipro Technologies
11.2012 - 08.2016

Bachelor of Engineering - Electronics

Rashtrasant Tukadoji Maharaj Nagpur University
OJAS BADWAIK