
Debabrat Das

Software Engineer
Pune

Summary

Adept Senior Software Engineer with a proven track record at Deutsche Bank Group, enhancing data migration processes using Python, PySpark, and GCP. Spearheaded the SparkMatica project, achieving a seamless transition to BigQuery. Demonstrates exceptional problem-solving skills and a knack for developing innovative solutions in software development and data analytics.

Overview

8 years of professional experience
1 Certification

Work History

Senior Software Engineer

Deutsche Bank Group
09.2023 - Current
  • Worked on SparkMatica, a tool we built for a migration from Oracle Cloud to BigQuery: it takes Informatica mapping XMLs as input, generates PySpark mappings, and executes them on Cloud Dataproc.
  • Worked on Data Sync, a tool we built that uses Airflow and Dataflow Flex Templates to migrate data from Oracle ExaCC to BigQuery.
  • Worked on Jobnet, a project that replicated the Informatica Jobnet structure to trigger jobs in Airflow.
  • Worked extensively with Airflow and PySpark, creating numerous tools and utilities for swift data migration.

Senior Software Engineer

IBM
01.2021 - 09.2023
  • Worked on Semantic Search, a Python app built with Flask that combined AI embeddings and Solr to give users a seamless search experience.
  • Worked on L'Oréal's data lake creation, working extensively with BigQuery and GCP workflows.

Software Engineer

Mindtree
07.2017 - 01.2021

Project: P&G Cloud Data Migration
  • Extracted data from various sources, such as XML files, relational databases, BigQuery, and Segment Personas.
  • Used the BigQuery and GCP bucket SDKs in Python, along with Pandas, to transform unstructured data from these sources into a scalable schema and load it into BigQuery as structured data in newline-delimited JSON format.

Education

B.Tech - Computer Science

University of Technology And Management
04.2001 -

Skills

Python

GCP

PySpark

Airflow

BigQuery

Kafka

Big Data

Dataflow

Certification

Google Certified Professional Data Engineer

Timeline

Senior Software Engineer

Deutsche Bank Group
09.2023 - Current

Google Certified Professional Data Engineer

04-2022

Senior Software Engineer

IBM
01.2021 - 09.2023

Software Engineer

Mindtree
07.2017 - 01.2021

B.Tech - Computer Science

University of Technology And Management
04.2001 -