Adept Senior Software Engineer with a proven track record at Deutsche Bank Group, improving data migration processes using Python, PySpark, and GCP. Spearheaded the SparkMatica project, achieving a seamless transition to BigQuery. Demonstrates strong problem-solving skills and a knack for developing innovative solutions in software development and data analytics.
Project: P&G Cloud Data Migration
Extracted data from a range of sources, including XML files, relational databases, BigQuery, and Segment Personas.
Used the BigQuery and GCP bucket SDKs in Python, together with Pandas, to transform the unstructured data acquired from these sources into a scalable schema and load it into BigQuery as structured data in newline-delimited JSON format.
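The transformation step above can be illustrated with a minimal sketch. It uses only the Python standard library rather than the Pandas and GCP SDK code from the project, and the sample XML, the `customer` element, and the `id`/`name`/`email` fields are hypothetical stand-ins, since the actual sources and schema are not shown here.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical sample input; the real source files are not shown in this document.
SAMPLE_XML = """
<customers>
  <customer id="1"><name>Ada</name><email>ada@example.com</email></customer>
  <customer id="2"><name>Lin</name></customer>
</customers>
"""

# Assumed target schema: every output row carries exactly these fields.
FIELDS = ("id", "name", "email")


def xml_to_records(xml_text):
    """Turn each <customer> element into a flat dict matching FIELDS."""
    root = ET.fromstring(xml_text)
    records = []
    for node in root.findall("customer"):
        record = {"id": int(node.get("id"))}
        for field in FIELDS[1:]:
            # findtext returns None when the child element is missing,
            # which serializes to JSON null (a NULL in a nullable column).
            record[field] = node.findtext(field)
        records.append(record)
    return records


def to_ndjson(records):
    """Serialize records as newline-delimited JSON: one object per line."""
    return "\n".join(json.dumps(r, sort_keys=True) for r in records) + "\n"


records = xml_to_records(SAMPLE_XML)
ndjson = to_ndjson(records)
```

Text in this shape could then be written to a file or bucket object and loaded with a BigQuery load job configured for the newline-delimited JSON source format; that step is omitted here because it needs project credentials and a target table.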
Python
GCP
PySpark
Airflow
BigQuery
Kafka
Big Data
Dataflow
Google Certified Professional Data Engineer