Minal Das

Data Engineer
Pune

Summary

7 years of overall experience in IT, including 4 years as a Big Data developer designing, developing, and implementing data pipeline solutions. Hands-on experience with GCP components such as Data Fusion, Composer, Airflow, Pub/Sub, Dataflow, and BigQuery.

Overview

7
years of professional experience

Work History

Data Engineer

Vodafone
12.2022 - Current

● Designed multiple Data Fusion pipelines to improve business output.

● Created pipelines leveraging Composer Airflow DAGs.

● Designed an automation script to enable and disable DAGs.

● Created DAGs to load raw data into BigQuery.

GCP Data Engineer

Accenture
09.2021 - 12.2022
  • Design and build large-scale enterprise data solutions and applications using one or more GCP data and analytics services in combination with Composer, Dataflow, Dataproc, and BigQuery
  • Develop DAGs in Python, leveraging Airflow for orchestration
  • Customized a Java framework for real-time ingestion.

AWS Data Engineer

InfoCepts
03.2021 - 09.2021
  • Design and build large-scale enterprise data solutions and applications using one or more AWS data and analytics services in combination with third parties (Spark, EMR, DynamoDB, EC2, Parameter Store)
  • Design and build production data pipelines from ingestion to consumption within a big data architecture using the Scala programming language
  • Proficient in AWS data processing services (S3, Glue, Athena)
  • Experienced in creating and automating ETL pipelines via shell scripts.

Hadoop Developer

Subex
01.2020 - 03.2021
  • Design and develop data pipelines to move data into various layers of the big data ecosystem
  • Develop Hive scripts over the data and load the results to target systems for data analysts to use in reporting
  • Tune the performance of data pipelines to meet the desired outcome
  • Troubleshoot and debug issues independently
  • Gather, understand, and analyze the client's problem statement
  • Monitor daily execution, diagnose and log issues, and fix business-critical pipelines to ensure SLAs are met with stakeholders
  • Experienced with the Linux command line and version control software (Git).

Hadoop Developer

Vodafone
09.2017 - 01.2020
  • Develop and implement data pipelines that extract, transform, and load data into an information product that helps inform the organization in reaching its strategic goals
  • Store data in the Hadoop file system and process it using Spark with Scala
  • Worked extensively with Sqoop to import metadata from RDBMS into the staging layer
  • Analyze vast data stores to uncover insights.

Education

Bachelor of Computer Science

01.2013 - 04.2016

Skills

  • GCP
  • Composer
  • Airflow
  • DAG
  • Dataflow
  • BigQuery
  • HDFS
  • YARN
  • Hive
  • Sqoop
  • Oozie
  • Pub/Sub
  • SQL
  • Linux/Unix
  • Shell script
  • GitHub
  • Jira
  • Control-M
  • Data Fusion

Accomplishments

  • Received Excellence Award for Outstanding Performance.
  • Received Pat on Back Award for Automation.
