Summary
Overview
Work History
Education
Skills
Certification
Timeline
Section name
Generic
Akanksha Tiwari

Akanksha Tiwari

Data Scientist/Python Data Engineer
Pune

Summary

Data Scientist with 4+ years of experience in Python-based data analytics, machine learning, and data engineering. Skilled in Pandas, NumPy, SQL, and data visualization using Matplotlib. Experienced in building ETL pipelines, performing exploratory data analysis, and developing data-driven solutions across logistics, financial, automotive, and healthcare datasets. Proficient with Linux environments, OpenShift deployments, and API-based data services.

Overview

5
5
years of professional experience
2
2
Certifications

Work History

Python Data Engineer

Tata Consultancy Services
01.2021 - Current

Project: PostNord (Logistics and Fleet Analytics Platform)

Role: Data Scientist

  • Designed Python-based data processing pipelines using Pandas and NumPy to analyze truck utilization, route performance, and operational logistics data.
  • Built data cleaning and transformation workflows for large operational datasets to improve accuracy and consistency of logistics analytics.
  • Developed REST APIs and WebSocket-based services for streaming real-time operational data, and enabling live monitoring dashboards.
  • Implemented data analysis and visualization using Matplotlib and Jupyter Notebook to identify utilization trends and operational inefficiencies.
  • Performed exploratory data analysis (EDA) on fleet data to uncover patterns affecting transportation efficiency and delivery delays.
  • Integrated analytics services with OpenShift-based microservice environments to enable scalable data processing pipelines.
  • Developed SQL queries and ETL workflows to extract logistics data from operational databases, and transform it for analytics reporting.
  • Version-controlled data science experiments and pipelines using Git and GitHub.
  • Deployed and monitored analytics services on Linux environments, ensuring performance and reliability.

Project: MCX Trading and Banking (Market Data Analytics)

Role: Data Scientist

  • Developed Python automation and data processing scripts for analyzing financial trading datasets.
  • Built SQL-based analytical queries to extract trading metrics, such as volume trends, transaction patterns, and anomaly detection signals.
  • Performed time-series data analysis using Pandas and NumPy to study market activity and trading behavior.
  • Conducted data visualization and trend analysis using Matplotlib in Jupyter Notebook to support financial reporting.
  • Implemented data transformation pipelines to prepare raw trading data for analytics and reporting workflows.
  • Built backend data services using Python APIs to expose processed analytics data to internal reporting systems.
  • Used Linux-based environments for script execution, debugging, and large dataset processing.
  • Managed deployment pipelines and code versioning using Git and GitHub.

Project: Jaguar Land Rover (Automotive Data Analytics)

Role: Data Scientist

  • Developed Python modules using OOP principles to process automotive telemetry and system diagnostic data.
  • Built data transformation and feature engineering pipelines using Pandas and NumPy for automotive performance datasets.
  • Conducted exploratory data analysis to detect patterns in vehicle performance and system behavior.
  • Developed analytical dashboards and visual reports using Matplotlib within Jupyter Notebook.
  • Integrated data services with backend APIs, and OpenShift infrastructure for scalable data processing.
  • Built SQL queries for extracting and analyzing vehicle data stored in relational databases.
  • Used Linux-based development environments for data pipeline execution and testing.
  • Maintained the project codebase and experiment tracking using Git and GitHub.

Project: GSK Healthcare (Healthcare Data Analytics & Reporting)

Role: Data Scientist

  • Processed and analyzed healthcare claims datasets using Python, Pandas, and SQL to identify reporting metrics and trends.
  • Designed data extraction and ETL pipelines to transform healthcare operational data for analytical reporting.
  • Performed data cleaning and preprocessing to ensure data quality and accuracy for downstream analytics tasks.
  • Built SQL-based reporting queries and dashboards supporting healthcare business intelligence workflows.
  • Conducted statistical analysis and visualization using Matplotlib and Jupyter Notebook to highlight healthcare claim trends.
  • Automated data analysis scripts running in Linux environments for scheduled reporting tasks.
  • Managed source code and collaborative development using Git and GitHub.

Education

B.E. - EEE

RGPV University
Bhopal
01-2021

Skills

  • Programming: Python, SQL
  • Data Science & Analytics: Pandas, NumPy, Matplotlib, Data Cleaning, Exploratory Data Analysis, Feature Engineering, Statistical Analysis
  • Data Engineering: ETL Pipelines, Data Transformation, Data Processing Workflows
  • Backend & Integration:Python APIs, WebSocket, OpenShiftt
  • Tools & Platforms: Jupyter Notebook, Git, GitHub, Linux
  • Cloud: Microsoft Azure Fundamentals (AZ-900)
  • Concepts: Machine Learning Basics, Generative AI, Data Analytics, Agile Methodology

Certification

Microsoft Azure Fundamentals (AZ-900)

Timeline

Python Data Engineer

Tata Consultancy Services
01.2021 - Current

B.E. - EEE

RGPV University

Section name

Ongoing: AI & Machine Learning for Data Science
Akanksha TiwariData Scientist/Python Data Engineer