Summary
Overview
Work History
Education
Skills
Websites
Certification
Timeline
Generic

Tejaswi Dubey

Senior Principal Data Science
Mumbai

Summary

Senior leadership professional with proven ability to drive strategic initiatives and foster team collaboration. Adept at navigating complex challenges, consistently delivering impactful results, and adapting to changing dynamics. Strong skills in project management, stakeholder engagement, and organizational development. Known for reliability and focus on achieving measurable outcomes.

Overview

14
14
years of professional experience
5
5
years of post-secondary education
2
2
Certifications

Work History

Senior Principal Data Science

Affine Analytics
Mumbai
04.2025 - Current

Principal Data Scientist

Affine Analytics
10.2023 - 03.2025
  • Engineered an ensemble learning-based forecasting framework for NABARD’s balance sheet, improving financial planning accuracy across macro and microeconomic scenarios.
  • Developed intelligent document retrieval and classification system using Large Language Models (LLMs) and RAG architecture, enabling context-aware search and automated indexing of NABARD’s policy and financial documentation.
  • Designed and deployed a client risk rating model leveraging internal credit behavior and external economic signals to reduce Non-Performing Asset (NPA) exposure and trigger early warning systems (EWS) for high-risk accounts.
  • Built a supervised learning pipeline for forecasting Ground Level Credit (GLC) disbursement trends to support policy planning for NABARD and Government of India (GoI), leveraging time-series models and ensemble predictors.
  • Constructed a predictive model for Net Interest Margin (NIM) estimation, aligning model outputs with risk thresholds set by NABARD’s Risk Management Department (RMD) for regulatory compliance and stress testing.
  • Implemented optimized ML pipelines using parallelized processing and lazy evaluation strategies, leading to significant reductions in training and inference latency across compute clusters.
  • Automated repetitive analytics and data wrangling tasks through Python-based scripting and R-based macros, driving efficiency improvements and cutting manual intervention time by over 40%.

Lead Data Scientist

LEAD School
Mumbai
04.2022 - 05.2023
  • Built and deployed a machine learning-based retention classification model, leveraging ensemble techniques and behavioral analytics to reduce user churn by 10%, driving sustained engagement.
  • Designed a regression-driven remedial performance model that identified at-risk learners and recommended personalized interventions, resulting in a 25% uplift in student performance metrics.
  • Led the automated academic content generation pipeline using advanced Natural Language Processing (NLP) techniques, reducing manual effort by the academic team by over 40%.
  • Managed and mentored a team of five data scientists, driving full-lifecycle implementation of machine learning systems—across data engineering, modeling, deployment, and A/B testing—ensuring both scalability and delivery velocity.
  • Integrated parameter-efficient fine-tuned (PEFT) LLMs, specifically FLAN-T5, to extract insights from unstructured customer feedback at scale, improving customer satisfaction (CSAT) scores by 10% through actionable content and sentiment analysis.Implemented LLM( FLAN T5 PEFT) to analyse customer feedback, leading to a 10% improvement in customer satisfaction scores.

Lead Data Scientist

Bombay Shirt Company
Mumbai
11.2020 - 04.2022
  • Developed and productionized a Personalized Recommendation System using collaborative filtering and deep learning techniques, driving a 20% increase in revenue through enhanced user engagement and product discovery.
  • Engineered a SmartFit Algorithm combining computer vision and customer body metrics for precision apparel sizing, significantly improving customer satisfaction and reducing product return rates.
  • Built and deployed machine learning models for customer segmentation using clustering and dimensionality reduction, leading to a 15% lift in marketing campaign effectiveness through hyper-personalized targeting.

Senior Data Scientist

Medikabazaar (Boston Ivy Healthcare Pvt Ltd)
Mumbai
04.2019 - 11.2020
  • Built VIZI, a regression-driven predictive analytics tool that enabled timely procurement decisions in real estate and pharmaceuticals, resulting in inventory cost savings and reduced expiry-related losses.
  • Developed a Chatbot leveraging the Reformer (efficient Transformer) model, enhancing the customer service experience and reducing turnaround time (TAT) for inquiries and support by automating first-line responses.
  • Engineered and deployed a Product Recommendation Engine using hybrid filtering techniques, which increased product visibility and contributed to a 10% uplift in revenue through improved personalization.

Manager Analytics

Market Realist
New Delhi
07.2015 - 08.2018
  • Utilized Python and machine learning libraries (Pandas, Scikit-learn, XGBoost) for comprehensive data exploration, feature engineering, and model development to support decision-making workflows.
  • Developed a Click-Through Rate (CTR) prediction model leveraging classification techniques to optimize creative performance, leading to a measurable increase in user engagement.
  • Led the design, development, and deployment of new data-driven functionalities, contributing to the continuous improvement of platform intelligence and system capabilities.

Data Analyst

CONNECT COMPUSYS PRIVATE LIMITED
New Delhi
11.2011 - 01.2015
  • Performed data ingestion, cleansing, transformation, and validation to enable robust analytics and drive data-informed decision making across business functions.
  • Managed ETL workflows for client data pipelines, ensuring accurate loading, extraction, and validation of high-volume datasets.
  • Authored complex SQL (DDL/DML) scripts to improve data integrity and quality, and extracted actionable insights using statistical tools such as Excel, R, and Python.
  • Developed and deployed Random Forest-based customer retention models, enabling early identification of churn risk and supporting proactive retention strategies.

Education

Some College (No Degree) - Machine Learning & Deep Learning

GreyAtom
Mumbai
01.2018 - 12.2018

SVITS Indore
Indore
01.2004 - 01.2008

Skills

Machine Learning: Linear Regression, Logistic Regression, Decision Trees, Random Forest, Gradient Boosting, Neural Networks, SVM, Clustering, Anomaly Detection

LLM and Transformer, FAISS, Weaviate, Pinecone, HF, LangChain, LlamaIndex, Haystack

Deep Learning: TensorFlow, Keras, Pytorch

Programming Languages: Python, R, SQL

Data Visualization Tools: Tableau, PowerBI, Matplotlib, Seaborn

Statistical Analysis: Hypothesis Testing, A/B Testing, Time Series Analysis

NLP Techniques: Sentiment Analysis, Topic Modelling, Word Embeddings

Data Wrangling: Pandas, NumPy, Scikit-learn

Version Control: Git, GitHub

Cloud: AWS, GCP

Certification

TensorFlow Developer Certificate

Timeline

Senior Principal Data Science

Affine Analytics
04.2025 - Current

Principal Data Scientist

Affine Analytics
10.2023 - 03.2025

Lead Data Scientist

LEAD School
04.2022 - 05.2023

Lead Data Scientist

Bombay Shirt Company
11.2020 - 04.2022

Senior Data Scientist

Medikabazaar (Boston Ivy Healthcare Pvt Ltd)
04.2019 - 11.2020

Some College (No Degree) - Machine Learning & Deep Learning

GreyAtom
01.2018 - 12.2018

Manager Analytics

Market Realist
07.2015 - 08.2018

Data Analyst

CONNECT COMPUSYS PRIVATE LIMITED
11.2011 - 01.2015

SVITS Indore
01.2004 - 01.2008
Tejaswi DubeySenior Principal Data Science