Kunal Khandagale

Pune

Summary

Accomplished Site Reliability Engineer with over 10 years of experience designing and implementing observability, automation, and cloud infrastructure solutions for global enterprises. Proficient in APM tools (Dynatrace, Datadog, Grafana), cloud platforms (Azure, Kubernetes), and IaC (Terraform, Git). Reduced system downtime by 40% and enhanced deployment efficiency by 50% through automation and proactive monitoring. Adept at incident response, SLO/SLI frameworks, and ITSM processes, ensuring high availability and operational excellence.

Overview

10 years of professional experience
1 Certification

Work History

Principal Infrastructure Architect

Cognizant Technology Solutions
10.2023 - Current
  • Implemented end-to-end observability using Dynatrace, configuring Real User Monitoring, Synthetic Monitoring, and Database Monitoring, achieving 99.99% uptime for critical healthcare applications.
  • Deployed Azure infrastructure (AKS, Virtual Networks) using Terraform and Git, reducing provisioning time by 50% and ensuring version-controlled deployments.
  • Led P1/P2 incident resolution in war rooms, reducing mean time to resolution (MTTR) by 40% through proactive monitoring and root cause analysis (RCA).
  • Configured least-privilege RBAC in Azure, enhancing security compliance by 30% across cloud services.
  • Designed business dashboards in Dynatrace, providing real-time insights into application performance, improving stakeholder decision-making by 25%.
  • Streamlined ITSM processes in ServiceNow, automating incident creation from priority alerts, reducing manual effort by 35%.

Infrastructure Engineer

Maersk Global Service Centre
10.2022 - 10.2023
  • Configured Grafana dashboards with Loki (logs) and Tempo (traces), enabling real-time observability for Java and .NET applications, improving incident detection by 30%.
  • Automated performance data extraction using Python scripts and Datadog APIs, reducing manual analysis time by 40%.
  • Wrote Prometheus and Loki queries to monitor application metrics, ensuring proactive identification of performance bottlenecks.
  • Installed and maintained Grafana, enhancing time-series data visualization for global teams.

Senior System Engineer

Larsen & Toubro Infotech
09.2017 - 10.2022
  • Migrated legacy monitoring tools such as Micro Focus SiteScope and BSM to Datadog and Dynatrace.
  • Designed a completely new process for onboarding applications using ServiceNow request forms.
  • Deployed monitoring tools (Dynatrace, Datadog, SolarWinds, AppDynamics) for full-stack observability across Java and .NET applications.
  • Automated monitoring agent deployment with Ansible during the migration, reducing manual effort and increasing efficiency by 75% to 80%.
  • Created real-time monitoring dashboards for mission-critical applications to avoid business impact.
  • Conducted monthly meetings with stakeholders to discuss Application Performance Monitoring and provide feedback.
  • Designed a PowerPoint presentation on the importance of Application Performance Monitoring to onboard multiple clients.
  • Conducted monthly performance clinics, providing RCA findings and recommendations, enhancing application reliability by 20%.
  • Trained freshers and junior engineers across locations in India on APM tools and SRE terminology.

System Administrator

HSBC
07.2016 - 09.2017
  • Established global performance testing environments using Micro Focus SiteScope and LoadRunner, ensuring scalability for applications across regions.
  • Automated server performance reporting (daily, weekly, monthly), reducing manual effort by 50% and improving stakeholder visibility.
  • Configured monitors in SiteScope, integrating with LoadRunner for live application load analysis, reducing performance issue detection time by 30%.
  • Collaborated with vendors to optimize tool performance, contributing to a 15% improvement in system reliability.

Associate Software Engineer

Accenture Technology Solutions
05.2015 - 07.2016
  • Installed and maintained Micro Focus ALM and LoadRunner, enabling performance testing for pharmaceutical applications.
  • Configured RBAC and customized project workflows, improving team efficiency by 20%.
  • Created dashboards and reports for defect analysis, providing actionable insights to development teams.
  • Supported ITSM processes using BMC Remedy, ensuring compliance with incident and change management protocols.

Education

Bachelor of Science - Information Technology

SIEC College of Arts, Science & Commerce
Mumbai, India
06-2014

Higher Secondary Certificate - Science

N K Acharya And D K Marathe College, Mumbai
Mumbai, India
06-2011

Skills

  • Dynatrace
  • Datadog
  • SolarWinds
  • Grafana
  • Micro Focus BSM
  • Micro Focus SiteScope

  • Microsoft Azure
  • Amazon Web Services
  • ServiceNow
  • Terraform
  • GitHub and GitHub Actions
  • PowerShell

Certification

Dynatrace Certified Associate

Datadog APM certified

Microsoft Azure Administrator

AWS Solutions Architect Associate

Micro Focus SiteScope Certified Professional
Micro Focus APM Certified Professional
