Summary
Overview
Work History
Education
Skills
Websites
Accomplishments
Timeline
Generic
Shyam Khamkar

Shyam Khamkar

Data Science - Credit Risk
Pune,Maharashtra

Summary

Results-driven Data Scientist with 2+ year of experience in delivering end-to-end analytical solutions across BFSI and IT sectors. Skilled in machine learning, model development, data engineering (Python, PySpark, SQL), and BI tools like Power BI. Adept at Agile project execution and cross-functional collaboration, with a strong passion for driving data-driven insights and innovation.

Overview

2
2
years of professional experience
4045
4045
years of post-secondary education

Work History

Unit Manager - Data Science and Analytics

Bajaj Finserv
04.2023 - Current
  • 1. Marginal Customer Segmentation for Risk Mitigation

    Technologies: Databricks SQL, Python, PySpark, Decision Trees, WOE/IV, Optimal Binning

    Designed and implemented a data-driven customer segmentation framework to identify and flag high-risk (marginal) segments in ongoing sourcing for unsecured lending products (e.g., Personal Loans)
    Evaluated current portfolio KPIs against monthly benchmarks to define "bad" behavior aligned with business policies
    Created a sampling framework using dependent KPI variables and over 17,000 TTD variables from OFFUS and ONUS feature marts
    Applied advanced feature reduction techniques (IV, WOE, Optimal Binning, Variable Clustering) to select top 300–400 variables for model input
    Built and validated a decision tree model on out-of-time datasets to isolate segments with 1.5x higher bad rates than monthly benchmarks
    Proposed high-risk segments for sourcing cutbacks, resulting in 2–5% monthly swap-out savings and improved portfolio health
    Contributed to a ₹15 Cr cost savings in Personal Loan segment through proactive identification of marginal customer cohorts

2. Acquisition Risk Scoring Model Development


Technologies: Python, XGBoost, Boruta, SHAP, SQL, ModelOps


Developed a machine learning–based risk scoring model for customer acquisition using XGBoost, aimed at improving early risk assessment during sourcing
Performed feature selection using Boruta algorithm to identify key predictive variables from a high-dimensional dataset
Conducted hyperparameter tuning to optimize model performance and prevent overfitting
Translated model output probabilities into a risk scorecard, ensuring interpretability and ease of integration with business rules
Evaluated model using statistical metrics including KS, AUC, Gini, and monitored stability using Population Stability Index (PSI)
Employed SHAP values to explain feature contributions and support model transparency for stakeholders and compliance teams
Successfully integrated the model into existing acquisition workflows and decision systems, improving risk discrimination at point-of-sale

Education

Post Graduate Diploma - Advance Computing, Big Data Analytics

CDAC
Pune
03.2023

Bachelor Of Technology - Chemical Engineering

Institute Of Chemical Technology
Mumbai
08.2021

Skills

  • Predictive Modeling: Logistic/Linear Regression, Decision Trees, Clustering, PCA, Random Forest, Gradient Boosting (XGBoost, LightGBM)

  • Statistical Analysis: Hypothesis testing, Probability, Time series, Model validation

  • Tools & Programming: Python (pandas, scikit-learn, NumPy), R, Sql, Pyspark, Power Bi, Big Data Technologies etc

  • Cloud : MS Azure, AWS, Databricks

  • Model Governance: AUC, KS, Gini, Precision/Recall, Scorecard development & monitoring

  • Credit Risk Modeling: Scorecards (application, behavior)

  • BFSI Analytics: Retail & commercial credit, customer & sales analytics

Accomplishments

  • Recipient of the Debutant Award for exceptional performance, demonstrating rapid proficiency and outstanding contributions in my initial year.

Timeline

Unit Manager - Data Science and Analytics

Bajaj Finserv
04.2023 - Current

Post Graduate Diploma - Advance Computing, Big Data Analytics

CDAC

Bachelor Of Technology - Chemical Engineering

Institute Of Chemical Technology
Shyam KhamkarData Science - Credit Risk