Summary
Overview
Work History
Education
Skills
Certification
Work Availability
Timeline
SoftwareEngineer
Mukul Kumar

Mukul Kumar

Pune

Summary

With 19+ years of experience, I am a meticulous and result-oriented data science architect. My track record demonstrates analytical acumen in deploying complex machine learning and statistical modeling algorithms and techniques to identify patterns and extract valuable insights. Throughout my career, I have successfully planned and executed multiple projects, collaborating with key stakeholders to identify and resolve business problem statements. Commitment to delivering excellent results consistently drives my work.

Overview

19
19
years of professional experience
1
1
Certification

Work History

Architect - Data Science

Globant
10.2023 - Current
  • Led the design and deployment of customized Generative AI solutions for EY, utilizing Azure OpenAI, Langchain, Vector DB, and Azure Embedding Models, achieving a dramatic reduction in SLA times from 90 days to 6 hours and significantly improving decision-making processes
  • Directed the architecture and development of data pipelines and API endpoints with Azure DevOps, ensuring robust and scalable AI solutions
  • Led the execution of high-impact MVPs leveraging LLMs, multimodal LLMs, AI agents, RAG, PEFT, and LoRA, advancing cutting-edge AI technologies and fine-tuning models for optimal performance
  • Drove business growth by contributing to proposals for major clients such as HSBC, CBA, and Deloitte, emphasizing the strategic application of Generative AI technologies
  • Optimized resource allocation and talent acquisition, forecasting data science needs and building high-performing teams to deliver results
  • Utilized Azure DevOps for continuous integration and continuous deployment (CI/CD) to streamline the deployment of data science solutions

Lead Machine Learning Engineer

ANZ Bank
03.2014 - 09.2023
  • Collaborated to build custom NER models for dynamic documents that led to a 30% increase in productivity
  • Led a Triage initiative for home loan to implement ML capability to automate documents verification.
  • This Increased operation efficiency ~50% drop in handle time = ~65hrs triage processing time reduction per day
  • Reduced turnaround time from current ~3 days SLA to ~1 day
  • Managed project progress by producing JIRA Dashboard that showcased allocated, tracked and closed work topics
  • Reviewed the progress with Senior Management
  • Partnered with engineering, design and customer teams to keep ML models in sync with business needs by collecting model feedback, retraining and deployment to achieve STP (> 90% accuracy)
  • Implemented 8+ models using cloud (GCP), JupterHub, Jenkins Pipeline, Codefresh
  • Document Classification - Designed ML models to classify similar types of documents into different documents like ITR, Payslip, Bank statements etc using NLP and supervised learning algorithm
  • Entity extraction - Built and deployed model by combining output of Spacy NER model and Random Forest using Sklearn pipeline to extract information from documents
  • Improved model metrics by applying various techniques
  • Signature Extraction and Verification - Created model to extract Signature using OpenCV and verification is done using CNN model
  • Sentiment Analysis - Collaborated to analysed customer interaction and feedback and classified into different sentiments using NLP and supervised learning
  • Topic Modelling - Analysed collection of documents and text files and automatically assign them topic using NLP and Non matrix factorization
  • Key Projects:
  • Knowledge Augmented Questionnaire: Led a team to build an AI-powered feature enabling users to generate customized questionnaires based on framework documents and specific inputs
  • Evidence-Based Assessment: Developed an AI-driven system for processing unstructured data and assisting users with analysis, improving workflow efficiency
  • AI-Assisted Insight & Sentiment Analysis: Delivered a feature enabling natural language querying of structured data to generate AI-powered insights, enhancing reporting capabilities
  • Evidence Document Generation: Created a system that enables users to generate evidence documents from contextual information, streamlining document creation
  • Question Data Sync Pipeline: Built and deployed a data pipeline to synchronize question data from SQL to vector DB, improving data consistency and accessibility

Application Development Lead

Barclays Bank PLC
05.2010 - 03.2014
  • Led a team of 10 to build a new loan origination system to capture the details of the loan application
  • Created and supported core banking application (Hogan), including requirement gathering, application development, writing test cases and testing the output

Sr. Software Developer

Aptara
07.2009 - 05.2010
  • Participated in support and enhancement of Loan Level Accounting application
  • Used SQL extensively to extract and present data to Product Owners for effective decision-making
  • Resolved arguments through data analysis and clear communication

Associate Consultant

Capgemini
12.2005 - 05.2009
  • Guided a team of 5 FTE to migrate workload scheduler from legacy system CA7 to TWS
  • Provided development support to multiple mainframe projects
  • Trained ~ 200 FTEs on TWS and received numerous appreciations for my training skills
  • Received 'Pat on the Back Award' from the client to lead, design, and develop TWS

Education

Master's in ML & AI -

Liverpool John Moore University
12.2022

PG Diploma in ML and AI -

IIIT Bangalore
12.2021

B.E. Mechanical Engineering -

Bhilai Institute of Technology
12.2005

Skills

  • Python
  • Pandas
  • Numpy
  • PyTorch
  • Gen AI
  • LLM
  • AI Agent
  • Prompt Engineering
  • PEFT
  • LangChain
  • RAG
  • Vector DB
  • Agent
  • Scikit-Learn
  • Tensor flow/Keras
  • NLP
  • Supervised ML algorithms
  • Unsupervised ML algorithms
  • Recommendation system
  • OpenCV
  • Computer Vision
  • Deep Learning
  • MLOps
  • Azure DevOps
  • Jenkins pipeline
  • Flask framework
  • Docker
  • Github
  • Jupyter Hub
  • Codefresh
  • Azure Data Factory
  • SQL
  • Teradata
  • Data Investigator
  • VSAM
  • DB2
  • Cloud
  • GCP
  • Vertex AI
  • Azure
  • IBM Mainframe
  • Hogan
  • COBOL
  • JCL
  • Rest API
  • Text Mining
  • Data Mining
  • Analytics
  • Predictive Modelling
  • Statistical Modelling
  • Data Visualization
  • Agile
  • Stakeholder Management

Certification

  • Generative AI with Large Language Model, DeepLearning.AI, 2023
  • Google Cloud Certified - Professional Machine Learning Engineer, 2023
  • Google Cloud Certified - Cloud Digital Leader Certification, 2022
  • Managing Machine Learning Projects with Google Cloud, 2022
  • Data science toolkit course, UpGrad, 2020
  • Leading SAFe Course (4.5), 2018

Work Availability

monday
tuesday
wednesday
thursday
friday
saturday
sunday
morning
afternoon
evening
swipe to browse

Timeline

Architect - Data Science

Globant
10.2023 - Current

Lead Machine Learning Engineer

ANZ Bank
03.2014 - 09.2023

Application Development Lead

Barclays Bank PLC
05.2010 - 03.2014

Sr. Software Developer

Aptara
07.2009 - 05.2010

Associate Consultant

Capgemini
12.2005 - 05.2009

PG Diploma in ML and AI -

IIIT Bangalore

B.E. Mechanical Engineering -

Bhilai Institute of Technology

Master's in ML & AI -

Liverpool John Moore University
Mukul Kumar