Summary
Overview
Work History
Education
Skills
Websites
Certification
Timeline
Generic
Atul Kumar Shukla

Atul Kumar Shukla

Manager- R&D (ML &MLOps)
Hyderabad

Summary

Data Science and AI professional with a 9 year background in data science processes, covering machine learning algorithms, statistical analysis, predictive analytics, and MLOps. Demonstrates proficiency in Python, R, MySQL, and PySpark, showcasing a track record of skillfully applying these capabilities to foster business growth. Adept in steering cross-functional teams and delivering data-driven insights, offering a comprehensive understanding of data science principles to provide valuable perspectives and solutions.

Overview

9
9
years of professional experience
4
4
Certifications

Work History

Manager-R&D(ML&MLOps)

PepsiCo
06.2024 - Current
  • Deployed ML based applications across sectors and to increase operational efficiency.
  • Streamlined workflows by identifying bottlenecks in existing systems and implementing appropriate solutions.
  • Reduced operational costs through comprehensive process improvement initiatives and resource management..
  • Led sessions based on AI/ML to different sector audience.
  • Led cross-functional teams to achieve project goals, fostering collaboration and innovation.
  • Leveraged data and analytics to make informed decisions and drive business improvements.

Sr. Software Engineer

47 Billion
03.2020 - 03.2023
  • Engineered advanced deep learning model capable of utilizing visual data to identify tables and cells within scanned documents. Implemented customized post-processing technique to structure rows and columns within these tables, with specific emphasis on mortgage and electronic health record (EHR) documents.
  • Deployed and constructed model intended to identify document layouts through image analysis, specifically focusing on elements like tables, forms, titles, headers, and footers within documents associated with mortgages and used Fast API for deployment.
  • Developed specialized Named Entity Recognition (NER) model utilizing Hugging Face's Longformer architecture to extract entities such as borrower, co-borrower, trustee, vesting, and legal description date from documents like Deed and Deed of Trust, specifically within context of mortgage-related documents
  • Developed personalized image processing method utilizing image alignment, Optical Character Recognition (OCR), and OpenCV to retrieve data from US appraisal forms.
  • Developed distinctive machine learning algorithm to classify healthcare documents based on presence of handwritten text. This process included preprocessing images using OpenCV, extracting features such as area, orientation, extent, and solidity using python's scipy library, and utilizing random forest model for classification task.

Data Scientist

Canopus Data Insights Pvt. Ltd.
05.2016 - 02.2020
  • LAP tool was developed using R and R-shiny to analyze and predict errors using test-bed logs within vehicle engine manufacturing facility. This tool utilized time series modeling to forecast parameter values from device logs. With predicted parameters, tool classified whether log values indicated occurrence of an error or not using machine learning algorithm.
  • The goal was to retrieve sales data from Shopify-registered vendors using comma-feed and Firebase API, utilizing Python for data collection and then identify top-selling products based on current sales trends. Exponential Weighted Moving Average was applied to assess popularity of products using their inventory quantities.
  • RNN-based translation system using TensorFlow is developed to convert documents from English to Spanish, while .NET programming language is utilized to incorporate text into template. This collaboration involves cross-functional team members working for U.S. Insurance company
  • Formulated case study to analyze customer churn within telecommunications sector and established dashboard using R-Shiny for visualization.
  • Designed plotly-based dashboard to conduct Net Promoter Score (NPS) analysis on survey data for UK-based heavy machinery tools manufacturing firm.

Education

Master of Science - Statistics

Devi Ahilya Vishwavidyalaya
Indore, India
04.2001 -

Bachelor of Science - Information Technology

Devi Ahilya Vishwavidyalaya
Indore
04.2001 -

Skills

    Programming: Python (Numpy, Pandas, scikit-learn, opencv, matplotlib, seaborn, plotly), R, pytorch, keras, tensorflow, flask, fast API

    Tools: Jupyter, JIRA, Git, Github, Docker, Google Colab, AWS Sagemaker

    Deep Learning: CNN, RNN, LSTM, Generative AI, LLM

    Databases: MySQL, Graph Database-Neo4j

    MLOPs: Mlflow, DVC

    Miscellaneous Skills: Statistics, Predictive Analytics, Hypothesis testing, A/B testing, ETL development, Data Scraping

Certification

Machine Learning A-ZTM: Hands-On Python and R

Timeline

Manager-R&D(ML&MLOps)

PepsiCo
06.2024 - Current

Master Practical MLOps for Data Scientists and DevOps on AWS

10-2023

Sr. Software Engineer

47 Billion
03.2020 - 03.2023

Machine Learning A-ZTM: Hands-On Python and R

05-2019

Complete Python Boot camp: Go from zero to hero in Python 3

04-2019

Deep Learning A-Z: Hands-On Artificial Neural Networks

09-2018

Data Scientist

Canopus Data Insights Pvt. Ltd.
05.2016 - 02.2020

Master of Science - Statistics

Devi Ahilya Vishwavidyalaya
04.2001 -

Bachelor of Science - Information Technology

Devi Ahilya Vishwavidyalaya
04.2001 -
Atul Kumar ShuklaManager- R&D (ML &MLOps)