Summary
Overview
Work History
Education
Skills
Certification
PROJECTS
Timeline
Generic
PIYUSH AGARWAL

PIYUSH AGARWAL

Bengaluru

Summary

An Accomplished, determined and well-rounded individual with more than 2 years of experience as Data Scientist within Finance and Automotive industry. Focused and keen concentrated professional in Data Science, Machine Learning, Deep Learning, Natural Language Processing, Computer Vision and Generative AI. The contribution has been pivotal in interpreting complex datasets, developing and implementing data science models and algorithms, applying machine learning and deep learning techniques, utilizing natural language processing methods, and collaborating with cross-functional teams to translate business requirements into analytical solutions that drive significant impact.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Data Scientist I

Tekion
01.2024 - Current
  • Leveraged Open-AI capabilities while keep AI safety measures intact to generate actionable insights from raw business data, utilizing tree methods and GPT techniques for insightful carousels providing summaries and sentiments of business' key performance indicators (KPIs) i.e Profits and expenses
  • Implemented Smart Q&A functionality for intelligent suggestions and curated FAQs, reducing user effort and enhancing usability, along with Few Shots for GPT Feeds to guide accurate user queries
  • Automated vehicle check-in by extracting driver's license details with accuracy across 30 US states, using a faster RCNN model for detection and LayoutLMv3 for information extraction
  • Implemented barcode recognition and authentication procedures to validate extracted data, ensuring document authenticity and security.

Data Scientist Associate

Tekion
01.2022 - 12.2023
  • Developed highly accurate recommendations for service advisors to efficiently select parts for vehicle servicing tasks, achieving a remarkable 95% predictive accuracy using kneedling algorithm despite covering only 45% of organizational paid service data, mainly from OEMs
  • Utilized in-depth analysis, feature engineering, and XGBoost technique to enhance performance to 93%, aiding dealerships in managing inventory for sales and anticipating service parts needed for upcoming services
  • Created a G/L Account Mapping tool to automate accounting ledgers, reducing processing time from 3-4 days to just 4 hours with an 80% coverage of transactions and 95% accuracy, using Random Forest Classifier and heuristic approach
  • Led development of User Preference Services for vendors' Invoice Management, leveraging LayoutLMv3 for document detail extraction, resulting in significant improvements in clients' operations through a user-centric approach.

Data Scientist Intern

HighRadius
08.2021 - 12.2021
  • Conducted thorough analysis of Account Receivables and Account
  • Payables Data, identifying key insights and forecasting accounting trends to drive informed decision-making
  • Enhanced code bases through rigorous refinement, ensuring robustness and efficiency in algorithm performance
  • Executed comprehensive test cases to validate pipelines, effectively increasing coverage and reliability of data processing
  • Utilized exploratory data analysis and statistical methodologies to uncover complex patterns, enhancing algorithm performance and efficiency.

Education

BE - CSE- Machine Learning & AI

Chandigarh University
Mohali
05.2022

12th - Maths-Science

SDP SR. SEC. SCHOOL
Mandawa, JhunJhunu
05.2017

10th - Secondary Education

SDP SR. SEC. SCHOOL
Mandawa, Jhunjhunu
05.2015

Skills

  • Languages: Python, SQL, Java
  • Statistics: Descriptive Stats, Inferential Stats, Data Analysis and Consolidation, Hypothesis Testing, Feature Engineering
  • Machine Learning: Linear Regression, Logistic Regression, Decision Trees, Random Forests, Naive Byes, SVM, Bagging and Boosting, XGBoost, KMeans Clustering, DBSCAN Clustering, KNN Classifier
  • Deep Learning and NLP: Artificial Neural Networks, Convolutional Neural Networks, Recurrent Neural Networks, Transformers, Activation Functions, Optimizers, Cost Functions, Word Embeddings
  • Libraries and Frameworks: TensorFlow, Keras, Pytorch, LangChain, LlamaIndex, OPENAI API, Hugging Face, Pandas, Numpy, SciKit Learn, NLTK, SciPy, Seaborn
  • Tools: Jupyter Lab, Visual Studio, Pycharm, Git, Colab, Command Prompt, AWS/S3, Confluence, Jira

Certification

  • Machine Learning - Coursera : Relevant coursework - Supervised Learning Algorithms, Unsupervised Learning Algorithms, Exploratory Data Analysis Best Practices
  • Neural Networks and Deep Learning - Coursera : Relevant coursework - Mathematics behind Neural Networks, Working of Neural Nets, Forward Propagation, Backward propagation
  • Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization - Coursera : Relevant coursework - Activation Functions, Loss Functions, Optimizers, Hyperparameter Tuning, Regularization for improving performance of Neural networks
  • Sequence Models - Coursera : Relevant coursework - Introductions to Recurrent neural networks and journey to attention mechanism
  • Data Science & Machine Learning Training InternShala : Relevant coursework - Practical Learning of Machine Learning and Data Exploration

PROJECTS

Guard-Railing LLM Applications

Implemented safety measures like Misinformation, Hallucination, generating correct, truthful, and consistent outputs.

Ensured structured outputs using Pydantic data models and Rail Specs of GuardRails AI.


Sentiment Analysis

Implemented three common word embeddings in NLP: Word2Vec, Tf-Idf and BERT.

Trained and evaluated fine tuned BERT architecture with custom output layer.



Timeline

Data Scientist I

Tekion
01.2024 - Current

Data Scientist Associate

Tekion
01.2022 - 12.2023

Data Scientist Intern

HighRadius
08.2021 - 12.2021

BE - CSE- Machine Learning & AI

Chandigarh University

12th - Maths-Science

SDP SR. SEC. SCHOOL

10th - Secondary Education

SDP SR. SEC. SCHOOL
PIYUSH AGARWAL