Summary
Overview
Work History
Education
Skills
Accomplishments
Timeline
Generic

Sumit Tripathi

Data Engineer
Prayagraj,India

Summary

Meticulous Data Scientist/Engineer accomplished in compiling, transforming and analyzing complex information through software. Expert in machine learning and large dataset management. Demonstrated success in identifying relationships and building solutions to business problems.

Overview

2
2
years of professional experience
4
4
years of post-secondary education
2
2
Languages

Work History

Data Engineer

SelfDecode
06.2021 - Current
  • Performed large-scale data conversions, transferring VCF data into standardized 23andme formats for integration into delta lake for creating complete imputation pipeline.
  • Experimented with various ways of improving performance of data access using non-spark cluster based environment from delta lake.
  • Created multiple dashboard for analyzing genetics data-based recommendation(prediction) for groups of traits of disease for improvement in predictive model.
  • Monitored incoming data analytics requests, executed analytics and efficiently distributed results to support modelling strategies.

Data Scientist Intern

Taiyō.AI
02.2021 - 06.2021
  • Build semantic and syntactical searching system using a combination of basic and advanced technique from tokenization, multi-n-grams generation, building vocab corpus using fasttext to pre- indexing using NMSLIB and BM25. So that one could search information using just few words or even using alteration in syntax based on kNN querying in corpus.
  • Established recommendation system for most influential economic indicators using combination of one vs rest modelling, anomaly detection and curating feature to recommend based on weighted feature importance of each model.

Computer Vision Software Engineer Intern

KritiKal Solutions
12.2020 - 02.2021
  • Exploring and experimenting with various ways for table detection, cell detection, and information extraction from invoice documents for OCR performance improvement[ mAP@0.5 of 0.65 to 0.77].

Computer Vision Engineer Intern

MirragAI
12.2020 - 02.2021
  • Used advanced augmentation techniques to improve accuracy of Yolov3 for PPE detection from 81.2% to 91.4%.
  • Building intrusion detection system using OpenCV and Yolov3 with current mean average precision(mAP@0.5) of 0.55.

Machine Learning Intern

Techimax IT Services Pvt Ltd
05.2020 - 10.2020
  • Recording Interactive Machine Learning, Deep Learning, Mathematics For Machine Learning and Computer Vision Course.
  • Structured courses, created content, projects and performed end to end maintenance of projects.

Machine Learning Intern

MToV Inc
05.2019 - 07.2019
  • Identified driving styles using classification models such as SVM, RF and Neural Network with in-vehicle data can provide automated feedback to drivers on their driving behavior, particularly if they are driving safely. Neural Network performed better than SVM and RF in all four evaluation criteria as TP Rate: 0.968, FP rate: 0.045, Precision: 0.969 and F-Measure: 0.968.
  • Using Machine Learning predictive algorithms and continuous ETL from OBD of vehicle predicting fuel usage, condition of vehicle, etc.

Education

Bachelor of Technology - BTech - Electrical, Electronics and Communications

Jaypee University of Engineering And Technology
Guna, Madhya Pradesh
06.2017 - 06.2021

Skills

    Advanced analytics

undefined

Accomplishments

  • Publication : Unbiased Mortality Prediction for Unbalanced Data using Machine Learning ( DOI: 10.1109/UPCON47278.2019.8980003)
  • This paper analyses the performance of different oversampling techniques on biased data to predict the death chances of a person while they stay at the hospital based on their medical record.


  • Kaggle Expert
  • Intel Edge AI Scholar
  • 3rd, Mortality Prediction Challenge – CodaLab

Timeline

Data Engineer

SelfDecode
06.2021 - Current

Data Scientist Intern

Taiyō.AI
02.2021 - 06.2021

Computer Vision Software Engineer Intern

KritiKal Solutions
12.2020 - 02.2021

Computer Vision Engineer Intern

MirragAI
12.2020 - 02.2021

Machine Learning Intern

Techimax IT Services Pvt Ltd
05.2020 - 10.2020

Machine Learning Intern

MToV Inc
05.2019 - 07.2019

Bachelor of Technology - BTech - Electrical, Electronics and Communications

Jaypee University of Engineering And Technology
06.2017 - 06.2021
Sumit TripathiData Engineer