Overview
Work History
Education
Skills
Researchpublications
Course Projects
Courses taken
Interests
Timeline
Generic

Mohit Srivastava

Hyderabad

Overview

4
4
years of professional experience

Work History

Senior Software Engineer - Data Engineering

Tiger Analytics
Hyderabad
01.2023 - Current
  • Engineered and optimized end-to-end data pipelines using Databricks and PySpark to transform raw data for advanced analytics.
  • Collaborated with fellow engineers on migrating workflows to Simpel 2.0, ensuring MARS standards compliance and enhancing data processing.
  • Automated ETL processes, data ingestion, and DAG creation using Python, SQL, and Azure DevOps, resulting in a 30% boost in efficiency.
  • Integrated CI/CD pipelines, managing PySpark scripts and whl files on Databricks clusters, speeding up release cycles.
  • Conducted post-release optimizations and backend reporting, reducing data discrepancies by 15% and improving data stability.
  • Led data quality initiatives through rigorous unit testing and validation, ensuring robust, error-free deployments.
  • Managed tasks from code lineage to pipeline monitoring and backlog enhancements

Software Engineer - Data Engineering

Tiger Analytics
Hyderabad
11.2021 - 12.2022

Project: McAfee Data Governance

  • Improved data quality by reducing junk data to below 5% through enhanced telemetry validations and comprehensive L3/L4 checks.
  • Implemented data validation rules within Great Expectations framework to enhance data integrity.
  • Configured new attributes and events for every release, optimizing data tracking and reducing errors.
  • Collaborated on a POC with the Data Engineering team, implementing data lineage solutions and refining data governance practices.
  • Boosted team's proficiency through targeted training

IT Intern

Boston Scientific
Gurgaon
01.2021 - 07.2021

Project: Adobe Experience Manager (AEM)

  • Analyzed and tested CMS components, enhancing system functionality and security.
  • Increased code coverage to over 60% by writing JUnit tests, enabling automated CI/CD pipelines.
  • Addressed security vulnerabilities and infrastructure issues, improving the post-deployment performance of global websites.

Education

B.Tech - Computer Science Engineering

Sikkim Manipal Institute of Technology
Sikkim
01-2021

Skills

  • Programming Languages:
    Python, Java, C/C, SQL, PySpark
  • Big Data & Cloud Platforms:
    Azure (Data Lake, DevOps), AWS (S3, Redshift), Databricks, Delta Lake, Hive, Spark
  • Data Engineering & ETL Tools:
    Databricks, PySpark, SQL, MySQL, SSMS, Azure DevOps, Bitbucket
  • Data Quality & Governance:
    Great Expectations, Telemetry Validation, Data Lineage, Data Quality Frameworks
  • Development Tools & Frameworks:
    Simpel Framework, Git, Pycharm, JIRA, JUnit, JSON, AEM 65, Eclipse
  • CI/CD & Version Control:
    Azure DevOps, Git, Bitbucket,

Researchpublications

Survey on Captcha Recognition Using Deep Learning, Mohit Srivastava et.al., 2020, Third international conference on computing and communication, Springer (AISC series), Sikkim, INDIA

Course Projects

  • Captcha Recognition using Deep Learning (2020): Researched and developed strategies to break Captcha tests using deep learning for improved automated test recognition.
  • Automatic Text Summarization (2019): Built a system to extract concise summaries from large text documents using NLP techniques.

Courses taken

  • Machine learning using python | SQL for Data Science | Business metrics for data driven companies | Deep learning for business

Interests

  • Skating , Driving, Sketching

Timeline

Senior Software Engineer - Data Engineering

Tiger Analytics
01.2023 - Current

Software Engineer - Data Engineering

Tiger Analytics
11.2021 - 12.2022

IT Intern

Boston Scientific
01.2021 - 07.2021

B.Tech - Computer Science Engineering

Sikkim Manipal Institute of Technology
Mohit Srivastava