Summary
Overview
Work History
Education
Skills
Websites
Certification
Timeline
Generic

Adarsh Mahodaya

Pune

Summary

Postgraduate in Data Science with over 7 years of experience in Data Engineering and Analytics. Skilled in building end-to-end data pipelines on-premise and on the cloud, proficient in PySpark, Python, SQL, and Hadoop ecosystem. Experienced in team leadership, client interaction, and passionate about problem-solving. Seeking opportunities to leverage expertise and achieve organizational goals.

Overview

8
8
years of professional experience
1
1
Certification

Work History

Expert Developer

Birlasoft
12.2021 - Current
  • Currently employed at Birlasoft as expert developer for TMNAS account, amajor US-based insurance company. Initially started as independent contributor, engaging directly with client. Led development and presentation of multiple POCs to stakeholders, successfully transitioning them to production and expanding team resources by two members.
  • In Docusign Project, orchestrated end-to-end pipeline from data ingestion to curation and storage of Docusign envelope data. Transformed XML source data into JSON format using ADF, curated data with PySpark, and stored it in external Hive tables. Conducted thorough unit and integration testing using a diverse tech stack including ADF, Linux, PySpark, ARM template, Azure Devops, Hive, Oozie, HQL, MySQL, CDP, and Azure.
  • For the Policy Repository Project, led a team of 3 in developing an end-to-end data pipeline. Managed complex XML data ingestion into ADLS, data flattening, curation, and storage into external Hive tables. Utilized Python request module for API interactions and PySpark for metadata XML creation based on API call results. Conducted comprehensive unit and integration testing with a tech stack encompassing RDBMS, Linux, PySpark, ARM template, Azure Devops, Hive, Oozie, HQL, MySQL, CDP, Chron, and Sqoop.

Data Engineer

SugarBox Networks
12.2019 - 12.2021
  • Sugarbox Networks, a product-based company headquartered in Mumbai, managed approximately 500 servers nationwide. These servers transmitted various data types, including consumption, user, and health monitoring data, in JSON and standard logs to Kafka.
  • Leading a team of 5, I oversaw the development and maintenance of data pipelines and warehouses, generating insights and reports via Excel and dashboards for stakeholders and management.
  • In ETL development, I handled end-to-end pipeline creation, from raw log extraction to dashboard visualization, conducting rigorous unit and integration testing. Our tech stack included Kafka, Scala, Spark, Hive, Nifi, Oozie, MySQL, Apache Superset, and Grafana.
  • I also spearheaded the development of stored procedures and dashboards tailored to product and business needs, leveraging technologies like Oozie, Grafana, Sqoop, and MySQL. Additionally, I designed an alerting system using Bosun, Open TSDB, Python, and Excel to monitor server health metrics and coordinate resolutions with support teams.
  • Furthermore, I managed application monitoring and improvement, liaising with internal and external support teams to address performance issues, utilizing tools such as Ambari, HDFS, Hive, MySQL, and Kafka.

Project Engineer

Wipro Technologies
12.2015 - 11.2019
  • Played pivotal role in devising strategies to meet SLAs, effectively managing operations to achieve targets for stakeholders, managers, and SVPs at Mattel Inc.
  • Identified areas for efficiency improvement, implemented robust queue management solution to enhance customer satisfaction, and fostered strong relationships with corporate clients. Additionally, I adeptly handled all minor escalations and queries.
  • Led User Access Management team of 9, facilitating training sessions for new members and conducting knowledge transfers onshore and offshore.

Education

Post Graduate Diploma in Data Science -

IIIT Bangalore & UpGrad
Banglore
09.2019

B.E - Computer Science -

Shri Ramdeobaba College of Engineering And Management
Nagpur
05.2015

Skills

  • PySpark
  • Python
  • SQL
  • Unix, linux, chronologically
  • Hadoop, HIVE
  • ETL & Data Engineering
  • Logic Building and Feature Engineering
  • Linear/Logistic Regression
  • Pandas, NumPy
  • Rest API, Postman, Python requests
  • Kafka, Nifi
  • Azure (ADLS, ADF, Azure Devops, ARM Template)
  • CDP, Data Warehouse, Data Lake
  • Process Improvement
  • Leadership & Training
  • Strong & Effective Communication Skills

Certification

Data Science Certification, 12/17, Completed a three-month certification in Data Science, during which I have learnt basics of Data science, Big Data and Hadoop. For practical, have performed complex queries on real time data using pig, hive, spark.

Timeline

Expert Developer

Birlasoft
12.2021 - Current

Data Engineer

SugarBox Networks
12.2019 - 12.2021

Project Engineer

Wipro Technologies
12.2015 - 11.2019

Post Graduate Diploma in Data Science -

IIIT Bangalore & UpGrad

B.E - Computer Science -

Shri Ramdeobaba College of Engineering And Management
Data Science Certification, 12/17, Completed a three-month certification in Data Science, during which I have learnt basics of Data science, Big Data and Hadoop. For practical, have performed complex queries on real time data using pig, hive, spark.
Adarsh Mahodaya