Summary
Overview
Work History
Education
Skills
Certification
Data Engineering Projects
Accomplishments
Software
Personal Information
Languages
Data Science Projects
Timeline
Generic

Dinesh Raut

Pune

Summary

Most recently, I worked as a Lead Engineer in a Data Analytics company at Hyderabad. I am looking for better opportunities in the field. I have 3 years of experience in Databricks, 4 years in Postgre-SQL and 5 years in Python. I also have 2 years of experience in Azure Data Factory and Machine Learning. Before joining the IT industry, I have worked in academia as a research-fellow and as a lecturer. As a researcher of Cosmology, I worked on problems that involved Data Analysis and Scientific Computing. The primary programming languages used were Python, C and Fortran. The projects that worked on, got me interested in Data Science and Big Data Analytics. I worked on some Data science projects on my own. I also pursued a course in Big Data Analytics from CDAC-Pune to facilitate the switch to the data analytics industry. I have published 3 papers in reputed Cosmology journals.

Overview

12
12
years of professional experience
18
18
years of post-secondary education
8
8
Certifications
3
3
Languages

Work History

Lead Engineer

Wissen Info Tech
02.2024 - 10.2024
  • I worked as a Lead Engineer in this company
  • This was more of a support project
  • Primary responsibilities involved working on incidents and enhancements
  • The technologies involved were Databricks, Postgre-SQL and pySpark

Senior Consultant

Capgemini
07.2023 - 10.2023
  • I worked as a senior consultant for Azure Data Analytics
  • The primary responsibilities involved using EXECL, SSMS, Databricks and ADF
  • The project also involved using GIT for repos and JIRA for tracking

Associate Big Data and EDW

Celebal Technologies
01.2022 - 06.2023
  • I worked as an Associate Big Data and EDW
  • The primary responsibilities were working on Databricks and ADF and using pyspark and SQL
  • The work also involved using Microsoft Excel and PowerPoint

Research Scholar

Tata Institute of Fundamental Research - NCRA
08.2014 - 02.2019
  • I worked on Cosmology (topics were Galaxies and Epoch of Reionization)
  • The work involved Scientific Computing, Data Analysis and Scientific Reporting
  • The programming languages used primarily were Python, C and Fortran
  • Python libraries that got mainly utilized were matplotlib, numpy and scipy
  • The work was done on computers with LINUX OS
  • I published one paper in Monthly Notices of Royal Astronomical Society while at the institute
  • I published 2 more papers (one in Astrophysical Journal and one in Frontiers of Astronomy and Space Sciences) in the field afterwards

Assistant Professor of Physics

Maharashtra Institute of Technology College of Engineering
06.2012 - 12.2013
  • I worked as an Assistant Professor of Physics and taught Engineering Physics to undergraduates

Education

Master of Science - Physics

Massachusetts Institute of Technology
09.2001 - 02.2005

B. Tech - Engineering Physics

IIT Bombay
08.1997 - 06.2001

HSC or Higher Secondary Certificate - Science

Maharashtra State Board
07.1995 - 06.1997

SSC or Secondary School Certificate -

Maharashtra State Board
06.1986 - 05.1995

Skills

DatabricksAzure Data FactoryADLSAWSPySparkPostgre-SQLETLPythonMachine LearningArtificial IntelligenceEXCELNLP

undefined

Certification

Databricks Certified Data Engineer Professional, 03/01/23

Data Engineering Projects

  • Post migration testing and support, 02/01/24 - 10/31/24, Monitored and supported the HR Data Lake activities of a client company. Worked on post migration file testing and enhancements using Pyspark, Databricks and SQL.
  • Building a unified platform for Data Analytics, 07/01/23 - 10/31/23, Involved in loading data from files to delta tables and consolidating data from different delta tables using ADF and Databricks notebooks.
  • Building an efficient framework for Reconciliation, 11/01/22 - 06/12/23, Tested the soundness of data migration from Teradata to Azure Databricks using SQL queries and PySpark.
  • Creating an optimal Data Analytics pipeline, 06/01/22 - 09/30/22, Created a complete data movement pipeline involving Azure Data Lake Storage, Azure Data Factory, Azure Databricks, Azure Synapse and PowerBI.

Accomplishments

National Talent Search Examination, Regional Mathematics Olympiad, National Eligibility Test (Physics) - Rank 1

Software

HEALPix, CosmoMC, AIPS

Personal Information

Languages

4,6,6

Data Science Projects

  • Genre Classification based on movie description, 2021, Processed movie data using pySpark and applied various Machine Learning algorithms to classify genres.
  • IEEE-CIS Fraud Detection competition on Kaggle, 2020, Segregated fraudulent credit card transactions using a novel approach and achieved a score of 0.75.

Timeline

Lead Engineer

Wissen Info Tech
02.2024 - 10.2024

Senior Consultant

Capgemini
07.2023 - 10.2023

Associate Big Data and EDW

Celebal Technologies
01.2022 - 06.2023

Research Scholar

Tata Institute of Fundamental Research - NCRA
08.2014 - 02.2019

Assistant Professor of Physics

Maharashtra Institute of Technology College of Engineering
06.2012 - 12.2013

Master of Science - Physics

Massachusetts Institute of Technology
09.2001 - 02.2005

B. Tech - Engineering Physics

IIT Bombay
08.1997 - 06.2001

HSC or Higher Secondary Certificate - Science

Maharashtra State Board
07.1995 - 06.1997

SSC or Secondary School Certificate -

Maharashtra State Board
06.1986 - 05.1995
Dinesh Raut