Data Engineer with over 4 years of experience in building and optimizing scalable data pipelines and data warehouses. Skilled in ETL development, automation, and data warehousing, I have successfully designed and implemented over 100 data pipelines and more than 90 database objects using Azure Data Factory (ADF) and Snowflake. Proficient in Agile methodologies, utilizing Azure DevOps to drive efficient project execution. Certified as a Databricks Certified Associate Developer for Apache Spark 3.0, I am passionate about exploring emerging technologies, and continuously advancing my expertise in data engineering.
Technologies: Azure Data Factory, ADLS, Snowflake, SQL Server, Databricks, PySpark, Azure HDInsight, Hive.
Nuclear radiation fallout range prediction - https://github.com/chetan-ade/FYP
Developed a Web Application that predicts and simulates the flow of nuclear clouds in the atmosphere in case of a nuclear accident. Obtained 86% accuracy in prediction of Wind Speed and Directions using RNNs.
Project Domains/Tech Stack: Machine Learning, Soft Computing, Data Visualization, Python, Flask, HTML, CSS, Javascript
Stack Overflow analysis - https://colab.research.google.com/drive/1N02GQCRhidmeK0A3kXBvP8cA5lj2hfna
Analysis of StackOverflow data to find frequent queries, current trends and association amongst programming languages.
Technologies/Tools/Algorithms: Python, Google BigQuery, Numpy, Pandas, Matplotlib, Apriori, Association Rules