
Aakash Mahawar

Pune

Summary

Working professional with 7+ years of experience in business analytics, data engineering, data management, and automation

Overview

7 years of professional experience
1 Certification

Work History

DATA ENGINEER

Pepper Advantage
08.2023 - Current
  • ETL/ELT Pipeline Development: Designed and implemented automated ETL/ELT workflows to extract data from various sources (websites, databases, APIs, third-party vendors), transform it using dbt, and load it into Snowflake for downstream analytics and reporting
  • Data Source Identification: Conducted extensive research and analysis to identify and evaluate internal and external data sources, including APIs, databases, and relevant platforms, ensuring alignment with organizational objectives
  • Data Gathering and Automation: Developed and maintained automation scripts to extract data efficiently, leveraging APIs and automated tools to streamline data acquisition processes
  • Data Validation and Quality Assurance: Implemented robust data validation frameworks using tools like Great Expectations and performed data profiling to ensure data accuracy, completeness, and reliability
  • Conducted data cleansing and normalization to maintain high-quality data standards
  • Data Privacy and Compliance: Ensured compliance with legal and regulatory requirements (e.g., GDPR) for data acquisition, storage, and processing
  • Maintained strict adherence to data privacy and security standards
  • Cross-Functional Collaboration: Partnered with data scientists, data analysts, and business stakeholders to gather data requirements, provide curated data sets, and enable data-driven decision-making
  • Continuous Improvement: Regularly optimized ETL/ELT processes, evaluated emerging tools and technologies, and implemented best practices to enhance pipeline efficiency, scalability, and data quality
  • Skills: Python, Data Acquisition, Scrapy, Playwright, Selenium, Automation, Machine Learning (Sentiment Analysis, CAPTCHA Solver), GCP Cloud Server, Git Bash, Snowflake, Redis, Airflow, AWS (Lambda, ECR, CLI, CloudWatch, API Gateway, ALB), FastAPI, Docker, Streamlit, dbt, Kafka, RabbitMQ, Tableau

DATA ENGINEER

DAT-A-CCURATE
07.2022 - Current
  • Data Acquisition and Management: Skilled in acquiring and maintaining databases from primary and secondary data sources
  • Expertise in collecting and analyzing text/data from websites and APIs using custom code and regular expressions
  • Data Quality Assurance: Proficient in ensuring the quality, accuracy, and completeness of data using tools like Great Expectations and conducting data profiling
  • Data Visualization and Analysis: Experienced in working with large datasets to derive actionable insights and developing interactive Tableau dashboards
  • Research and Sourcing: Adept at conducting sourcing and research for list-building projects, ensuring high-quality deliverables
  • Machine Learning Development: Expertise in designing and implementing custom machine learning models and algorithms, including classification, regression, sentiment analysis, and look-alike modeling
  • Database Management: Experienced in maintaining and protecting electronic databases, utilizing tools like Beautiful Soup, Scrapy, and Selenium, and working with MongoDB, MySQL, and Redis
  • Skills: Python, Scrapy, Data Acquisition, Machine Learning (Classification, Regression, Sentiment Analysis, Look-Alike Modeling), MongoDB, MySQL, Redis, Tableau, Docker, AWS (DynamoDB, Cloud9), Flask

BUSINESS DATA ANALYST

Wingify
04.2020 - 07.2022
  • Data Cleaning and Preparation: Skilled in cleaning and updating data for end-user analysis, ensuring enhanced quality, completeness, and consistency across datasets
  • Reporting and Quality Control: Proficient in leveraging standard reporting technologies to create, manage, and maintain reports, while adhering to rigorous quality control processes
  • Process Optimization: Experienced in reviewing checklists for process adherence and collaborating with specialists and stakeholders to address errors and inconsistencies effectively
  • Web and Mobile Data Extraction: Expertise in web scraping using Beautiful Soup, Scrapy, Splash, Selenium, and Flask, alongside mobile app scraping using Appium
  • Skilled in Python automation, API integration, and interaction with MySQL DB and AWS services
  • Tools and Platforms: Proficient in tools such as Salesforce Lightning, LinkedIn Sales Navigator, Tableau, BuiltWith, seamless.ai, adapt.io, Lusha, Investogain, Crunchbase, Similarweb, SimilarTech, proofy.io, ahrefs.com, and semrush.com
  • Leadership: Successfully led and managed a team of 3-4 analysts, ensuring timely delivery of high-quality outputs
  • Skills: Python, Scrapy, Data Acquisition, Salesforce, MySQL, AWS (EC2, S3, RDS, Cloud9), Tableau, Flask, Automated Tools, Team Management

DATA MANAGEMENT ANALYST

Baromeeter
11.2017 - 03.2020
  • Data Management and Analysis: Proficient in managing and analyzing data, including conducting secondary research tasks such as company profiling and rapid research using platforms like LinkedIn, ZoomInfo, D&B Hoovers, RocketReach, Anymailfinder, and Bloomberg
  • Dashboard Monitoring and Data Extraction: Skilled in monitoring dashboards to extract raw data, transforming it into actionable insights for decision-making
  • Advanced Excel Expertise: Adept at utilizing Advanced Excel for deep data analysis, uncovering trends, and generating meaningful insights
  • Management Information System (MIS) Reporting: Experienced in creating comprehensive MIS reports on a daily, weekly, and monthly basis to meet project reporting requirements efficiently
  • Skills: Secondary Research, MS Excel, Google Sheets, Data Analysis

Education

Master of Technology - Instrumentation & Control

Dr B R Ambedkar National Institute of Technology
Jalandhar, Punjab
08.2014

Bachelor of Engineering - Electronics & Communication

University of Rajasthan
Jaipur, Rajasthan
08.2009

Skills

  • Python
  • Data Acquisition
  • Scrapy
  • Playwright
  • Selenium
  • Automation
  • Machine Learning
  • Git Bash
  • Snowflake
  • Redis
  • Airflow
  • AWS
  • Flask/FastAPI
  • Docker
  • Streamlit
  • dbt
  • Kafka
  • RabbitMQ
  • Tableau
  • MongoDB
  • MySQL
  • Salesforce
  • Automated Tools
  • Team Management
  • MS Excel
  • Google Sheets
  • Data Analysis

Certification

  • The Complete dbt (Data Build Tool) Bootcamp - Udemy
  • Snowflake: The Complete Masterclass - Udemy
  • Data Engineering using SQL, Python, and PySpark - Udemy
  • Business Intelligence (Python, MySQL, Tableau) - Udemy
  • Modern Data Analysis using Python Programming - Udemy
  • Data Science Learning (R, Python) - Edwisor.com
  • Marketing Analytics (R Programming) - edX.org
  • Scrapy: Powerful Web Scraping & Crawling with Python - Udemy
  • PySpark with Databricks - Udemy

Languages

Hindi
English
