Summary
Overview
Work History
Education
Skills
Projects
Timeline
Generic

Aayushi Jha

New Delhi

Summary

Experienced Data Engineer with 7 years of expertise in developing scalable data pipelines, crawling, and web scraping systems. Proficient in Python (Scrapy, Selenium, Playwright), AWS (ECS, Lambda, S3), and relational databases, with hands-on experience in designing and optimizing data processing workflows. Skilled in automating data workflows, building efficient data pipelines, and implementing CI/CD pipelines to streamline testing, deployment, and data transformation. Proven ability to deliver high-quality, data-driven solutions that enhance operational efficiency and support real-time analytics.

Overview

2025
2025
years of professional experience

Work History

Senior Software Engineer

Bitwise Solutions Pvt Ltd
Pune, India
1 1 - Current
  • Collaborated with Gartner clients to deliver data-driven insights by gathering requirements, translating complex analytics into actionable business solutions, and addressing diverse stakeholder needs through effective data modeling, transformation, and crawling techniques.
  • Designed and developed scalable data crawling frameworks and pipelines, leveraging AWS technologies (ECS, Lambda, API Gateway), automating data collection and processing, reducing manual handling by 50%, and improving decision-making accuracy.
  • Engineered a browser tool simulating human behavior to bypass site-blocking measures, improving data crawling and extraction efficiency by 30%, and ensuring continuous, uninterrupted data availability for clients.
  • Automated LinkedIn Sales Navigator operations for data extraction and lead generation, developing Python scripts for scraping profiles, identifying target personas, and extracting market insights, increasing client outreach efficiency by 20%.
  • Optimized and maintained data pipelines and crawling systems using AWS services and Python libraries, enhancing processing speed and reducing latency by 40% for real-time analytics, enabling efficient handling of large datasets.
  • Streamlined project workflows using Jira for tracking and prioritization, ensuring timely completion of deliverables, and reducing project turnaround time by 15% through effective issue tracking and prioritization.

Senior Software Engineer

Springbord
New Delhi, Delhi
08.2022 - 06.2023
  • Built a Python-based system to extract structured data from PDF invoices, significantly reducing manual data entry by 70%.The extracted data was efficiently stored in Excel, streamlining the data processing workflow.
  • Developed and maintained automation frameworks for extracting data from multiple sources. These frameworks ensured a steady flow of high-quality, reliable data, essential for analysis and reporting.
  • Implemented best practices in data cleaning, transformation, and normalization, boosting data integrity and enhancing processing speed by 30%, which enabled accurate and timely analytics.
  • Worked closely with DevOps teams to establish CI/CD pipelines, enhancing deployment processes and enabling rapid iteration cycles. This collaboration was instrumental in consistently meeting project deadlines.

Senior Analyst

Course5i
Gurugram, Haryana
09.2021 - 08.2022
  • Conducted in-depth consultations with multiple clients to understand their unique business challenges, analyzing requirements to deliver tailored, data-driven solutions that supported informed decision-making.
  • Designed and implemented scalable web scraping tools to extract data from diverse online sources, ensuring high standards of data integrity and accuracy by integrating automated validation checks.
  • Leveraged advanced data extraction techniques to capture, cleanse, and periodically store large datasets, maintaining data relevance and quality for accurate analysis across various client projects.
  • Worked closely with data science and engineering teams to align data strategies with client goals, enhancing project outcomes through regular feedback loops and cross-functional expertise.


SDE-1

Smile Internet technologies Pvt Ltd
Gurugram, Haryana
02.2020 - 08.2021
  • Designed and implemented a scraping engine architecture using Scrapy, Selenium, and BeautifulSoup to extract data from multiple websites for an in-house product.
  • Developed a scalable data processing pipeline to efficiently extract, clean, and store data.
  • Conducted sentiment analysis using NLP and machine learning techniques to gain insights into customer sentiment and behavior, and utilized natural language processing to analyze and categorize customer feedback for product improvement.

Python Developer

Head Field Solutions Pvt. Ltd.
Noida, Uttar Pradesh
08.2017 - 01.2020
  • Developed over 100 scrapers using Python, Requests, BeautifulSoup, and Selenium for the company's inbuilt tool, Intellisent.
  • Worked on sentiment analysis of data from various social channels and web forums, employing NLP techniques to identify and extract opinions within text.
  • Developed frameworks for automating and maintaining a constant flow of data from multiple sources, ensuring efficient storage in databases.

Education

Master of Computer Applications - MCA - Computer Application

Guru Gobind Singh Indraprastha University
New Delhi

Bachelor of Computer Application - Computer Application

Guru Gobind Singh Indraprastha University
New Delhi

Skills

  • Web scraping
  • Python
  • SQL
  • AWS
  • Selenium
  • Scrapy
  • BeautifulSoup
  • Playwright
  • AWS ECS
  • Data Automation
  • Pandas
  • Numpy
  • FlaskApi
  • CI/CD Pipelines
  • Data Transformation
  • AWS Lambda

Projects

E-commerce Product Price Monitoring System
  • Designed and implemented a robust web scraping system to track product prices, descriptions, and availability across leading e-commerce platforms, enabling real-time competitive analysis.
  • Leveraged Python libraries (BeautifulSoup, Scrapy, Selenium) to extract dynamic product data, overcoming JavaScript-rendered content and anti-scraping measures.
  • Developed an end-to-end data pipeline to clean, store, and analyze data for real-time pricing trends and competitor strategies.
  • Created PowerBI dashboards to visualize historical trends, seasonal fluctuations, and competitor pricing strategies, empowering data-driven decision-making for market positioning.
Project: Customer Sentiment Analysis Dashboard
  • Led a project to analyze customer sentiment across social media platforms by extracting and processing user-generated content.
  • Automated data collection with custom bots, ensuring a steady flow of real-time data for sentiment analysis.
  • Applied NLP techniques to classify feedback into sentiment categories, uncovering actionable insights to enhance customer engagement.
  • Integrated predictive insights into an interactive PowerBI dashboard, enabling teams to track sentiment trends and improve customer-centric decision-making.

Timeline

Senior Software Engineer

Springbord
08.2022 - 06.2023

Senior Analyst

Course5i
09.2021 - 08.2022

SDE-1

Smile Internet technologies Pvt Ltd
02.2020 - 08.2021

Python Developer

Head Field Solutions Pvt. Ltd.
08.2017 - 01.2020

Senior Software Engineer

Bitwise Solutions Pvt Ltd
1 1 - Current

Master of Computer Applications - MCA - Computer Application

Guru Gobind Singh Indraprastha University

Bachelor of Computer Application - Computer Application

Guru Gobind Singh Indraprastha University
Aayushi Jha