Summary
Overview
Work History
Education
Skills
Certification
Projects
Work Availability
Timeline
Generic

Ananya

Haryana

Summary

Experienced Software Engineer with a proven track record of ingesting, processing and implementing data solutions. Proficient in various big data technologies like Pyspark, HDFS, Hive and tools like Azure Data factory, Azure Kafka. With proficiency in developing CI/CD pipelines, I am adept at managing and analyzing large datasets to drive business insights.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Data Engineer

Zscaler
06.2022 - Current
  • Created a cloud-first data ingestion that improved processing speed of data by 74%, used Python, SQL, and PySpark to collaborate with 2 interns and a senior engineer
  • Designed and implemented a real-time data pipeline to process semistructured data by integrating 150 million raw records from 30+ data sources using Kafka and PySpark
  • Automated ETL processes across billions of rows of data, which reduced manual workload by 29% monthly
  • Worked with client to understand business needs and translate those business needs into actionable reports in Tableau, saving 17 hours of manual work each week.
  • Drafted technical documentation for internal business areas and processes, incorporating factors such as technical design, data manipulation, ETL and storage management.

Associate Software Engineer

Hughes Systique
05.2021 - 05.2022
  • Used PySpark to distribute data processing on large streaming datasets, improving ingestion and speed by 67%
  • Communicated with Product Management to understand needs, and translated their feedback into actionable reports in Tableau, saving 46 hours of manual work each month
  • Maintained a complaint resolution rate of 93% by resolving customer complaints.
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Improved data collection methods by designing surveys, polls and other instruments.

Education

BTECH CSE -

MAHARAJA SURAJMAL INSTITUTE OF TECHNOLOGY
07.2021

Skills

  • Python frameworks- Numpy, Pandas,Matplotlib, Flask
  • Programming languages- Mysql
  • Bigdata Tools/Process - HDFS, Hive, Spark,ETL,Apache Kafka,Apache Data Factory,Apache Data lake
  • Software - Microsoft Excel,Tableau
  • Platforms - GITHUB,Bitbucket
  • AWS
  • LINUX

Certification

  • Google Analytics Certification
  • AWS Certified Cloud Practitioner
  • Udemy Certificate : Complete Linux Training Course

Projects

RE - CHARGING SYSTEM, 06/2020, 09/2020, Conducted comprehensive analysis of COVID-19 impact on billing and charging services for the company, including invoice and SMS services. Built web Scrapper in python to acquire data from the websites and built an ETL. Managed to achieve 1st position out of all 15 projects.

Work Availability

monday
tuesday
wednesday
thursday
friday
saturday
sunday
morning
afternoon
evening
swipe to browse

Timeline

Data Engineer

Zscaler
06.2022 - Current

Associate Software Engineer

Hughes Systique
05.2021 - 05.2022

BTECH CSE -

MAHARAJA SURAJMAL INSTITUTE OF TECHNOLOGY
Ananya