Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Pooja Sandesh Gunjawate

Senior Data Engineer | Senior Big Data Engineer
Pune

Summary

Data Engineer at J.P. Morgan Chase & Co. with over 10 years of extensive experience in PySpark, Neo4j, Spring Boot, and AWS. Demonstrates a strong commitment to excellence and a results-oriented approach, consistently delivering high-quality solutions in fast-paced environments. Effective interpersonal skills facilitate collaboration with cross-functional teams, driving projects to successful completion. Passion for tackling challenges and dedication to continuous learning ensure mastery of emerging technologies and industry trends.

Overview

12
12
years of professional experience
1
1
Certification

Work History

Big Data Engineer

J.P. Morgan Chase & Co.
09.2021 - Current
  • Crescendo acquires company, contact, and investor data from third-party providers such as Bloomberg, Factset, Zoominfo etc. This raw data undergoes matching to create a unified, high-quality golden records that are then exported to Neo4j. Additionally, a Spring Boot application is deployed on top of this infrastructure, providing endpoints for efficient consumption of this data.
  • Project: Crescendo
  • Client: J.P. Morgan Chase & Co.
  • Role: Big Data Engineer
  • Technology Stack: Python, Spark, Spring-boot, SQL, Neo4j, Oracle RDS, Neo4j Cypher Query Language, Bitbucket, Dev-shell, Putty, Jenkins, Spinnaker, IntelliJ IDEA, Maven, Pytest, Mockito
  • Cloud services: AWS-EMR, AWS-S3, AWS-EC2, AWS-ECS, AWS-IAM, AWS-Step Functions, AWS-Glue, AWS-Lambda, RDS, SQS, API Gateway, AWS-Glue, Athena.
  • Responsibilities:
  • Built Extract, Transform, Load Pipelines that process over 600 million records of data using Pyspark.
  • Set up and configured AWS Services such as EC2 instances for API & Neo4j database hosting, Elastic Map Reduce to run the ETL pipelines, S3 buckets for data storage, Lambda functions, and Auto scaling groups using Terraform.
  • Built Spring Boot APIs to provide data from a Neo4j graph database using Cypher Query Language. This data is used by around 30,000 bankers in the firm to fetch information and provide insights.
  • Setup deployment pipelines using Jenkins and AWS CodeDeploy. Also wrote Bash script to automate the pipelines. Recently started working on migration from Jenkins to Spinnaker for CD.
  • Proactively addressed potential bottlenecks in the ETL process through regular monitoring, enabling seamless workflow operations.
  • Automated routine tasks through scripting languages, reducing manual effort and human error risks.
  • Increased operational efficiency by automating repetitive tasks using Python scripts, allowing focus on higher-priority projects.
  • Collaborated with cross-functional teams to determine business requirements and translate them into functional specifications.

Big Data Developer

Tata Consultancy Services
03.2018 - 08.2021
  • Nielsen study consumers in more than 100 countries to give the most complete view of trends and habits worldwide. Nielsen understands the importance of knowing what consumers watch and buy and that's their passion and the very heart of Nielsen business.
  • Project: DAR Harmonization
  • Client: Nielsen
  • Technology Stack: Java, SQL, Scala, Hadoop, Spark, Airflow, Pig, Hive, Bitbucket, Putty, IntelliJ IDEA, Maven.
  • Cloud services: AWS-EMR, AWS-Lambda, AWS-S3, AWS-RDS, AWS-EC2, Auto Scaling Groups, Security Groups
  • Role 1: Big Data Developer
  • Worked on client's product, custom report generator, which is used to generate customized reports, that contains information about digital advertisement ratings of advertising campaigns published on various digital platforms such as e-commerce websites, social networking sites and OTT platforms. The product was a system, which extracts and transforms data based upon report type and parameters given by end users and provides them the required information in the form of reports.
  • Responsibilities:
  • Optimized performance of Spark applications.
  • Developed Spark applications using Scala for batch processing leveraging Spark SQL, Java RDD, Dataset, and DataFrame.
  • Created Directed Acyclic Graphs (DAGs) with Python to manage tasks, dependencies, SLA monitoring, and time sensors using Apache Airflow for workflow automation.
  • Proficient in AWS services including EC2, S3, Auto Scaling Groups, Security Groups, CloudFormation, and EMR.
  • Monitored production jobs using Hue, analyzed logs for failures, identified unhandled exceptions, bugs, configuration flaws, compatibility, and version issues.
  • Developed unit test cases utilizing the JUnit framework.
  • Role 2: Web Application Developer
  • Developed client's tool, Custom Report Management Tool, which is used to submit customized reports on oozie console using UI. By Using this tool client is able to create, update and monitor workflow using UI.
  • Responsibilities:
  • Developed Custom Web application for submitting Custom workflows on Oozie console.
  • Migrated tool from Spring MVC framework to Spring RESTful web services.
  • Application is hosted in AWS ECS Service as Docker container-based application.
  • Documented REST API using Swagger tool.

Nielsen
01.2017 - 04.2019
  • The project deals with data science and analytics that collects the user’s data using Netsight meter software that is installed on user’s devices, this collected data is sent to crediting team for further optimization and report generation.
  • Project: Netsight Meter
  • Client: Nielsen
  • Technology Stack: Java, Spring Boot, Junit, Git-bash, MySQL Server, Eclipse IDE, Apache Tomcat Server, GitHub, Maven
  • Responsibilities:
  • Contributed to the development of a Control API utilized by various teams for executing meter-related tasks.
  • Authored JUnit test cases for the developed code, employing Mockito for mocking dependencies.
  • Collaborated on improving and maintaining automation scripts used by the QA team for regression testing.
  • Enhanced the existing automation framework by integrating tools and languages such as Java, Ruby, and Accessibility Inspector.

Nokia
03.2014 - 10.2016
  • ULTIMA (User License Tool Installation Management Application) application is used by Nokia to manage all tool related information and provides user access rights to the Applications and Tools. It has four main working areas i.e. User Access Management, Product management, Portfolio Management, Release Management.
  • Project: ULTIMA
  • Client: Nokia
  • Technology Stack: Java, Spring, Struts, Hibernate, MySQL server, GitHub, Eclipse IDE, Maven, Apache Tomcat Server.
  • Responsibilities:
  • Involved in development of client required features of the project.
  • Responsible for End-to-end code deployment on different server (i.e. Development, QA, Production Server) and environment (Linux).
  • Check and resolve application environment issues on Linux system

Education

Bachelor of Engineering - Electronics & Telecommunication

University Of Pune
01.2013

Higher Secondary Certificate - Science

Jai Hind Junior College
Pune

Secondary School Certificate - undefined

Jai Hind High School
Pune

Skills

Proficient in AWS

Data analysis

System development

Apache Kafka

Certification

AWS Certified Cloud Practitioner

Timeline

Big Data Engineer

J.P. Morgan Chase & Co.
09.2021 - Current

Big Data Developer

Tata Consultancy Services
03.2018 - 08.2021

Nielsen
01.2017 - 04.2019

Nokia
03.2014 - 10.2016

Secondary School Certificate - undefined

Jai Hind High School

Bachelor of Engineering - Electronics & Telecommunication

University Of Pune

Higher Secondary Certificate - Science

Jai Hind Junior College
Pooja Sandesh GunjawateSenior Data Engineer | Senior Big Data Engineer