Pravallika Malepati

Bangalore

Summary

Experienced IT professional with 6.1 years of expertise in AWS technologies such as AWS Glue, AWS Athena, AWS Lambda, AWS Redshift, and AWS S3, with a strong background in Informatica PowerCenter, IICS, and Oracle.

Overview

6 years of professional experience

Work History

Data Engineer

Schneider Electric
Bangalore
05.2022 - Current

• Designing, developing, and implementing AWS Glue jobs to extract, transform, and load (ETL) data from various sources into a data warehouse.

• Writing and optimizing ETL scripts using Apache Spark and PySpark within AWS Glue to process large-scale datasets.

• Configuring and managing the AWS Glue Data Catalog, including defining schemas, tables, and partitions.

• Working with other AWS services such as Amazon S3, Amazon Redshift, AWS Lambda, and AWS CloudFormation to build end-to-end data solutions.

• Designed, developed, and supported Extraction, Transformation, and Load (ETL) processes for data migration using Informatica PowerCenter, IICS, AWS Redshift, and Oracle.

• Designing and implementing Glue crawlers to automatically discover and catalog data in various formats, enabling efficient querying and analysis with Athena.

• Using the Redshift UNLOAD command to export data from Redshift to external storage such as S3 for further processing or archiving, and the COPY command to load data back into Redshift.

• Monitoring data pipelines and systems using AWS CloudWatch.

• Implementing serverless event-driven architectures using AWS Glue triggers and AWS Lambda to automate data processing tasks based on events or schedules.

• Orchestrating and scheduling data processing tasks and dependencies using AWS Glue Job Scheduler or external workflow orchestration tools like AWS Step Functions.

• Writing PySpark code to perform data transformations, aggregations, joins, and filtering operations on large-scale datasets.

Data Engineer

Cognizant Technology Solutions
Bangalore
01.2018 - Current

• Created AWS Glue jobs to extract, transform, and load (ETL) data from various sources into a target database or data warehouse.
• Wrote and optimized ETL scripts using Apache Spark and PySpark within AWS Glue.
• Designed, developed, and supported Extraction, Transformation, and Load (ETL) processes for data migration using Informatica PowerCenter, IICS, AWS Redshift, and Oracle.
• Created various mappings using Mapping Designer and utilized transformations such as Aggregator, Lookup, Filter, Router, Joiner, Source Qualifier, Expression, Stored Procedure, Sorter, and Sequence Generator.
• Designed and developed Informatica Mappings and Sessions based on business user requirements and business rules to load data from different sources like flat files and RDBMS tables to target tables.
• Used the Redshift UNLOAD command to export data from Redshift to external storage such as S3 for further processing or archiving, and the COPY command to load data back into Redshift.
• Applied partitioning and bucketing concepts in Hive and designed both managed and external tables to optimize performance.
• Developed complex mappings involving Slowly Changing Dimensions (SCD1 and SCD2) and implemented business logic.
• Conducted performance tuning in Redshift and Informatica, identifying and resolving performance bottlenecks and optimizing indexes in Oracle and Informatica.
• Validated and ensured data quality by performing data profiling, data validation, and data cleansing in Athena.
• Optimized queries by applying sort and distribution keys on Redshift tables.
• Performed performance tuning and space optimization for queries.

Education

B.E - Computer Science And Engineering

R.M.D Engineering College
Chennai
04-2017

Intermediate - MPC

Narayana Educational Institutions
Nellore
04-2013

SSC - General

Sri Geethanjali
Nellore
04-2011

Skills

Technical Skills

AWS Services: AWS Glue, AWS S3, AWS Athena, AWS Lambda, AWS Redshift, Redshift Spectrum

ETL Tools: Informatica PowerCenter 10.4.1, IICS

Databases: Redshift, Oracle 11g, PostgreSQL

Spark, Sqoop, Hive, Data Warehousing

Accomplishments

  • Consistent high performer, recognized with 2 promotions in a span of 4 years at Cognizant.
  • Ranked in the top 1 percentile in board examinations at the +2 level.
