Murali Krishna

AWS DATA ENGINEER
Hyderabad

Summary

Experienced and certified AWS Data Engineer with 6.6 years of expertise in developing and optimizing ETL processes. Proficient in PySpark, AWS, SQL, and Python, delivering scalable and efficient data solutions. Demonstrated success in optimizing workflows, ensuring data quality, and collaborating with cross-functional teams to achieve business objectives. Built scalable and robust ETL processes on AWS for efficient data extraction, transformation, and loading. Developed and maintained data pipelines using AWS Glue, leveraging PySpark for data transformations and optimizing workflow performance.

Implemented and managed data lakes on Amazon S3, ensuring secure and efficient storage of large datasets. Delivered an event-driven system using AWS SNS and SQS for seamless communication and coordination with upstream systems. Utilized AWS Lambda for serverless computing, automating data processing tasks, improving overall system efficiency, and reducing manual effort. Integrated diverse data sources into cohesive pipelines, handling data from Amazon S3, the AWS Glue Data Catalog, and Amazon EMR. Collaborated closely with stakeholders to gather data requirements and translate them into effective Spark SQL queries and DataFrame operations.

Overview

7 years of professional experience
1 Certification

Work History

Senior Data Engineer

IBM
01.2023 - Current

EDP DATA LAKE DATA INTEGRATION HUB

Data ingestion and integration frameworks provide a standardized, easy, and reliable way for various Barclays departments to load their data into the data warehouse. The frameworks are built using enterprise ETL tools. They reduce coding complexity and the need for tool-specific knowledge for users, while letting applications use enterprise-level features such as security, data lineage, high performance, and high reliability. The result is a standardized, reusable solution for ingesting and integrating data from various sources at the bank into central data warehouses. The frameworks are used by Barclays UK, Cards and Payments, Corporate, Investment Bank, Group Risk and Technology, Legal, Fin-crime, HR, Compliance Tech, and Wealth.

• Collaborating with Stakeholders to gather requirements and understand data integration needs.

• Developing data integration using AWS services and defining integration patterns.

• Performance monitoring and optimization of data integration workflows.

• Orchestrating the ETL jobs using AWS Step Functions.

• Developing and maintaining ETL jobs using AWS Glue for data extraction, transformation, and loading.

• Developing scripts to define data transformation logic.

• Working with other roles to establish data partitioning, organization, and backup strategies.

Senior Software Engineer (Senior cloud data engineer)

CGI
06.2021 - 01.2023
  • Company Overview: Worked for CIGNA in the MOS (Monitoring Operating System) team.
  • Developed PySpark-based Spark scripts as per requirements.
  • Implemented Spark jobs using the DataFrame APIs and Spark SQL for faster data processing.
  • Extracted data from various data sources into AWS S3.
  • Maintained day-to-day data from different source systems in the data warehouse without lag.
  • Developed AWS Glue ETL jobs to transfer data from source systems to AWS Athena, perform transformations on the Athena tables, and then move the data from Athena to PostgreSQL.
  • Loaded the daily ingested data into Athena tables and created partitions to improve query performance.
  • Checked the PostgreSQL database to confirm that daily ingestion jobs succeeded, and debugged any failures.
  • Orchestrated the ETL jobs using AWS Step Functions.
  • Ran the analytics pipeline to aggregate and join multiple daily ingestion tables to meet stakeholder requirements.
  • Scheduled the jobs and transferred the final output data to the business team for reporting purposes.
  • Created the cluster group in Databricks for running the daily ingestion and analytics pipeline jobs.

Bigdata Engineer

Persolkelly Services
01.2018 - 07.2021
  • Created Hive managed and external tables based on requirements.
  • Implemented partitioning on Hive tables for better performance.
  • Installed, upgraded, and managed the Spark cluster in the Qubole data lake, collaborating with the infrastructure, network, and database teams.
  • Created tables in Athena and integrated them with Looker dashboards. In Looker, stakeholders create their own dashboards from the processed final output data to meet business requirements.
  • Deployed Oozie workflows using Jenkins automation.

Education

B.Tech - CSE

JNTU-K

Master of Science - CSE

VIT University
Chennai
04.2001 -

Skills

Cloud

Key Skills And Knowledge

AWS, AWS-Databricks, Python, Scala, SQL, EMR, Glue, Athena, Lambda, SageMaker, Airflow, SNS, SQS, Redshift, RDS, DynamoDB, Terraform, Docker, Jenkins, Git, Kubernetes, Windows, UNIX

Projects And Key Roles

  • IBM, Barclays Bank: Working as a senior data engineer. My role is to create a standardized, reusable solution for ingesting and integrating data from various sources at the bank into central data warehouses. The frameworks are used by Barclays UK, Cards and Payments, Corporate, Investment Bank, Group Risk and Technology, Legal, Fin-crime, HR, Compliance Tech, and Wealth.
  • CGI, CIGNA: Worked in the MOS (Monitoring Operating System) team. A Management Operating System (MOS) is the set of tools, meetings, and behaviors used to manage people and processes to deliver results. Developed PySpark-based Spark scripts as per requirements.
  • Persolkelly Services, Autodesk: Was part of the Big Data development team for DB. The project collected structured and semi-structured data from different sources and loaded it into ADP. After collecting the raw data from the source systems, ETL pipelines were run to cleanse the data and convert it to meet business needs.

Certification

AWS Certified Data Engineer - Associate

Timeline

Senior Data Engineer

IBM
01.2023 - Current

Senior Software Engineer (Senior cloud data engineer)

CGI
06.2021 - 01.2023

Bigdata Engineer

Persolkelly Services
01.2018 - 07.2021

Master of Science - CSE

VIT University
04.2001 -

B.Tech - CSE

JNTU-K