Summary
Overview
Work History
Education
Skills
Affiliations
Timeline
Generic

Raghuraman Kaliyaperumal

Lead Data Engineer
Guadalajara,

Summary

Cloud Data Engineer with 8 years of experience in IT having extensive skills in Google Cloud Platform and its components involved in developing the ETL pipelines to ingest and transform the data for decision analytics.

Overview

8
8
years of professional experience
3
3
Languages

Work History

Cloud Data Engineer

Tata Consultancy Services
05.2020 - Current

Client: The Home Depot Inc, USA


  • Responsible for developing, testing and implementing the data streaming pipelines in Jenkins through ETL/ELT method which extracts the data through Sqoop.
  • Expert in analyzing the tables volume, frequency of updates, identifying the keys, handling the large datasets through partitions and writing the efficient joins to increase the query performance and optimize the slot consumption in BigQuery.
  • Consumed streaming JSON data (Receipts, Inbound and Outbound transfers) in Cloud Pubsub topic and added the subscription to write the messages directly into GBQ tables and extracted the required fields alone to load in target and expose it to the views.
  • Retirement of App Engine instances for cost reduction and migrated the code to cloud function.
  • Exporting the batch data to GCS bucket from HDFS to load the data in the landing table and transforming in the stage layer that involves transformations, cleansing etc.. to push the final data to target.
  • Designed the logic for complex validations pertaining to PO and landed cost metrics to validate the data between source and target systems and also developed a python script for SOX compliance validations with Pandas and Google cloud packages.
  • Developed and owned approximately 900 pipelines and also provided the production support to find the root cause by tracing the BQ logs and deployed the fix accordingly.
  • Built the AtScale data models and mapped the dimensions and Metrics with the necessary attributes for cube reporting.
  • Scheduling the jobs through IBM Tivoli (Maestro) that invokes the jenkins pipeline to complete all the ETL stages.

Data Engineer

Tata Consultancy Services
01.2019 - 04.2020

Client: Ascena Retail Group, USA


  • Successfully completed the migration of CRM data from MS-SQL Server to Hive and built a customer profile model by creating a persona for each attributes of the customer and their transactions happened in Ecommerce and Stores.
  • Integration of Hive Queries in Spark using Spark SQL.
  • Driving the daily, weekly scrum calls and coordinated with onshore team for getting the business use case to do the transformations for Lane Bryant, Catherine, Justice promotions and campaign data.
  • Played a major role in Integration for a juncture and implemented CICD which eventually reduced the time and effort for integration, deployment and saved enormous effort.

Product Test Engineer

Vxceed Software Solutions
12.2017 - 01.2019

Client: Unliever (Malaysia, Thailand, Philippines, Indonesia)


  • Point of Contact for Xnapp Mobility Solution for 4 countries.
  • Analysing & Delivering the Functional Requirements as per business demand
  • Takes care of most of the activities that fall under the SDLC Cycle.
  • Writing Test cases for all the CRs before go-live and getting the UAT approval from customer.
  • Integrating the application configuration changes and validating the data in SQL Server tables related to Master data and presales store orders.
  • Performing the sanity testing on every development release.
  • Giving walkthrough to the downstream application teams and stakeholders that involves taking orders, applying different types of promotions for each SKU with calculations and syncing the order for EOD.

Education

Bachelors Degree - Electronics And Communication Engineering

Sri Sairam Engineering College

Skills

Tech Stack: Data Engineering

Languages: SQL, Python, Groovy

Job Scheduling and Orchestration: Jenkins, IBM Tivoli, Airflow

Version Control: Git

Database: SQL Server, IBM DB2, PostgreSQL

IDE: Jupyter Notebook, Spyder, PyCharm

Data Warehouse: BigQuery, Hive, Teradata

BI Tools: AtScale, Excel

GCP Native Components: GCS Bucket, BigQuery, Pubsub, Cloud Function, App Engine, IAM

Affiliations

  • Special Initiative Award (1) - Issued by TCS
  • On-Spot Award (3) - Issued by TCS
  • Star of the Month (1) - Issued by TCS
  • Star Award for Entrepreneurial Spirit (1) - Issued by The Home Depot.

Timeline

Cloud Data Engineer

Tata Consultancy Services
05.2020 - Current

Data Engineer

Tata Consultancy Services
01.2019 - 04.2020

Product Test Engineer

Vxceed Software Solutions
12.2017 - 01.2019

Bachelors Degree - Electronics And Communication Engineering

Sri Sairam Engineering College
Raghuraman KaliyaperumalLead Data Engineer