Summary
Overview
Work History
Education
Skills
Timeline
Generic

Rudra Prasad Barik

Data Architect
BANGALORE

Summary

  • Overall 12+ years of IT experience which includes 4+ Years of experience as data architect, performing Designing solutions on AWS Cloud platform using Informatica PC/IICS, Python, spark , Qlik Replication, Elastic search and Snowflake etc
  • Worked on AWS environments such as Amazon EMR, S3, EC2, AWS Data Pipeline, Athena, Redshift, Glue and Crawler etc.
  • Certified as Vantage Certified Associate issued by TERADATA. In depth knowledge in MPP Database architecture and having expertise in industry standard best practices and performance tuning approaches towards building DWH solutions in both Teradata and Redshift managed services.
  • Experienced in ETL processing of large datasets of different forms including structured, semi- structured and unstructured data. Extensive ETL experience in IICS/Informatica Power Center, Oracle, Teradata, Shell/Python Scripting, Big data, Elastic Search and Clod computing platforms.
  • As an ETL Developer built Mappings, maplets, Sessions, workflows for extracting data from various source systems and loading data into staging/target layer, using Informatica. Emphasized on performance Optimizations in ETL applications at various levels.
  • Experience in various Python libraries such as NumPy, Pandas, SciPy and Scikit-learn and visualization using Matplotlib, Exploratory Data Analysis (EDA) techniques, Feature Engineering in Machine learning.
  • Good understanding of machine learning algorithms like Linear Regression, Logistic Regression, Decision Trees etc.

Overview

10
10
years of professional experience
4
4
years of post-secondary education

Work History

Data Architect

Brillio Technologies
Bangalore
12.2021 - Current

Currently deployed in the role of a resident offshore Data Architect for the client called UMG(Universal music group). The project is all about building a royalty portal which is expected to be accessed by various singers across the globe for Artist Royalty, Copyright related statements etc. The underlying technology stack is primarily built on top of AWS cloud services.

Manager- Projects

Cognizant Technology Solutions
Bangalore
06.2019 - 12.2021

Medtronic is one of the largest health care equipment providers on the Globe. The goal of this project is to perform integration on CSOD data ,which serves as a platform where the intended users could leverage that data and draw insight from it.

  • Created data pipeline in AWS Glue using Pyspark and Python.
  • Prepared Pyspark/python scripts and performed unit testing both locally and using AWS GLUE provided dev end point , before making a script ready as a whole for glue jobs.
  • Participated in POC on IICS- Snowflake Integration in the early phase of the project.
  • Contributed on building Synchronization task ,Mapping task etc in IICS in order to build a robust data pipeline which would populated 200 gb+ data daily in the target system.
  • Designed and created tables in Snowflakes and used snowpipe for ingesting data in near realtime.
  • CICD approach was followed with the help of Jenkins and GIT, so contributed to code deployment across different environments.

Technical Lead

Teradata India ltd
Mumbai
12.2017 - 05.2019

Project Entail: The goal of this project is to build a Green filed DWH from scratch that would accommodate a range of technologies such as Big Data (AWS EMR), Advanced Analytics (R, SAS for statistical analysis), AWS Redshift, Web services, Message Queue (Amazon SQS), Teradata & AWS Glue (ETL)

  • Engaged with other teams and projects to enable best architecture practice for the implementation of complex system landscape comprising of Data Lake Warehouse to align with enterprise business policies, principles and standards.
  • Involved in loading data from the different Data sources like (Teradata and Oracle) into S3 and Redshift using Glue.
  • Design and Develop ETL Processes in AWS Glue to migrate data from external sources like S3, ORC/Parquet/Text Files into AWS Redshift.
  • Create external tables with partitions using Hive, AWS Athena and Redshift.

Senior Associate- Projects

Cognizant technology solutions
Mumbai
05.2016 - 12.2017

Jio Analytics program is an initiative to give an edge to business by providing the right information to the right people at the right time, enabling them to take the right decisions.

  • Routine health check of all applications, monitoring the informatica/ Oozie workflows and sharing the job status of scheduled batches report to the respective stakeholders.
  • Migrated Informatica objects from Dev to Prod environment by using deployment group.
  • Involved in leveraging Hadoop ecosystem components including Pig and Hive for data analysis, Sqoop for data migration.
  • Worked with different data sources like csv files, JSON files, SQL server and Oracle to load data into Hive/relational tables.

Senior Software Engineer

IGATE Global solutions
Mumbai
05.2014 - 05.2016

Genworth’s Adminserver takes care of various annuities related reports say fixed annuities, variable annuities etc. Adminserver Screen front-end application accesses the underline data marts in order to address various business queries.

  • Used Informatica client tools - Source Analyzer, Target designer, Mapping Designer, Mapplet
  • Designer, and Transformation Developer for defining Source & Target definitions and coded the process of data flow from source system to a staging area.

Software Engineer

Biz Technologies
MUMBAI
09.2012 - 04.2014

Singapore Telecommunications Limited, commonly abbreviated as SingTel, is a telecommunication company based in Singapore is one of the largest mobile operator in the world .SingTel has a well- established and extensive communications network and infrastructure in Singapore and Australia.This Project is to Design and Construct EDW for SingTel Mobile Singapore Inc.

  • Involved in analyzing and importing source tables from the respective databases and flat files
  • Used various transformations as part of mappings like source Qualifier, Filter, Aggregator, Expression, and lookup, Sequence Generator
  • Developed standard and re-usable mappings and mapplets using various transformations
  • Created Mappings, Sessions, workflows for extracting the data from various source systems, flat files and load data in to the staging layer using informatica objects.

Education

Bachelor of Technology - IT

Biju Patnaik University of Technology
07.2004 - 08.2008

Skills

AWS Services: Redshift, Glue, DataPipeline, S3, lambda etc

ETL Tools :Informatica PC/IICS, Qlik Replication

Big Data Ecosystems: HDFS, Hive, Sqoop, DataBricks, PySpark

Python, Shell scripting, Elastic search, SOAP UI,Gitlab, GitHub, Jira Confluence

Timeline

Data Architect

Brillio Technologies
12.2021 - Current

Manager- Projects

Cognizant Technology Solutions
06.2019 - 12.2021

Technical Lead

Teradata India ltd
12.2017 - 05.2019

Senior Associate- Projects

Cognizant technology solutions
05.2016 - 12.2017

Senior Software Engineer

IGATE Global solutions
05.2014 - 05.2016

Software Engineer

Biz Technologies
09.2012 - 04.2014

Bachelor of Technology - IT

Biju Patnaik University of Technology
07.2004 - 08.2008
Rudra Prasad BarikData Architect