Summary
Overview
Work History
Education
Skills
Certification
Projects
Training
Disclaimer
Languages
Websites
Timeline
Generic

VISHNU KRISHNAN

Kannur

Summary

Passionate Apache Spark Big Data Engineer with 2+ years of experience in designing and implementing large-scale data processing solutions. Seeking to leverage expertise in Spark, Hadoop, and related technologies to contribute effectively to a dynamic team.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Data Engineer

TATA COUNSULTANCY SERVICE PVT LTD
02.2022 - Current
  • Nielsen Computation Television Viewership, Retail and Customer Trends
  • Involved in the whole Lifecycle of the project.
  • Requirement analysis and design
  • Involved Code Deployment and Bug Fixing in Scala & Python using IntelliJ
  • Testing the code and deployed to the production.
  • Worked for developing and optimizing Apache Spark jobs to process terabytes of data daily.
  • Designed and implemented data pipelines using Spark RDDs, DataFrames, and SparkSQL for efficient data processing.
  • Monitored job progress through command line interface tools like yarn, spark-submit, spark-shell .
  • Designed and developed data pipelines to ingest structured and unstructured data into HDFS using Apache Spark.

Intern

BrainScript Technologies Pvt. Ltd
04.2021 - 12.2021
  • The project aims to find the propensity of buying the product from the Non-customers and find the hidden affluent Customers
  • Data Science: Used the model created from the Customers data of the car dealer
  • Customer segmentation of Car Dealer using K-Means clustering
  • Data Engineering: Transferred the Car Dealer data from SQL Server to ADLS G2 container for cleansing and EDA, transformed and loaded to the secondary container using ADF V2, pushed data to the Azure SQL DW from that container using ADF V2 and slice and dice using Power BI.

Education

BSc Computer Science
01-2020

Computer Science

Plus Two
01-2017

SSLC
01-2015

Skills

  • Data Engineering : Apache Spark, Hadoop, Kafka, HBase, Power BI
  • DevOps : Git Version Control
  • Programming Languages : Scala, Python, Java
  • Databases : MySQL, MSSQL
  • Cloud Platforms : AWS, Azure
  • Tools & Frameworks : Spark RDDs, DataFrames, SparkSQL, Spark Streaming, HDFS, Hive

Certification

DP-900 Azure Data Fundamentals

Projects

Understanding business requirement from business analyst, EDA using Power BI and Jupiter Notebook, Migration from SSAS Data mining to Azure Machine Learning Parallel Data Injection to SQL pool from Azure Data Lake Storage GEN 2 (Polybase), Participated in DAR for selecting ETL tool, Created Data Pipeline in Azure Data Factory Version 2 (ADF V2) from Azure Blob Storage to Azure SQL Server Participated in data value assessment and data dictionary creation, Helped in the creation of Work Breakdown Structure, SQL Server Machine Learning Dimensional Model Design, Participated in creation of priority metrics, SCRUM team member CRISP-DM framework based data analysis

Training

  • Data Engineering & Machine Learning
  • BSc Computer Science - 2020
  • Plus Two, Computer Science - 2017
  • SSLC - 2015
  • Informatica data integration and administration (2 days)
  • Understanding Deep Learning using Neural Network (3 days)
  • Snowflake dataware house understanding (5 days)
  • Altrex understanding (4)
  • XML understanding (3)
  • Rapid Miner (5)

Disclaimer

This resume template showcases relevant experience, skills, and achievements specific to Apache Spark Big Data engineering roles. Tailor it further based on your specific experiences and achievements to highlight what makes you a standout candidate.

Languages

Malayalam
First Language

Timeline

Data Engineer

TATA COUNSULTANCY SERVICE PVT LTD
02.2022 - Current

Intern

BrainScript Technologies Pvt. Ltd
04.2021 - 12.2021

BSc Computer Science

Computer Science

Plus Two

SSLC
VISHNU KRISHNAN