Summary
Overview
Work History
Education
Skills
Languages
Social Links - Linkedin
Timeline
Generic

Kumar Viplav

Chennai(Remote),Tamil Nadu

Summary

Results-driven data engineering professional with solid foundation in designing and maintaining scalable data systems. Expertise in developing efficient ETL processes and ensuring data accuracy, contributing to impactful business insights. Known for strong collaborative skills and ability to adapt to dynamic project requirements, delivering reliable and timely solutions.

Overview

5
5
years of professional experience

Work History

Data Engineer

Tiger Analytics
04.2023 - Current
  • Transformed data workflows with PySpark on AWS EMR and Snowflake.
  • Orchestrated data workflows using Apache Airflow and developed CI/CD pipelines with AWS CodePipeline and CloudFormation
  • Worked closely with Data Science teams in the insurance domain to build and optimize data pipelines for ML forecasting and predictive models
  • Developed data preprocessing, feature engineering, and post-processing workflows using AWS SageMaker preprocessing jobs and Jupyter Notebooks for model training in underwriting, fraud detection, and policy pricing
  • Led AI/ML model deployment, monitoring, and optimization to ensure scalability and performance in production
  • Designed transformation modules for key business metrics and collaborated with business and product teams for analytics and reporting
  • Implemented security and compliance measures in AI/ML development, ensuring adherence to regulatory standards such as HIPAA and GDPR
  • Passionate about improving pipeline efficiency, scalability, and reliability, leveraging AWS SageMaker, PySpark, and Jupyter Notebooks to enhance model training and deployment workflows in the insurance industry
  • .Fine-tuned query performance and optimized database structures for faster, more accurate data retrieval and reporting.
  • Experienced Data Engineer with expertise in migrating and optimizing ETL/ELT pipelines using AWS services and Snowflake
  • Optimized cloud-based ETL pipelines, reducing data processing time by 70% and lowering compute costs by 50% through efficient resource utilization and query tuning.
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.

Big Data Engineer

Amdocs
07.2020 - 03.2023
  • Converted SQL queries and MapReduce programs into Spark transformations
  • Built ETL pipelines using Spark-Scala
  • Scheduled and orchestrated jobs using Oozie
  • Implemented partitioning buckets in HIVE
  • Developed and managed a scalable Hadoop cluster environment
  • Wrote HIVE queries for Business Extracts
  • Executed high-performance data processing for structured and unstructured data
  • Used Avro and other data formats to store in HDFS
  • Built and maintained message configuration and flows and provided issue analysis on Kafka applications
  • Developed custom visualization tools for better interpretation of complex datasets, aiding in strategic decision making.

Education

B.Tech/B.E. - Computers

SRM Institute of Science And Technology
Chennai
01.2020

Skills

  • SQL
  • Python
  • Spark
  • Data Warehousing
  • Hive
  • Pyspark
  • Snowflake
  • Airflow
  • AWS
  • Big Data
  • GIT
  • ETL development
  • Agile methodology
  • Data warehousing
  • Git version control

Languages

English
Hindi

Social Links - Linkedin

www.linkedin.com/in/krvip0310

Timeline

Data Engineer

Tiger Analytics
04.2023 - Current

Big Data Engineer

Amdocs
07.2020 - 03.2023

B.Tech/B.E. - Computers

SRM Institute of Science And Technology
Kumar Viplav