Summary

Overview

Work History

Education

Skills

Languages

Social Links - Linkedin

Timeline

Kumar Viplav

Chennai(Remote),Tamil Nadu

Summary

Results-driven data engineering professional with solid foundation in designing and maintaining scalable data systems. Expertise in developing efficient ETL processes and ensuring data accuracy, contributing to impactful business insights. Known for strong collaborative skills and ability to adapt to dynamic project requirements, delivering reliable and timely solutions.

Overview

years of professional experience

Work History

Data Engineer

Tiger Analytics

Chennai(Remote)

04.2023 - Current

Transformed data workflows with PySpark on AWS EMR and Snowflake.
Orchestrated data workflows using Apache Airflow and developed CI/CD pipelines with AWS CodePipeline and CloudFormation
Worked closely with Data Science teams in the insurance domain to build and optimize data pipelines for ML forecasting and predictive models
Developed data preprocessing, feature engineering, and post-processing workflows using AWS SageMaker preprocessing jobs and Jupyter Notebooks for model training in underwriting, fraud detection, and policy pricing
Led AI/ML model deployment, monitoring, and optimization to ensure scalability and performance in production
Designed transformation modules for key business metrics and collaborated with business and product teams for analytics and reporting
Implemented security and compliance measures in AI/ML development, ensuring adherence to regulatory standards such as HIPAA and GDPR
Passionate about improving pipeline efficiency, scalability, and reliability, leveraging AWS SageMaker, PySpark, and Jupyter Notebooks to enhance model training and deployment workflows in the insurance industry
.Fine-tuned query performance and optimized database structures for faster, more accurate data retrieval and reporting.
Experienced Data Engineer with expertise in migrating and optimizing ETL/ELT pipelines using AWS services and Snowflake
Optimized cloud-based ETL pipelines, reducing data processing time by 70% and lowering compute costs by 50% through efficient resource utilization and query tuning.
Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.

Big Data Engineer

Amdocs

Pune

07.2020 - 03.2023

Converted SQL queries and MapReduce programs into Spark transformations
Built ETL pipelines using Spark-Scala
Scheduled and orchestrated jobs using Oozie
Implemented partitioning buckets in HIVE
Developed and managed a scalable Hadoop cluster environment
Wrote HIVE queries for Business Extracts
Executed high-performance data processing for structured and unstructured data
Used Avro and other data formats to store in HDFS
Built and maintained message configuration and flows and provided issue analysis on Kafka applications
Developed custom visualization tools for better interpretation of complex datasets, aiding in strategic decision making.

Education

B.Tech/B.E. - Computers

SRM Institute of Science And Technology

Chennai

01.2020

Skills

SQL
Python
Spark
Data Warehousing
Hive
Pyspark
Snowflake
Airflow

AWS
Big Data
GIT
ETL development
Agile methodology
Data warehousing
Git version control

Languages

English

Hindi

Social Links - Linkedin

www.linkedin.com/in/krvip0310

Timeline

Data Engineer

Tiger Analytics

04.2023 - Current

Big Data Engineer

Amdocs

07.2020 - 03.2023

B.Tech/B.E. - Computers

SRM Institute of Science And Technology

Kumar Viplav

Summary

Overview

Work History

Data Engineer

Big Data Engineer

Education

B.Tech/B.E. - Computers

Skills

Languages

Social Links - Linkedin

Timeline

Data Engineer

Big Data Engineer

B.Tech/B.E. - Computers

Similar Profiles

Sainath MandadiSainath Mandadi

Raja Bandi - DataEngineerRaja Bandi - DataEngineer

Sakhena Meghana KuthadaSakhena Meghana Kuthada

Vivek Sai Madhav Suri Vivek Sai Madhav Suri null

SATISH KUMAR KANTASATISH KUMAR KANTA