Summary
Overview
Work History
Education
Skills
Certification
Timeline
Projects
Publications
Generic

Venkata Yaswanth Karri

Hyderabad

Summary

Results-driven Data Engineer with 2+ years of experience designing and maintaining scalable ETL pipelines, integrating HRIS and payroll systems, and optimizing cloud-based data workflows. Skilled in SQL, Python, and Microsoft Azure for building and monitoring reliable data solutions. Strong track record of troubleshooting production issues and improving performance across distributed systems. Collaborative team player passionate about developing efficient data infrastructure to power analytics, reporting, and business intelligence.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Associate

Strada Global
07.2023 - Current
  • Developed and deployed scalable ETL pipelines integrating HRIS platforms (Workday, SuccessFactors) with payroll systems (SAP, RHPro) across 10+ countries.
  • Supported production workflows for 2,000+ users, resolving data pipeline issues within SLA timelines using Azure Logs and Kubernetes.
  • Monitored pipeline performance using Azure Container Logs, App Insights, and WebMethods, improving system reliability and throughput.
  • Collaborated with cross-functional teams and mentored new hires, enhancing onboarding and team productivity.
  • Built and maintained custom data mappings, resolving non-standard integration issues to ensure accurate payroll data flow for 60,000+ employees.
  • Diagnosed and resolved file transfer and data integrity issues using Azure Logs, Blob Storage, and Software AG Designer.
  • Released queue locks and resolved Infotype errors in SAP euHReka, maintaining uninterrupted payroll operations.
  • Participated in UAT, SIT, and regression testing in QA environments to validate data migration and prevent production delays.
  • Acted as primary escalation point for technical issues, ensuring timely resolution and stakeholder satisfaction.


Project: Strada Exchange Implementation – ExxonMobil (US/CA)

  • Led technical implementation of data migration integrating SuccessFactors with SAP euHReka via proprietary ETL tool Strada Exchange.
  • Built and maintained custom data mappings, resolving non-standard integration issues to ensure accurate payroll data flow for 60,000+ employees.
  • Diagnosed and resolved file transfer and data integrity issues using Azure Logs, Blob Storage, and Software AG Designer.
  • Released queue locks and resolved Infotype errors in SAP euHReka, maintaining uninterrupted payroll operations.
  • Participated in UAT, SIT, and regression testing in QA environments to validate data migration and prevent production delays. • Acted as primary escalation point for technical issues, ensuring timely resolution and stakeholder satisfaction.

Data Science Intern

DRDO
06.2022 - 07.2022

Project: Log Data Analysis using Elastic Stack

  • Deployed the Elastic Stack (Elasticsearch, Logstash, Kibana) to build a centralized log monitoring system for Apache server logs.
  • Developed data ingestion pipelines using Filebeat and Logstash with regex-based parsing to extract structured data from raw logs.
  • Designed Kibana dashboards to track request volume, error trends, and response times for real-time insights.
  • Optimized data flow and indexing strategy, improving log query performance by around 25%.
  • Documented pipeline architecture and setup steps to streamline maintenance and team onboarding.

Education

Bachelor of Technology - Computer Science

GITAM University
Visakhapatnam
06-2023

Skills

  • SQL, Python, R
  • Microsoft Azure (Blob Storage, App Insights, Databricks, Kubernetes Logs, Data Studio)
  • ETL & Data Integration (Strada Exchange, Custom ETL Mappings, Data Mapping/Validation, Error Resolution)
  • Elastic Stack (Elasticsearch, Logstash, Kibana, Filebeat)
  • Software AG Command Central & Designer
  • UAT/SIT Testing

Certification

  • Introduction to Big Data with Spark and Hadoop, IBM
  • Database Management Essentials, University of Colorado System
  • Cloud Computing Foundations, Duke University

Timeline

Associate

Strada Global
07.2023 - Current

Data Science Intern

DRDO
06.2022 - 07.2022

Bachelor of Technology - Computer Science

GITAM University

Projects

Project: Custom RAG Chatbot for Document Search and Summarization

  • Built a chatbot using the Retrieval-Augmented Generation (RAG) approach to search and summarize large PDF documents (600+ pages) with GPT-based answers.
  • Created a pipeline for reading PDFs, splitting text, generating embeddings (FAISS/Chroma), and retrieving relevant content.
  • Added chat history and memory so the bot could handle multi-turn, context-aware conversations.
  • Used LangChain, LangGraph, and Streamlit to build an interactive and scalable web app.
  • Improved response speed by ~40% with caching and better memory management.

Publications

  • CARDIOCARE – A Cardiovascular Disease Prediction System Based on KNN and Random Forest ML Algorithms, JETIR, ISSN: 2349-5162, Vol. 10, Issue 3, pp. e405–e410, March 2023.
Venkata Yaswanth Karri