Engineer with 4 years of experience in big data processing, ETL development, and workflow automation. Proficient in Apache Spark, Hadoop, and SQL, with a focus on building scalable data pipelines and optimizing their performance. Skilled in data transformation, migration, and distributed computing for data integration. Track record of troubleshooting complex workflows, improving processing efficiency, and delivering business insights. Committed to data-driven decision-making that strengthens enterprise analytics platforms.
Overview
4 years of professional experience
Work History
PySpark Engineer
Tata Consultancy Services
12.2020 - Current
Designed and optimized data pipelines using Apache Spark and Hadoop, improving data processing efficiency by 30%
Migrated ETL processes from DataStage to Talend for the APAC and EMEA regions, streamlining data integration workflows and reducing processing time by 40%
Developed complex SQL queries for data transformation, analysis, and reporting, ensuring high data accuracy
Automated workflows with Shell Scripting, reducing manual intervention by 50% and improving system efficiency
Implemented data quality checks to enhance accuracy and reliability for business analytics
Provided training and support to teams for Talend-based ETL workflows, facilitating seamless adoption
Optimized batch processes to meet stringent SLAs, reducing data processing time by 35%
Strong problem-solving skills with a focus on performance tuning and optimization
Expertise in building and maintaining robust data workflows using Apache Spark and Hadoop
Expertise in Talend workflow design for performance and scalability
Proficient in crafting complex SQL queries for data transformation, analysis, and reporting
Prioritizes optimization and quality assurance, supported by documentation and training
Committed to innovation by staying current with the latest trends
Education
B.Tech (ECE)
SOA University
Bhubaneswar
01-2020
Skills
Apache Spark
Hadoop
HDFS
Talend
DataStage
MySQL
Snowflake
SQL
Shell Scripting
Documentation & Training
Data Analytics
Performance Optimization
System Troubleshooting
Projects & Initiatives
Led a data migration initiative that successfully transitioned legacy ETL processes to a modern Talend-based architecture.
Developed custom data processing workflows that enhanced the efficiency of data pipelines.
Collaborated with cross-functional teams to ensure seamless data integration and analytics.