Innovative and results-driven Big Data Engineer with a proven track record of high productivity and efficient task execution. Skilled in the Hadoop ecosystem, Spark programming, and advanced data modeling, with a passion for solving complex data challenges. Known for strong analytical thinking, problem-solving, and communication skills, enabling effective collaboration with cross-functional teams to deliver scalable and insightful data-driven solutions
Big Data Ecosystem:
Apache Spark (PySpark, Spark SQL), Hadoop (HDFS, MapReduce), Apache Hive, Apache Kafka, Pig
ETL & Data Warehousing:
Talend,Data Modeling, Snowflake
Programming Languages:
Python, SQL
Data Storage:
HDFS
Data & System Skills:
Data Quality & Validation, Performance Tuning (Spark & SQL), Distributed Computing, Shell Scripting, Linux/Unix
Soft Skills:
Cross-functional Collaboration, Problem Solving, Agile Methodology, Effective Communication