Accomplished Big Data Engineer with more than 9.5 years of experience managing complex data systems with Hadoop ecosystem tools, including HDFS, Hive, Sqoop, and Spark. Specializes in developing and optimizing Spark RDD workflows in Scala and Python to improve query performance and resource efficiency. Skilled in processing large structured and semi-structured datasets with the Spark DataFrame API while maintaining data quality and integrity.
