Senior Data Engineer with 7.5+ years of experience in the Big Data/Hadoop technology stack, specializing in Apache Spark (Spark Core, Spark SQL, Spark Streaming), PySpark, Scala, Hive, Sqoop, Pig, HBase, and Kafka. Proficient in AWS (Glue, S3, Redshift), GCP (BigQuery, Dataproc), Python, Talend Data Integration (ETL), and Talend Data Quality. Skilled in optimizing Spark applications and creating complex ETL mappings with Talend tools.
Proven track record in developing, testing, and maintaining data architectures. Strong skills in database management systems, Big Data processing frameworks, data modeling, and data warehousing. Successfully led teams in building data solutions that improved system efficiency and business decision-making. Demonstrated impact in previous roles through enhanced data availability and accuracy.