Experienced Data Engineer with 4+ years of outstanding performance record in designing and implementing efficient data pipelines. Proficient in Python, SQL, PySpark and Cloud-based data platforms. Master in collaborating with cross-functional teams to deliver high-quality solutions.
Programming Languages: Python, SQL, Scala
Big Data Technologies: Spark/Pyspark, Hadoop, Hive, Kafka, Structured Streaming
Cloud based Tools: Microsoft Azure: Data Bricks, Data Factory, ADLS Gen 2
AWS (Basic): S3, Redshift, EMR, Athena, Glue
Databases: MySQL, HBase
Others: Git, Docker, Kubernetes, Jenkins, Linux, Agile Methodology