IT professional with 5+ years of experience in the Spark and big data ecosystem, including Hive, Sqoop, Hadoop, Impala, and Python PySpark. Skilled in Scala, Java, MSSQL Server, and AWS, as well as Azure cloud. Seeking to broaden horizons in the field of Big Data and apply strong interpersonal and technical skills in a collaborative team environment. Committed to contributing to organizational growth and achieving job satisfaction.
SQL
Python
Pyspark
Scala
core Java
Linux bash Scripting
Libraries: numpy,pandas,Boto3,FastParquet
Oracle
MySQL
MSSQL Server
MongoDb
Hadoop
Hive
Sqoop
Impala
Spark
ETL development
Data pipeline design
Data modeling
Data warehousing
Big data processing
Performance tuning
Spark framework
Data governance