Having total 6 years of experience in Data Engineering and 10 years of overall experience in various domains including Retail, Telecom, Health Care and Finance. Design, develop and maintenance of bigdata and distributed data pipelines in a highly available configuration. Having expertise on BIG DATA Technology like Databricks, Pyspark, Hadoop, Hive, Spark SqL, Sqoop etc. Experienced in implementing ETL jobs using Sqoop from RDBMS to HDFS and vice versa. Experience in analyzing data using HiveQL. Good knowledge at using Spark APIs to cleanse, explore, aggregate, transform, and store data. Experience on Hadoop clusters using major Hadoop distributions like Cloudera 5.14. An effective team player with good interpersonal, analytical and client serving abilities.
Databricks
PySpark
Unity Catalog
Spark SQL
Hive
Sqoop
Airflow
Oozie
Oracle (PL/SQL)
MySQL
Windows
Linux
JIRA
Subversion (SVN)
Github
Bitbucket