Having total 6 years of experience in Data Engineering and 10 years of overall experience in various domains including Retail, Telecom, Health Care and Finance. Design, develop and maintenance of bigdata and distributed data pipelines in a highly available configuration. Having expertise on BIG DATA Technology like Databricks, Pyspark, Hadoop, Hive, Spark SqL, Sqoop etc. Experienced in implementing ETL jobs using Sqoop from RDBMS to HDFS and vice versa. Experience in analyzing data using HiveQL. Good knowledge at using Spark APIs to cleanse, explore, aggregate, transform, and store data. Experience on Hadoop clusters using major Hadoop distributions like Cloudera 5.14. An effective team player with good interpersonal, analytical and client serving abilities.
Databricks