Cloud Engineer with over four years of experience in cloud engineering and seven years in IT. Expertise in cloud engineering, data engineering, and full-stack development. Skilled in data analysis, report generation, batch job scripting, and client interaction. Proficient in Django, Apache Hive, ETL processes, and data import/export. Strong foundation in building efficient and scalable solutions. Passionate about solving technical challenges, mentoring teams, and taking on lead responsibilities.
Company Portal
StubHub is an American ticket exchange and resale company that provides services for buyers and sellers of tickets for sports, concerts, theater, and other live entertainment events.
• Analyzing data for email marketing.
• Data modeling for new requirements.
• Generating reports using Spark SQL.
• Scripting and scheduling batch jobs.
• Impact analysis and troubleshooting of data issues (on Hive and Netezza).
• Direct interaction with clients: gathering and presenting the analyzed data.
Vsoft works on the cheque processing system for several major Indian banks. The role involved generating daily reports and automating their delivery to the banks.
• Using Apache Hive (HQL) to perform ETL (Extract, Transform, Load) through both the Hive CLI and Beeline interfaces, including creating, partitioning, and bucketing tables.
• Monitoring production jobs and providing technical support.
• Importing and exporting data between databases such as MySQL and Netezza and HDFS using Apache Sqoop.
• Impact analysis and troubleshooting of data issues in Hive and Netezza.
• Ingesting files into Hive tables.
• Maintaining production database with 7 days
• Data Preparation and validation
• Data cleaning (PySpark)
• Data ingestion to HDFS
• Data Modeling
• RDBMS [SQL data types, aggregations, and advanced functions]
• Data transfer – Sqoop
• Extract, Transform, Load – HiveQL
• Scripting for daily batch jobs – Python and shell
• Management and support of large-scale data warehousing
• Knowledge of AWS services (EMR)
• Sqoop connectivity (Netezza, Db2)
• Loading data from HDFS to Netezza
Become a Data Engineer: Mastering the Concepts, from LinkedIn, 07/2020
Big Data Analytics with Hadoop and Apache Spark, from LinkedIn, 05/2020
Data Science Foundations: Data Engineering, from LinkedIn, 05/2020