Resident Solution Architect with over 12 years of experience in Big Data Analytics, ETL, and Cloud Data Engineering. Proven track record in software development, enterprise application design, and implementation, with a strong emphasis on building scalable Big Data pipelines and Data Lake architectures.
Demonstrated expertise in Cloud Migration, Data Quality, and Data Management, with deep technical proficiency in technologies such as Scala, Java, Apache Spark, AWS, Azure, GCP, and Databricks.
Key contributor to Overwatch, a Databricks observability solution that generated $2.2M in annual recurring revenue (ARR), while driving strategic partnerships to enhance service offerings and accelerate growth.
Skilled in developing Agentic Applications using leading Agentic AI frameworks to build intelligent, autonomous solutions.
· Contributing in requirement analysis and solution discussion
· Analyzed log data from different sockets & files and provided insights about the risk associated with the user
· Developed Hive scripts for end user / analyst requirements to perform ad hoc analysis
· Solved performance issues in Hive with understanding of Joins, Group and aggregation and how does it translate to Map Reduce jobs.
· Developed simple to complex Spark jobs using Scala; contributed in the requirement and analysis phase
· Managed the importing of data from various data sources; performed transformations using Spark, Hive & Map Reduce
· Engaged in collecting the data from different data sources using SQOOP
· Migrated Map Reduce jobs to Spark
· Wrote Hive queries to analyses data in Hive Warehouse using Hive Query Language (HQL)
· Developed Hive and Spark SQL for the business logic
· Transformed structured data using Dataframe and HiveQL