Summary
Overview
Work History
Education
Skills
Interests
Accomplishments
Timeline
Generic
Vishal Domale

Vishal Domale

Big Data Engineer
Pune

Summary

Experienced and performance-driven Big Data Engineer with over 12 years of proven expertise in designing, building, and optimizing data pipelines and real-time processing systems using technologies such as Apache Spark, Kafka, Scala, Spring Boot, and AWS. Skilled in transforming complex business requirements into scalable, efficient, and production-ready data solutions across diverse domains including Cybersecurity,Housing,Search Engine, Healthcare and Real Estate.

  • Big Data Architecture & Engineering:
    Designed and implemented real-time and batch data processing frameworks using Apache Spark, Kafka Streams, and AWS cloud-native services. Successfully handled large volumes of structured and unstructured data in production environments.
  • Domain Expertise Across Verticals:
    Delivered end-to-end data engineering solutions for mission-critical applications in Security Information and Event Management (SIEM), Electronic Health Records (EHR), and Ad Platforms.
  • Microservices & Backend Development:
    Strong experience building scalable Microservices using Spring Boot and integrating them with data platforms, REST APIs, and messaging systems.
  • Team Leadership & Collaboration:
    Led cross-functional engineering teams, mentored junior developers, and collaborated with product owners and business analysts to streamline development workflows and ensure timely delivery.
  • DevOps & Cloud Native Solutions:
    Proficient in using AWS (EC2, S3, Lambda), CI/CD pipelines, Git, and Jenkins for automating deployment and monitoring of data platforms.
  • Agile & Product Ownership Engagement:
    Worked closely with agile teams to gather requirements, prioritize features, and reduce development cycles by identifying gaps early in the SDLC.

Overview

12
12
years of professional experience
4
4
years of post-secondary education
3
3
Languages

Work History

Team Lead

globant
Pune
04.2023 - Current

Hyde housing Data platform integration

  • Leading real-time big data pipeline projects using Spark Streaming, Kafka, and AWS services.
  • Developed and maintained scalable Microservices in Spring Boot for real estate and security analytics platforms.
  • Spearheaded canonical mapping design, data ingestion, and cost optimization strategies.
  • Ingestion API development in Spring boot
  • Led cross-functional teams to deliver high-quality software solutions, ensuring alignment with client requirements and project timelines.
  • Developed and implemented agile methodologies, improving team efficiency and collaboration in project execution.
  • Facilitated daily stand-ups and sprint planning sessions, promoting transparency and accountability within the team.

Principle Software Engineer

Securonix
Pune
05.2022 - 04.2023

cybersecurity platform

  • Worked on a large-scale SIEM (Security Information and Event Management) platform in the cybersecurity domain.
  • Onboarded client data from various sources including databases, application logs, and server logs.
  • Developed and maintained Spark Streaming jobs to ingest and process real-time data.
  • Applied detection policies on streaming data to identify potential threats and generate alerts.
  • Collaborated with security analysts and DevOps to ensure timely resolution of production issues.
  • Enabled real-time threat detection and alerting, improving response time and reducing false positives
  • Designed and implemented scalable software solutions using Java and Python, enhancing data processing efficiency for security analytics.
  • Developed robust, scalable, modular and API-centric infrastructures.
  • Implemented version control systems to streamline development processes and facilitate easier code integration and collaboration.

Team Lead

GLobant
Pune
02.2017 - 05.2022
  • ZapLabs Real Estate Data Platform
    Developed APIs and Spark jobs to ingest and normalize property data from multiple MLS providers.
    Built microservices using Spring Boot and Dropwizard.
    Integrated with Elasticsearch for advanced metadata recommendations.
    Implemented custom scheduler using Quartz.
  • JWT Authorization Platform
    Built SSO with OpenID and Spring Security using RSA256 JWT tokens.
    Integrated authentication with AWS IAM and Vault DB.

Technologies Used: Scala, Spark, Kafka, AWS (EC2, S3), Elasticsearch, Spring Boot, Quartz Scheduler, JWT, Dropwizard

  • Led cross-functional teams to deliver high-quality software solutions, ensuring alignment with client requirements and project timelines.

Software Engginer

EZDI
Pune
09.2015 - 02.2017

Mediscribe Platform

  • Worked on the Mediscribe Platform, a multi-module cloud-based software for managing medical transcription workflows.
  • Contributed to the development of ezCAC, a medical coding application that applied NLP to suggest medical codes from processed charts.
  • Developed RESTful web services in Java and Spring Boot.
  • Installed and managed Solr search clusters on Linux, and indexed healthcare documents.
  • Managed Solr index shards and optimized search schema.
  • Installed and maintained Cassandra clusters, including schema design for healthcare data.
  • Provided continuous support for Solr-Cassandra integration and real-time indexing.- Developed microservices in Spring Boot and managed Solr and Cassandra clusters.
  • Created data ingestion pipelines and designed database schemas for healthcare data.
  • Assisted with day-to-day operations, working efficiently and productively with all team members.

Software Engineer

Persistent Systems Limited
Pune
02.2013 - 08.2015

Cisco Search Relevance Platform

  • Involved in various phases of analysis of the existing system.
  • Prepared the analysis documents which will help in identifying the components required for customization.
  • Developed multiple Relevance Models. Wrote a custom Comparison Tool for compare two or more search engine search results.Worked on Cisco Search Relevance Platform, helping improve the quality and ranking of search results for sales and product documents.
  • Analyzed indexed documents and clickstream logs using Attivio Search Engine.
  • Developed and tuned multiple search relevance models to enhance accuracy.
  • Built custom comparison tools to evaluate search result sets.

NIC Government of India Search Engine

  • Worked on the NIC Government of India Search Engine project under the Department of Information Technology.
  • Developed MapReduce jobs in Java for crawling and indexing large-scale government websites using Nutch and Hadoop.
  • Installed, configured, and administered Hadoop clusters on RedHat Linux.
  • Designed search infrastructure using Solr for optimized relevancy and performance.
  • Loaded and processed terabytes of crawl data from UNIX file systems to HDFS for indexing and analysis.
  • Contributed to the improvement of public data access by fine-tuning the relevancy of search results.

Education

Bachelor of Science - Information Technology

Pune University
Pune
07.2007 - 06.2011

Skills

Apache Spark

undefined

Interests

Swimming
Reading books

Accomplishments

    Achievements

  • “You Made Difference” for improving the relevancy up to 79%
  • Attivio Solution Designer Certified.

Timeline

Team Lead

globant
04.2023 - Current

Principle Software Engineer

Securonix
05.2022 - 04.2023

Team Lead

GLobant
02.2017 - 05.2022

Software Engginer

EZDI
09.2015 - 02.2017

Software Engineer

Persistent Systems Limited
02.2013 - 08.2015

Bachelor of Science - Information Technology

Pune University
07.2007 - 06.2011
Vishal DomaleBig Data Engineer