Seeking a Data Architect role to design and implement cutting-edge data solutions that drive business growth and innovation. Skilled Database Architect with 12+ years of experience in designing, optimizing, and managing large-scale data pipelines for telecom and cybersecurity analytics. Expertise in processing 500 PB of Ethernet packet data for realtime threat detection, network forensics, and behavioral analysis. Strong background in distributed systems, data modeling, and high-performance computing to support cybersecurity products. Experience with planning design Bigdata migration offline and also in cloud. Still capable of coding in python (Flask API), shell, scala (SBT) and core Java (Maven framework). Good knowledge of microservices, CICD, ETL pipelines and Machine learning oops.
Overview
16
16
years of professional experience
1
1
Certification
Work History
Tech Lead
Tech Mahindra
10.2025 - Current
Include managing all data sources (Hadoop, Elasticsearch, and Redis) as well as producing Docker Compose files for cloud architecture
On the other hand, complete the AWS infrastructure, ranging from EC2 to Pipelines
I currently oversee the AWS ETL tools and database support teams
Database Architect
Vehere Technologies PVT LTD
06.2023 - 10.2025
Include managing all data sources (Hadoop, Elasticsearch, and Redis) as well as producing Docker Compose files for cloud architecture
On the other hand, complete the AWS infrastructure, ranging from EC2 to Pipelines
I currently oversee the AWS ETL tools and database support teams
As a Vehere product, we accept responsibility for being included in both the Government Organization and Enterprise editions
One of the most important aspects of my career from beginning to end is client management
Data Engineering Master Course: Spark/Hadoop/Kafka/MongoDB, 2025-03-24
Languages
English
Hindi
Bengali
Current Project
Data Modernization Pipeline, This project aims to migrate an on-premises MySQL database to AWS cloud while implementing a robust CI/CD pipeline for automated deployments and leveraging PySpark for data processing and transformation. The solution ensures scalability, reliability, and cost-efficiency by utilizing AWS services like Amazon RDS/Aurora, AWS DMS, S3, Glue and CodePipeline., Reduced operational overhead with managed AWS services., Automated CI/CD for seamless updates., Scalable PySpark processing for big data workloads., Cost optimization with serverless options (Glue, Lambda).
Previous Projects
Cyber Terrorism Threat, Cyber terrorism refers to the use of digital tools and cyber-attacks to disrupt critical infrastructure, spread fear, or advance political or ideological goals. In cybersecurity, it poses a significant threat as terrorists target government systems, financial networks, or public utilities to cause chaos. These attacks may include hacking, data breaches, ransomware, or Distributed Denial-of-Service (DDoS) attacks aimed at destabilizing nations or organizations. Combating cyber terrorism requires strong cybersecurity measures, international cooperation, and advanced threat detection systems to prevent large-scale disruptions and protect sensitive data., Data Architect along with handling database (Hadoop and ELK) in both on-premises and AWS
Network Security in Enterprise Banking Systems, Banks and financial institutions rely on robust network security measures to protect sensitive data, prevent cyberattacks, and ensure regulatory compliance. Key features include firewalls (nextgen and application-aware) to filter malicious traffic, intrusion detection/prevention systems (IDS/IPS) to monitor and block threats in real time, and end-to-end encryption (TLS, VPNs) to secure transactions. Multi-factor authentication (MFA) and zero-trust architecture restrict unauthorized access, while segmented networks isolate critical systems (e.g., payment processing) from less secure zones. Advanced AI-driven threat detection tools analyze anomalies, and SIEM (Security Information and Event Management) solutions like Splunk provide centralized monitoring. Regular penetration testing and patch management further harden defenses against evolving cyber threats, ensuring trust and operational integrity in banking networks., Data Architect along with handling database (Hadoop and ELK) in both on-premises and AWS
Cigna Data Lake Implementation (Accenture), I was associated with the implementation of the data lake project (Hadoop, Podium with python and spark). Cigna, the healthcare giant, was associated with Accenture, and I was a team member of that implemented project. I am leading that team from Datawarehouse (hive and spark) including Sqoop jobs creation. Our source was oracle, and the data migrated to Hadoop., spark scripts developer for Hadoop and AWS Datawarehouse (redshift and S3)
Looking For
Seeking an AWS Data Architect role to design and implement cutting-edge data solutions that drive business growth and innovation.