Sujit Parmar

Bigdata Consultant
Pune

Summary

Accomplished Big Data Platform Architect with expertise in analytics solutions and a proven track record of optimizing cloud architectures and enhancing system performance. Adept at collaborating with cross-functional teams and mentoring talent to drive success. Skilled in designing and implementing robust data integration solutions, leveraging cutting-edge cloud technologies to deliver measurable business impact.

Overview

13 years of professional experience
1 Certification
3 Languages

Work History

Bigdata Platform Architect

EXL
Pune
05.2023 - Current

Cloud Architecture & Data Platform Expertise; Migration & Data Integration; Databricks Benchmarking & Administration; Automation & DevOps Collaboration; Technical Support & System Monitoring; Training & Documentation; Mentoring & Team Collaboration

  • SME for cloud-based data platforms, specializing in big data, data lakes, data warehouses, and machine learning solutions.
  • Expertise in AWS analytics stack: EMR, Redshift, Glue, Glue Catalogs, and S3.
  • Skilled in data platforms: EMR, Cloudera, Hive, Presto, Redshift, Snowflake, and Databricks.
  • Led migration from EMR to Databricks, ensuring optimized performance and security compliance.
  • Conducted performance benchmarking of Databricks clusters for workload optimization.
  • Managed cluster administration, security policies, and cost optimizations for Databricks.
  • Implemented monitoring, alerting, audits, and security best practices for Databricks clusters.
  • Strong understanding of automation processes and best practices for cloud infrastructure.
  • Worked closely with the DevOps team to implement CI/CD pipelines and automation workflows.
  • Ensured security controls, compliance, and governance were integrated into DevOps procedures.
  • Provided technical support and troubleshooting for infrastructure-related issues.
  • Monitored and tested application performance, identifying bottlenecks and implementing solutions.
  • Managed and optimized installed cloud infrastructure for high availability and efficiency.
  • Conducted end-user and developer training sessions, created Confluence documentation, and provided live demos.
  • Created visual architecture designs for cloud migration and integration strategies.
  • Mentored teams on cloud architecture best practices and encouraged continuous improvements.

Bigdata/AWS Cloud Administrator

Guardian Life Insurance
Gurgaon
06.2021 - 05.2023
  • EMR Administration & Maintenance:
    Administered and optimized EMR clusters, ensuring efficient resource utilization.
    Managed data processing frameworks such as Hive, Presto, and Cloudera for seamless operations.
    Performed EMR 6 upgrades, ensuring compatibility and minimal downtime.
    Implemented platform monitoring, alerting, audits, and security compliance.
    Maintained data governance, ensuring platform security and policy adherence.
  • Migration & Performance Optimization:
    Supported and executed the migration of Hive workloads to Redshift, optimizing performance.
    Assisted in the evaluation and implementation of a Snowflake PoC for data integration.
    Monitored and tested system performance, identified bottlenecks, and implemented optimizations.
  • Automation & DevOps Collaboration:
    Worked with DevOps teams to implement platform automation using Jenkins CI/CD, Puppet, and scripting.
    Wrote and maintained custom scripts to improve system efficiency and automation.
  • Technical Support & System Monitoring:
    Provided critical troubleshooting and technical support for EMR-related infrastructure issues.
    Ensured system uptime and availability, proactively addressing platform concerns.
    Participated in end-user and developer training sessions, maintained Confluence documentation, and delivered live demos.

Bigdata Cloud Administrator

Santarich Infotech LLP
Pune
03.2016 - 07.2021
  • Hadoop Ecosystem Management & Troubleshooting:
    Recovering and troubleshooting issues across the Hadoop ecosystem, including HDFS, YARN, and Hive.
    Root cause analysis of system failures, network issues, and performance bottlenecks in both Hadoop and Linux environments.
    Managing cluster health and ensuring uptime by implementing proactive monitoring, alerting, and effective remediation steps.
  • Data Management & Migration:
    Importing and exporting data between relational databases (e.g., MySQL, Oracle) and HDFS using Sqoop for seamless integration.
    Facilitating data migrations across different clusters, including utilizing S3 for secure and efficient data transfer.
  • Resource Management & Scheduling:
    Efficiently managing and allocating resources through YARN schedulers to optimize performance and resource utilization within Hadoop clusters.
    Ensuring high availability and efficient resource distribution, especially in multi-tenant environments.
  • Storage Management & Backup:
    Analyzing and optimizing storage utilization and implementing strategies for backup and disaster recovery across clusters.
    Taking regular snapshots of data and commissioning and decommissioning nodes as part of cluster maintenance.
  • Incident Management & Support:
    Handling high-priority tickets and resolving live system issues in real-time while coordinating with application teams for system updates and patches.
    Conducting thorough troubleshooting of complex production-level issues, ensuring minimal downtime and disruption.
  • Disaster Recovery & High Availability:
    Maintaining robust disaster recovery mechanisms that preserve both data consistency and system availability during infrastructure failures.
    Implementing solutions for high availability and fault tolerance within the Hadoop environment.
  • Collaboration & Timely Resolution:
    Collaborating with cross-functional teams to ensure timely and efficient resolution of issues, particularly in business-critical applications.
    Maintaining a flexible schedule and responding to after-hours emergencies, ensuring continuous system operations.
  • Developed custom scripts for automating routine tasks, reducing manual intervention requirements.
  • Established proactive monitoring alerts to detect potential issues before they escalate into critical incidents.

System Administrator

Santarich Infotech LLP
01.2012 - 03.2015
  • Efficient Network and System Management: Proven expertise in managing, monitoring, and maintaining secure and reliable IT infrastructures, including servers, networks, and applications, ensuring optimal performance and minimal downtime.
  • Proactive Troubleshooting and Support: Skilled in diagnosing and resolving complex technical issues swiftly, providing seamless user support, and implementing preventative measures to enhance system reliability and security.

Education

Bachelors -

Bharti Vidyapeeth
Pune, India
04.2001 -

Skills

AWS EMR / Cloudera

Databricks / Snowflake

Redshift / Hive / Presto

AWS Solutions Architect

Linux, Python

Certification

AWS Solutions Architect

Timeline

Bigdata Platform Architect

EXL
05.2023 - Current

AWS Solutions Architect

05.2022

Bigdata/AWS Cloud Administrator

Guardian Life Insurance
06.2021 - 05.2023

Bigdata Cloud Administrator

Santarich Infotech LLP
03.2016 - 07.2021

System Administrator

Santarich Infotech LLP
01.2012 - 03.2015

Bachelors -

Bharti Vidyapeeth
04.2001 -