Summary
Overview
Work History
Education
Skills
Websites
Certification
Timeline
Generic

Mukesh Dhakane

Bidkin, Chh. Sambhajinagar (Aurangabad)

Summary

Hadoop Administrator with 5.10 years of experience in managing and optimizing on-premises Cloudera (CDH/CDP) clusters. Skilled in end-to-end cluster administration, including provisioning, upgrades, monitoring, and performance tuning to ensure high availability and scalability. Proficient in core Hadoop ecosystem components such as HDFS, YARN, Hive, HBase, Spark and Impala, with strong expertise in cluster security, Kerberos authentication, and data governance. Experienced in Cloudera Manager, Replication Manager, and Data Warehouse (CDW) for cluster operations, data replication, and disaster recovery. Adept in Linux system administration and automation through scripting to streamline operations and enhance reliability. Proven track record of delivering stable, secure, and efficient Hadoop platforms in enterprise environments.

Overview

6
6
years of professional experience
1
1
Certification

Work History

Platform Engineer

Cloudaeon Pvt. Ltd
11.2023 - Current
  • Designed, implemented, and maintained large-scale Hadoop clusters ensuring high availability, reliability, and performance.
  • Administered cluster provisioning, monitoring, and scaling in multi-node Hadoop environments, including CDH-to-CDP upgrades to improve performance and supportability.
  • Performed capacity planning, performance tuning, and resource optimization to improve cluster efficiency.
  • Optimized Apache Impala performance through query tuning, metadata synchronization, and resolution of memory, or execution related issues.
  • Configured and managed Hadoop ecosystem security including Kerberos authentication, Ranger policies, and SSL encryption.
  • Automated cluster administration tasks using Ansible, Shell scripting and Python scripts, reducing manual effort and downtime.
  • Monitored and troubleshot cluster issues related to HDFS, Hive, Yarn jobs and Impala queries.
  • Implemented backup, recovery, and disaster recovery strategies for mission-critical data pipelines.
  • Collaborated with Data Engineers, Developers, and Business Analysts to support ETL workflows and analytics workloads.
  • Conducted cluster upgrades (CDH-CDP), server patching, and migration activities with minimal downtime.
  • Designed and enforced governance policies for data retention, compliance, and security audits.
  • Delivered 24/7 production support for Hadoop and Big Data platforms, including on-call management of sev-1 issues.
  • Mentored junior admins and provided best practices for Hadoop cluster operations and optimization

Data and cloud ops Engineer

Clairvoyant an EXL company
11.2019 - 10.2023
  • Oversaw the deployment and lifecycle management of enterprise-grade Hadoop infrastructure, ensuring seamless operations and reliability
  • Monitored cluster health, job execution, and node availability using tools like Ambari, Cloudera Manager.
  • Maintenance activities such as balancing data across the nodes in the cluster, backing up metadata, and so on.
  • Performance tuning of Hadoop cluster (Yarn, Impala).
  • Preparation of technical documentation/MOPs.
  • Handling the tickets related to Hadoop issues to resolve them in accordance with SLA and priority responsibilities.
  • Hands-on experience in data pre-processing, data migration, data transformation, and optimization.
  • Take end-to-end responsibility for the Hadoop life cycle in the organization.
  • Created and maintained technical documentation, including SOPs, configuration details, RCA reports, and upgrade procedures.
  • Good team player with an excellent ability to learn any new technologies in a short time.
  • Self-starter with effective communication, Leadership, Interpersonal, analytical and organizational skills

Education

PG Diploma in Big Data Analytics -

Sunbeam Institute of Pune
09.2019

Bachelor of Engineering - E&TC

MIT College
06.2018

Diploma - E&TC

MIT College
06.2015

Skills

    Big Data & Hadoop Ecosystem:

  • Hadoop ecosystem management (HDFS, YARN, MapReduce, Hive, HBase, Spark)
  • Cluster design, deployment, scaling, and high availability (HA)
  • Hadoop distribution experience (Cloudera, Hortonworks, Apache,)
  • Hadoop security (Kerberos, Ranger, SSL)
  • Administration & Monitoring:

  • Cluster monitoring & troubleshooting (Ambari, Cloudera Manager, Grafana, Airflow, etc)
  • Performance tuning and capacity planning
  • Backup, recovery, and disaster recovery planning
  • Upgrade, patch management, and version migrations
  • Scripting/Automation & Data Governance/Security:

  • Proficient in Shell scripting to automate cluster administration tasks such as service restarts, log rotation, monitoring, and backups
  • Designed automation workflows/jobs for data ingestion, housekeeping, and maintenance, reducing manual effort and downtime
  • Implemented and managed Kerberos authentication to secure multi-node Hadoop clusters
  • Performed keytab management, ticket lifecycle monitoring, and periodic security audits to ensure compliance and data protection
  • Azure Cloud Platform:

  • Microsoft Azure (IaaS & PaaS): Understanding of deploying and managing cloud services
  • Azure Virtual Machines: Experience creating and configuring VMs for testing and small-scale workloads
  • Azure Active Directory (AAD): Familiar with user/group management and applying basic RBAC policies
  • Azure Monitor & Log Analytics: Exposure to setting up monitoring dashboards and reviewing performance logs
  • Azure Data Factory (ADF): Hands-on practice in creating simple data pipelines for integration and migration tasks
  • Other Relevant Skills:

  • Linux/Unix system administration
  • Networking basics (firewalls, DNS, load balancing in clusters)
  • SQL and NoSQL databases (MySQL, PostgreSQL, MongoDB)
  • Collaboration with Data Engineers, Analysts, and Business stakeholders
  • Strong problem-solving and root cause analysis skills

Certification

  • Azure Fundamentals (AZ-900)
  • Azure Administrator (AZ-104)
  • Azure DevOps AZ-400 (In-progress)

Timeline

Platform Engineer

Cloudaeon Pvt. Ltd
11.2023 - Current

Data and cloud ops Engineer

Clairvoyant an EXL company
11.2019 - 10.2023

Bachelor of Engineering - E&TC

MIT College

Diploma - E&TC

MIT College

PG Diploma in Big Data Analytics -

Sunbeam Institute of Pune
Mukesh Dhakane