Around 12+ years of IT experience including 7 years in Big Data Technologies.
Well versed with Hadoop Map Reduce, HDFS, Hive, Impala, Sqoop, Flume, Yarn, Zookeeper, and Oozie
In depth understanding/knowledge of Hadoop Architecture and various components.
Experience in installation, configuration, support and management of a Hadoop Cluster.
Worked on both Hadoop distributions: Cloudera and Hortonworks.
Strong Knowledge in Hadoop Cluster Capacity Planning, Cluster Monitoring.
Capability to configure scalable infrastructures for HA (High Availability)
Experience in balancing the cluster after adding/removing nodes.
Experience in Setting up Data Ingestion tools like Flume, Sqoop and Kafka
Experience in configuring Zookeeper to coordinate the servers in clusters.
Experience in setting up Name Node high availability for major production cluster
Experience in designing Automatic failover control using zookeeper and quorum journal node
Experience in creating, building and managing public and private cloud Infrastructure
Experience in working with different file formats and compression techniques in Hadoop
Experience in analyzing existing Hadoop cluster, Understanding the performance bottlenecks and providing the performance tuning solutions accordingly.
Strong experience in System Administration, Installation, Upgrading, Patches, Migration, Configuration, Troubleshooting, Security, Backup, on Linux (RHEL) systems.
Experience in working large environments and leading the infrastructure support and operations.
Migrating applications from existing systems like MySQL, oracle, db2 and Teradata to Hadoop.
Experience in administering the Linux systems to deploy Hadoop cluster and monitoring the cluster.
Experience on Commissioning, Decommissioning, Balancing, and Managing Nodes and tuning server for optimal performance of the cluster.
Knowledge in AWS cloud
Possess good knowledge in creating and launching EC2 instances using AMI’s of Linux, Ubuntu, RHEL, and Windows