Summary
Overview
Work History
Education
Skills
Websites
Certification
Languages
Timeline
Generic

Umar Rangrez

Pune

Summary

Results-oriented Sr. Consultant with 9+ years of experience specializing in Hadoop, Big Data, Cloud, and MapR Administration. Currently affiliated with TransUnion. Seeking a challenging role in an innovative organization that fosters professional growth. Excels in strategic management of Big Data solutions and possesses proven problem-solving abilities. A valuable team player with exceptional verbal and written communication skills.

Overview

9
9
years of professional experience
1
1
Certification

Work History

Sr Consultant

TransUnion
Pune
04.2022 - Current
  • Conducted consultant services for MapR, RStudio, and Tableau
  • Provided expertise in MapR, RStudio, and Tableau for client projects
  • Implementing, managing large-scale Hadoop clusters and optimizing Hadoop infrastructure ensuring seamless performance using various tools to ensure the availability, integrity, and confidentiality of application.

Sr Hadoop Consultant

Saksoft
Pune
03.2020 - 04.2022
  • Provided day-to-day HDFS support and maintenance, ensuring efficient cluster operations
  • Deployed and managed Hadoop environments on cloud platforms including Amazon AWS, Google Cloud Platform, and Microsoft Azure
  • Administered and supported large-scale production Hadoop environments, ensuring optimal performance
  • Secured Hadoop clusters using Kerberos authentication and integrated with LDAP
  • Implemented and supported Enterprise Hadoop environments, including cluster tuning and ecosystem performance monitoring
  • Managed Hadoop infrastructure, provided HDFS support, maintained systems, and set up new users
  • Created and managed data pipelines, loading data from multiple sources into the cluster
  • Analyzed log files to identify root causes and implemented recommended actions
  • Performed YARN administration, ensuring resource management and job scheduling efficiency
  • Created highly available Hadoop clusters in production environments
  • Migrated data between secure clusters, ensuring data integrity and security
  • Developed and implemented backup and disaster recovery processes for Hadoop environments
  • Managed node decommissioning and commissioning on running clusters, including HDFS data balancing
  • Administered AWS Cloud services such as VPC, EC2, S3, and EMR
  • Installed, configured, and administered Hadoop distributions including Cloudera and Hortonworks
  • Upgraded Hadoop environments, including MEP (6.1 to 6.3), CDH (5.16.0 & 6.1.0), and HDP (2.7.0 & 3.1.1)
  • Configured and deployed MapR clusters, ensuring robust and scalable operations
  • Configured Sqoop for efficient data import/export between Hadoop and MySQL databases
  • Troubleshot and resolved Hadoop issues, implementing preventive measures to avoid recurrence
  • Installed various Hadoop ecosystems and daemons, ensuring seamless integration and operation
  • Secured Hadoop clusters (Cloudera, MapR, Hortonworks) using Kerberos with Active Directory
  • Configured LDAP, Sentry for authorization, KMS for data encryption, and extended ACL for HDFS
  • Collaborated with multiple teams to design and implement BigData clusters in cloud environments
  • Addressed and resolved dynamic production cluster issues, providing support to data scientists, data engineers, and Big Data.

Hadoop Administrator

Spadeage Technologies
Pune
06.2015 - 03.2020
  • Expertly deployed Hadoop environments on cloud platforms such as Amazon AWS, Google Cloud Platform, and Microsoft Azure
  • Managed and supported large-scale production Hadoop environments to ensure optimal performance and reliability
  • Implemented and supported enterprise Hadoop environments, including cluster tuning and ecosystem performance monitoring
  • Administered new and existing Hadoop infrastructure, providing HDFS support and maintenance, and setting up new users
  • Developed data pipelines to load data from multiple sources into the cluster using tools like StreamSets
  • Analyzed log files to identify root causes and implemented recommended corrective actions
  • Performed YARN administration to manage and optimize resource allocation and job scheduling
  • Configured high availability (HA) for production clusters to ensure continuous operation and minimize downtime
  • Migrated data between clusters, ensuring data integrity and security
  • Established and managed backup and disaster recovery (BDR) clusters
  • Decommissioned and commissioned nodes on running clusters, including balancing HDFS data
  • Managed AWS cloud services such as VPC, EC2, S3, and EMR for efficient cloud infrastructure management
  • Installed, configured, and administered Hadoop Enterprise Data Hub (EDH) environments
  • Configured and deployed major Cloudera Manager (CM) and Cloudera Distribution Hadoop (CDH) upgrades
  • Ensured data governance by configuring authentication and authorization protocols, providing end-to-end security for the cluster.

Education

Bachelor of Engineering -

NB Navle Sinhgad College of Engineering Solapur
01.2015

Skills

  • Big Data Ecosystem
  • Kubernetes
  • AWS
  • GCP
  • Azure
  • DevOps
  • MapR
  • HDFS
  • YARN
  • MapReduce
  • Sqoop
  • Flume
  • Kafka
  • Spark
  • Hive
  • Zookeeper
  • HBase
  • Impala
  • Pig
  • Databases
  • MySQL
  • NoSQL
  • Big Data Distributions
  • Cloudera
  • Hortonworks
  • AWS Cloud Services
  • EC2
  • IAM
  • S3
  • VPC
  • Cloud Platforms
  • Ticketing Tools
  • Service Now
  • Jira

Certification

  • Lean Six Sigma Yellow Belt

Languages

Hindi
First Language
Hindi
Proficient (C2)
C2
English
Proficient (C2)
C2

Timeline

Sr Consultant

TransUnion
04.2022 - Current

Sr Hadoop Consultant

Saksoft
03.2020 - 04.2022

Hadoop Administrator

Spadeage Technologies
06.2015 - 03.2020

Bachelor of Engineering -

NB Navle Sinhgad College of Engineering Solapur
Umar Rangrez