Capacity Planning for Big Data Architectures
Big Data Infrastructure Deployments (Cloudera)
Large Dataset Management
Data Aggregation Processes
Advanced Data Mining
Problem-Solving Abilities
Analytical Thinking
Impala
Decision-Making
Interpersonal Skills
PySpark
HDFS
YARN
MapReduce
Hive
Python
ETL Development
Git
Ansible
Jupyter Notebook
Zeppelin