With over 8 years of experience building large-scale distributed data systems, I am an accomplished Data Engineer adept at architecting and implementing robust data pipelines to enable advanced analytics.
Leadership:
Core Competencies:
Languages: Python, SQL, Pyspark, Shell Script , R
Databases: MySQL, Oracle, Hive, MongoDB, Presto, Teradata
Technology: Pyspark, Airflow, Spark Streaming, AWS, Kubernetes, Docker, Kafka, Jupyter Note- book, Bamboo, Elastic Search, Jira , UNIX, Big Data, Hadoop, Tableau
Concepts: Architecture and Design of Data Pipelines, Data Science, Data Modeling, Data Analytics, Data Visualization, BI and Reporting, Agile Methodology
Domain Knowledge: US Healthcare, Telecommunications, Insurance