
Results-driven GCP Cloud Data Engineer with 4.5 years of experience in IT, including over 3.5 years of hands-on experience building and optimizing data pipelines using BigQuery, Airflow, and Dataproc. Skilled in data ingestion, transformation, and visualization to drive business insights. Proven ability to implement scalable data architectures and support real-time data processing in cloud environments.
Experienced in designing and optimizing data pipelines to ensure seamless data flow. Uses advanced SQL and Python skills to build and maintain robust data architectures. Track record of implementing scalable solutions that improve data integrity and support informed decision-making.
Database : SQL, Stored Procedures, Hive, BigQuery
Tools/Languages : Python, Scala, Spark, Hadoop, PySpark, Linux
Cloud Tools : Google Cloud Platform (GCP), BigQuery, GCP SDK, Dataproc, Cloud Storage, Pub/Sub, Cloud Composer
Other : Jenkins, GitHub, WinSCP, PuTTY
Data Formats : CSV, JSON, XML, ORC, Avro, Parquet
Project Execution Tools : Agile – Sprint, Scrum, JIRA, GitHub
Operating Systems : Windows, Unix and Linux
Data Warehouse : Data warehousing and data management, DBMS, shell scripting and Spark, batch and streaming data ticketing