Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic
Ashish Kumar Patra

Ashish Kumar Patra

Noida

Summary

With approximately 3 years of experience in database management, data warehousing, and architecting Azure cloud solutions for a diverse clientele, I am a seasoned professional dedicated to driving impactful outcomes. Proficient in Microsoft Azure, specializing in Azure Data Factory, Blob, Azure Data Lake, Databricks, Azure SQL Data Warehouse (SQL DWH), Azure Data Explorer (ADX), and Event Hub. Skilled in leveraging Python, Scala, and Spark within the Azure Databricks environment. Extensive expertise in MS SQL Server and adept at end-to-end software product development, encompassing requirement analysis, system study, design, coding, testing, documentation, and implementation. Specialized in data migration and data warehousing, particularly within the Microsoft Data Platform domain.

Overview

6
6
years of professional experience
1
1
Certification

Work History

Senior Data Engineer

Cilio Automation Factory
Noida
07.2021 - Current

Azure Data Engineering Project:

  • Developed ADF Pipelines to load data from various sources such as Oracle, SQL Server, REST APIs, GraphQL APIs, and the File system into Azure Data Lake Gen2, SQL, PostgreSQL, and S3 Bucket destinations. Configured auditing tables in Azure SQL Database.
  • Utilized Dataflow and Databricks Notebooks within Azure Data Factory for data transformation tasks.
  • Established External and Internal tables in Azure SQL Data Warehouse employing distribution methods like ROUND_ROBIN, HASH, and REPLICATE, alongside Polybase for the data serving layer.
  • Set up Event Hub and IoT Hub, along with Azure Stream Analytics jobs for real-time data processing, directing outputs to Azure Data Lake Storage and SQL Server destinations.

Azure Databricks Project:

  • Set up Azure Databricks from scratch, which entails creating clusters, configuring access policies, managing the workspace, Azure Storage, Blob Storage, and Databricks table mounting, as well as upgrading Databricks runtime versions.
  • Additionally, developed Databricks libraries, Databricks jobs, and various notebooks for performing data transformations using Spark, Python, Scala, and SQL.

Azure DevOps Project:

  • Set up DevOps, including repository creation, access group creation, user onboarding, policy and rules creation, and project management using Azure DevOps Board, following the best DevOps practices provided by Microsoft.
  • Established DevOps frameworks for multiple clients and projects, initiated CI/CD pipelines for Azure Data Factory (ADF), Databricks (JAR Library, Notebooks), Infrastructure as Code (IaaC), Azure Stream Analytics, Azure Functions, and facilitated App Service and WordPress website deployments.

Azure Vision AI Project:

  • Developed an API utilizing Azure Vision AI Form Recognizer to extract JSON data from various invoices and receipts.
  • Integrated a custom model for a distinct receipt template type, extracting, transforming, and storing the data in JSON format within both Azure Data Lake Storage (ADLS) and SQL Server.
  • Successfully deployed the API within an Azure Function App.

Time Series Insights (TSI) to Azure Data Explorer (ADX) Migration Project:

  • Migrated TSI data to ADX using Kusto Query Language (KQL).
  • Created views and various graph representations of data in ADX.

AWS to Azure Migration Project:

  • Migrated AWS Lambda Functions to Azure Functions and created a Standard Operating Procedure (SOP) document accordingly.
  • Transferred AWS S3 bucket contents to Azure Storage.

ETL using Python Project:

  • Developed a Python web application that parses bank statements in PDF format and extracts the bank information into JSON format.
  • Initially, individual parser scripts were created for each bank. Subsequently, a generic parser script was developed and deployed in a Docker container, enabling extraction of data from multiple bank statements.
  • Additionally, the web application was designed to be configurable and capable of plug-and-play functionality for new bank statements. Python's class and object concepts were utilized, allowing for the creation of new classes by extending the base class according to the requirements of new bank statements.

Intern Data Analyst

JBS Capacitors Pvt. Ltd
Bhubaneswar
01.2018 - 09.2018

Project : Automated Trading using Predictive Analysis

  • Developed python modules (and some C++ functions) for detecting varieties of chart-patterns and harmonic analysis patterns on minute level EUR/USD data.
  • Besides this, performed other ancillary support tasks for the development of other AI/Machine Learning based modules performing the same.
  • Also implemented a web-scrapper that performed sentiment analysis from Twitter to gauge the trends of popular crypto currencies (Bitcoin, Ethereum, Litecoin).

Education

MCA - Master of Computer Application

Silicon Institute of Technology
Bhubaneswar
01-2018

Bachelor of Science - Computer Science

Gangadhar Meher College (Autonomous)
Sambalpur
01-2014

Skills

  • Azure Data Factory
  • Azure Databricks
  • Azure Data Lake
  • Azure DevOps
  • Azure Data Explorer
  • Azure Environment
  • Azure Synapse
  • Python Programming
  • Apache Spark, PySpark
  • Scala (Basic)
  • Shell Scripting
  • SQL and Databases
  • Problem-Solving Abilities

Certification

  • DP-900: Azure Data Fundamentals
  • Python (Basic) - HackerRank

Timeline

Senior Data Engineer

Cilio Automation Factory
07.2021 - Current

Intern Data Analyst

JBS Capacitors Pvt. Ltd
01.2018 - 09.2018

MCA - Master of Computer Application

Silicon Institute of Technology

Bachelor of Science - Computer Science

Gangadhar Meher College (Autonomous)
Ashish Kumar Patra