Krishnakumar Natthani

Pune

Summary

Skilled Data Architect with 9 years of experience designing and developing enterprise data solutions. Proficient in the end-to-end software development life cycle using agile methodology. Expertise in Spark, Python, SQL, Azure (Databricks, Synapse, ADF, ADLS), AWS (S3, Glue, Redshift, Lambda, Athena, Airflow), Snowflake, MSBI, PowerBI, and Tableau. Demonstrated ability to analyze and build data models for data lakes and warehouses. Built RAG solutions using Azure OpenAI, Azure AI Search, Azure ML, and Azure Functions. Proven track record of leading teams, gathering business requirements, translating them into technical implementations, estimating story points, and successfully delivering work items.

Overview

9
years of professional experience

Work History

Data Architect

Globant
Pune
09.2020 - Current
  • Career Progression: Joined as a Senior Data Engineer, advanced to Lead Data Engineer, and now working as Data Architect.
  • Client Engagement: Collaborated with client business and technical teams to gather and analyze requirements, pain points, vision, budget, and tool preferences across various domains including Pharma, Rewards & Loyalty, HR, and Finance in different projects.
  • Architecture Design: Developed Low-Level and High-Level Design (LLD & HLD) for diverse data sources, including semi-structured, RDBMS, flat files, APIs, and unstructured data.
  • Architectural Frameworks: Designed and implemented Medallion, Kappa, and Data Mesh architectures.
  • Technical Expertise: Used PySpark, Python, SQL, Azure (Synapse, Databricks, ADF, ADLS, Blob, Logic Apps), AWS (S3, Glue, Redshift, Lambda, Athena, Airflow), Snowflake, and PowerBI across projects.
  • Metadata-Driven Framework: Built a PySpark and Python-based metadata-driven framework to generate Spark driver programs from JSON input, enabling automated control flow for data pipelines.
  • Gen-AI Exposure: Built RAG pipelines that extract text and images from SharePoint files, process the images with GPT-4o deployed on Azure OpenAI, generate tags, chunk and embed the text, and create indexes in Azure AI Search.
  • Data Modeling: Developed comprehensive data models, including dimension flattening and fact-dimension relationships tailored to access patterns.
  • Data Security: Implemented data security measures including encryption, masking, tokenization, and access control to safeguard sensitive data.
  • Data Governance: Managed data governance tasks such as data lineage, retention policies, and quality checks.
  • Team Leadership: Led a team of 15+ members, overseeing MSL (Master Story List) preparation, technical debt items, iteration estimation and planning, on-time delivery with quality, and handling escalations.
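
A minimal sketch of the metadata-driven idea described above, not the actual framework. It assumes a hypothetical JSON layout (`pipeline`, `steps`, `name`, `depends_on`) and uses plain Python callables in place of Spark jobs, so that it only shows how control flow can be derived from metadata.

```python
import json
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical pipeline metadata; every field name here is illustrative only.
PIPELINE_JSON = """
{
  "pipeline": "sales_daily",
  "steps": [
    {"name": "load_orders", "depends_on": []},
    {"name": "load_customers", "depends_on": []},
    {"name": "join_orders_customers",
     "depends_on": ["load_orders", "load_customers"]},
    {"name": "write_gold", "depends_on": ["join_orders_customers"]}
  ]
}
"""


def plan_steps(metadata_json):
    """Return step names in an order that respects every depends_on entry."""
    metadata = json.loads(metadata_json)
    # Map each step to the set of steps that must finish before it starts.
    graph = {step["name"]: set(step["depends_on"]) for step in metadata["steps"]}
    # static_order() yields predecessors first and raises CycleError on cycles.
    return list(TopologicalSorter(graph).static_order())


def run_pipeline(metadata_json, handlers):
    """Call the handler registered for each step, in dependency order."""
    executed = []
    for name in plan_steps(metadata_json):
        handlers[name]()  # in a real framework this would launch a Spark job
        executed.append(name)
    return executed
```

In this sketch, adding a step to a pipeline means editing the JSON and registering one handler; the ordering logic itself does not change.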

Business Intelligence Developer

Retail Solutions Inc
Pune
03.2019 - 08.2020
  • Company Overview: RSi is a product-based company that collects data from retailers like Walmart and Target, extracts insights, and provides them to goods producers such as PepsiCo and P&G.
  • Data Ingestion: Received data as flat files and incrementally ingested it into an ADLS landing layer, with folder structures organized by retailer and goods producer.
  • Data Transformation: Extracted and transformed data into a usable format, then loaded it into the Dedicated SQL Pool (formerly Azure SQL DW) using PySpark in Azure Databricks.
  • Analytics: Developed a tabular model on top of the Dedicated SQL Pool for each retailer-producer combination, applying row-level security.
  • Orchestration: Used ADF to ingest data into the raw layer, run Databricks notebooks, and refresh the tabular model.
  • Reporting: Exposed data to end customers via PowerBI, providing intuitive reporting.

Senior Systems Engineer

Infosys Ltd
Pune
08.2015 - 09.2018
  • Career Progression: Progressed from Systems Engineer to Senior Systems Engineer.
  • Initial Role: Began as an MSBI Developer, working with SSIS, SQL, Tabular Model, and PowerBI for Microsoft’s Sales & Marketing Data.
  • Migration Experience: Migrated projects from legacy MSBI to Azure, utilizing ADLS, ADF, Databricks, SQL, and Logic Apps.
  • Ingestion & Orchestration: Used ADF for data ingestion into the raw layer, orchestrated notebooks in Azure Databricks, and processed tabular models using Automation Runbooks for faster processing.
  • Transformation: Applied business logic transformations on raw data with PySpark, wrote data to ACID-compliant Delta tables, and optimized long-running jobs to reduce runtime and save costs.

Education

B.E. in Computer Technology

Sinhgad College Of Engineering, Pune University
05.2015

Diploma in Computer Engineering

Government Polytechnic
Khamgaon
06.2012

Skills

  • Apache Spark
  • Python
  • Azure
  • AWS
  • Data Modeling
  • Data Lake
  • RAG
  • ADLS Gen2
  • Azure Databricks
  • Azure Synapse
  • Azure Data Factory
  • Azure SQL
  • Azure OpenAI
  • Azure AI Search
  • Azure ML
  • Azure Logic Apps
  • AWS S3
  • AWS Glue
  • AWS Redshift
  • AWS Athena
  • AWS Lambda
  • AWS Airflow
  • AWS DynamoDB
  • Snowflake
  • PowerBI
  • Tableau
  • Team Management

Accomplishments

  • Microsoft Certified: Azure Data Engineer Associate
  • Earned Globant's "Pat on the Back" award multiple times for exceptional performance
  • Received the Core Value Award from RSi for timely efforts in migrating an existing product to Azure technology and PowerBI
