Summary
Overview
Work History
Education
Skills
Languages
Affiliations
Timeline
Generic
Sanjay Karmakar

Sanjay Karmakar

Bangalore

Summary

With 28 years of unwavering passion for data, I have progressed through diverse roles and, for the past 15 years, have served as a strategic and results-driven Data Architect, aligning data vision with organizational goals to deliver transformative business value. Known for thinking beyond the task at hand, I excel at uncovering innovative, out-of-the-box possibilities that empower organizations to harness the full potential of their data. Recognized for an exceptional ability to design, implement, and optimize high-performance, scalable data models and pipelines that drive impactful decision-making and operational efficiency. I bring deep proficiency in the modern data stack (SQL, NoSQL, data warehousing/analytics), cloud platforms (AWS, Databricks, Snowflake), and distributed frameworks (Apache Spark, Kafka, Kinesis) to bridge technology and business outcomes seamlessly and elevate enterprise performance. Renowned for meticulous attention to data lineage, governance, and automation, I ensure smooth, compliant workflows tailored to enterprise needs. My leadership, analytical insight, adaptability to thrive in dynamic environments, and collaborative approach empower cross-functional teams to deliver robust, future-proof data solutions. With a unique blend of strategic vision, technical acumen, and interpersonal skills, I excel at solving complex data challenges and fostering an organization-wide commitment to data excellence. Adept at communicating complex technical concepts to all levels of leadership, I foster a culture of data excellence, adaptability, and continuous improvement, with a focus on scalability and quality.

Overview

29
29
years of professional experience

Work History

Senior Data Architect

NextGen Healthcare
Bangalore
10.2019 - Current
  • Architected and optimized scalable data solutions utilizing SQL, NoSQL, warehousing/ analytics technologies, including Snowflake and Apache Spark, to support complex operational and analytical workloads and drive significant increase, average 30% and above, in data processing efficiency, advanced insights and decision-making across multiple business units
  • Designed and implemented multiple MDM solutions, including an Enterprise Master Patient Index (EMPI) to unify patient records across healthcare facilities, providing a single, trusted patient identity through deduplication and record linking
  • Also developed a Real-Time Data Synchronization and Compliance solution using Kinesis, integrating automated data masking, encryption, and audit trails to ensure adherence to HIPAA, GDPR, and other healthcare regulations
  • Played a key role in developing a real-time data pipeline using Amazon Kinesis to process millions of records from S3, converting relational data into FHIR-compliant resources with Python
  • Integrated the pipeline with the Clinical Query Language (CQL) framework to calculate healthcare measure values, delivering timely and accurate insights for healthcare analytics and compliance
  • Led a data lake project consolidating data from over 200 clients into Snowflake, enabling centralized storage, improved accessibility, and streamlined analytics
  • Optimized data storage, achieving a 40% cost reduction through advanced data models and compression techniques
  • Automated data pipelines using Apache Airflow, reducing data processing time by 60% and significantly improving data availability
  • Developed and implemented CI/CD pipelines using Jenkins and GitHub Actions, streamlining deployment, ensuring reliability, and supporting agile development
  • Spearheaded cloud migration projects for 4,000+ databases to AWS, achieving a 35% reduction in storage costs and enhanced scalability
  • Partnered with analytics engineers, data scientists, and product managers to ingest new data sources, achieving a 20% improvement in data-driven decision-making
  • Directed Proof of Concept (PoC) initiatives with cross-functional stakeholders, evaluating new technologies to provide scalable solutions and strategic insights with 40% conversion ratio

Senior Architect

Philips Healthcare
Bangalore
06.2017 - 10.2019
  • Spearheaded the creation of a unified data model for the integration of three legacy products using Sybase and Oracle 6.0 into SQL Server 2016, achieving a 40% reduction in data volume, 30% improvement in resource utilization, and 40% increase in query efficiency
  • Integrated Python, Docker, and Kubernetes into data engineering workflows, significantly enhancing scalability, streamlining processes, and supporting continuous integration and deployment

Senior Database Architect

Allscripts
Bangalore
12.2012 - 05.2017
  • Increased data processing speed by 65% through Server Service Broker for parallel bulk data processing; reduced nightly data synchronization time by 60% for the code system repository, optimizing operational efficiency
  • Managed EHR database architecture, conducting in-depth code reviews to uphold high standards of performance, data integrity, and best practices
  • Decreased Non-Functional Requirement (NFR) response time from 5 seconds to under 1 second by implementing advanced indexing, data sharding, and partitioning
  • Engineered robust data warehousing solutions and ETL pipelines, enabling real-time analytics and ensuring compliance with security and governance standards to support complex business needs

Database Architect

Talisma Corporation Pvt. Ltd
05.2011 - 11.2012
  • Created logical models of data structures using ERwin tools.
  • Optimized SQL queries for better performance by creating necessary indexes on tables or by rewriting the query itself.
  • Monitored system resources consumption to ensure optimal utilization of resources.
  • Documented technical designs for new systems including entity relationships diagrams.

Database Architect

EF Information Systems
04.2010 - 04.2011
  • Recommended changes in database design based on changing requirements from users or new technologies available in market place.
  • Developed and implemented automated database maintenance tasks to improve data integrity.
  • Analyzed existing databases and identified areas of improvement for performance, scalability and reliability.
  • Established standards, procedures and policies related to database design, coding, security and backup and recovery operations.
  • Optimized SQL queries for better performance by creating necessary indexes on tables or by rewriting the query itself.
  • Monitored system resources consumption to ensure optimal utilization of resources.
  • Provided technical assistance in troubleshooting issues related to database performance, replication setup or any other issue that may arise during development or production phases.

DBA Manager

Ness Technologies
10.2008 - 04.2010
  • Configured user accounts, privileges, roles, profiles. in the database server.
  • Migrated existing databases from one platform to another with minimal downtime.
  • Resolved customer queries related to database issues promptly and accurately.
  • Ensured that all the stored procedures are optimized for better performance of the system.
  • Performed regular health checks on all databases to ensure optimal performance levels are maintained.
  • Deployed patches and upgrades on a timely basis to keep up-to-date with latest software releases.
  • Conducted training sessions for junior DBAs regarding best practices related to database administration.
  • Analyzed and developed technical and functional specifications for databases.
  • Created and monitored performance metrics to keep databases functioning at optimal capacity.
  • Defined database design specifications based upon project requirements.

SQL Engineer

Microsoft GTSC
05.2006 - 09.2008
  • Reviewed customer complaints regarding product performance or functionality issues.
  • Performed testing to determine functionality or optimization.
  • Advised customers on use of products or services.

Database Consultant

Globsyn Technologies Ltd
04.2004 - 04.2006
  • Identified user requirements by conducting interviews with stakeholders and analyzing existing databases.
  • Created comprehensive documentation on the design of databases including entity relationship diagrams and other diagrams.
  • Designed efficient database structures to store large amounts of data while optimizing performance, scalability, and reliability.

Database Consultant

Andrew Yule & Company Ltd
06.2000 - 03.2004
  • Developed stored procedures, triggers, functions, views, indexes, and other database objects as needed to support application development efforts.
  • Performed regular maintenance activities such as backups, integrity checks, index rebuilds and reorganizations to ensure optimal performance of the databases.
  • Provided technical guidance on best practices related to database architecture and design.
  • Researched new technologies that could be used to improve current processes or develop new solutions for clients' needs.

Consultant

CMC Limited
09.1995 - 05.2000
  • Collaborated with clients to develop action plans to address specific challenges and objectives.
  • Maintained strong relationships with key stakeholders throughout the duration of the project lifecycle.
  • Identified needs of customers promptly and efficiently.
  • Organized meetings between stakeholders to discuss project details and timelines.

Education

Certificate - Applied Computer Science

Computer Management Corporation Pvt. Ltd.
India
10-1995

Bachelor of Science - Computer Science

University of Calcutta
India
06-1994

Skills

Data Architecture and Engineering

  • Data Modeling & Schema Design: Erwin Data Modeler, PowerDesigner, IBM InfoSphere, Oracle SQL Developer, Apache Avro, JSON Schema, DBT, UML
  • Master Data Management (MDM): Enterprise MDM solutions, EMPI, Provider Data Management
  • Data Warehousing: Snowflake, Redshift, and Azure Synapse
  • Data Lakes: AWS S3, Azure Data Lake
  • ETL/ELT Pipelines & Data Integration:Apache Airflow, Informatica, Talend, AWS Glue, SSIS, Fivetran, Kafka Connect
  • Distributed Frameworks: Apache Spark, Kafka, Kinesis
  • Data Lineage, Governance, & Compliance:Informatica Data Governance, Collibra, Talend Data Fabric, AWS Glue Data Catalog, Azure Purview

Cloud Platforms and Infrastructure

  • AWS: S3, Redshift, DynamoDB, Lambda
  • Azure: Azure Data Lake, Azure Synapse
  • Databricks
  • Cloud Data Migration and Integration: AWS DMS, Azure DMS, Informatica Cloud, Fivetran
  • Cloud Cost Optimization and Resource Scaling

Database Management

  • Relational Databases: SQL Server, Oracle, Sybase, PostgreSQL, MySQL, Amazon RDS, Google Cloud SQL
  • NoSQL Databases: MongoDB, Cassandra, DynamoDB, Redis
  • Database Optimization and Performance Tuning
  • Data Security & Encryption:AWS KMS, Azure Key Vault, TLS/SSL, Field & Row-Level Security, TDE, CloudTrail

Programming and Automation

  • Programming languages: SQL, Python
  • Automation & Orchestration:Airflow, Kubernetes, Terraform, Jenkins, Docker, CI/CD, AWS Step Functions
  • Data Validation & Quality Assurance:Pytest, DataBrew
  • API Development & Integration:Swagger, Postman, API Gateway, Azure API Management

Analytics and Reporting

  • BI Tools: Tableau, Power BI, Sigma, QuickSight, Looker
  • Real-Time Data Processing and Advanced Querying

Healthcare Data Standards & Compliance

  • Healthcare Standards Expertise:FHIR, HL7, CDISC
  • Clinical Query Language (CQL) and OMOP
  • Medical Coding Systems: ICD-9/10, SNOMED, CPT, LOINC
  • Claims and Clinical Data Systems:EHR, EMR
  • Regulatory Compliance: HIPAA, GDPR
  • Data Integration and Quality

Interpersonal and Leadership Skills

  • Strategic Leadership and Vision
  • Agile Methodologies: Scrum, Kanban
  • Cross-functional collaboration
  • Stakeholder Engagement and Communication
  • Mentorship and Team Development
  • Problem Solving and Critical Thinking
  • Adaptability and Change Management

Languages

Bengali
First Language
English
Proficient (C2)
C2

Affiliations

  • Embraced unique learning experiences and perspectives through travel, which enrich my adaptability and understanding in cross-functional, multicultural teams.
  • Recognized for Best Innovation Idea twice, celebrated for team leadership, excellence in project delivery.

Timeline

Senior Data Architect

NextGen Healthcare
10.2019 - Current

Senior Architect

Philips Healthcare
06.2017 - 10.2019

Senior Database Architect

Allscripts
12.2012 - 05.2017

Database Architect

Talisma Corporation Pvt. Ltd
05.2011 - 11.2012

Database Architect

EF Information Systems
04.2010 - 04.2011

DBA Manager

Ness Technologies
10.2008 - 04.2010

SQL Engineer

Microsoft GTSC
05.2006 - 09.2008

Database Consultant

Globsyn Technologies Ltd
04.2004 - 04.2006

Database Consultant

Andrew Yule & Company Ltd
06.2000 - 03.2004

Consultant

CMC Limited
09.1995 - 05.2000

Certificate - Applied Computer Science

Computer Management Corporation Pvt. Ltd.

Bachelor of Science - Computer Science

University of Calcutta
Sanjay Karmakar