Summary
Overview
Work History
Education
Skills
Websites
Projects
Certification
Timeline
Generic
HARSH ANANT

HARSH ANANT

Kolkata

Summary

Results-driven developer with over 3+ years of experience in delivering projects for insurance client. Expertise in developing python scripts and ETL jobs ensuring successful project delivery. Good understanding of machine learning and deep learning algorithms and hands on experience in implementing them using tensorflow, keras and scikit-learn. Proficient in developing, training, feature engineering, model evaluation (cross-validation, accuracy metrics), and optimizing performance through hyper parameter tuning and regularization techniques for predictive analysis, classification and regression tasks.

Overview

4
4
years of professional experience
1
1
Certification

Work History

Associate

Cognizant Technology Solutions
Kolkata
07.2021 - Current
  • Developed a daily Talend ETL workflow using Java and SQL to ingest and transform data by syncing new account enterprise identifiers from source systems with Operational MDM. Implemented matching logic based on Policy ID, or fallback on Name, Address, and Identifiers, processing over 23K records per run to ensure data consistency and reliability.
  • Engineered complex SQL queries for large-scale data analysis, joining over 10 disparate data sources, and handling 10+ million records to extract actionable insights, supporting financial risk and compliance reporting.
  • Automated a previously manual process using Talend to insert or soft-delete records in a pre-existing Excel sheet by matching key fields from input files stored in an S3 bucket, enhancing operational efficiency by 99%.
  • Built and deployed a monthly Talend job using Java and SQL to enable data synchronization of enterprise identifiers between Analytical and Operational MDM systems. Ensured real-time data consistency and integrity across platforms, handling over 1.9 million records per run.
  • Authored JIL scripts for AutoSys to orchestrate and schedule Talend jobs, providing automated and reliable execution of end-to-end data workflows.
  • Deployed Talend jobs across multiple environments using Jenkins and uDeploy, streamlining continuous integration and delivery (CI/CD) for production support and platform operations.
  • Collaborated in evaluating and adopting modern data engineering tools and big data technologies to drive innovation, scalability, and performance improvements in data ingestion and processing workflows.
  • Deployed Talend jobs to the server using Jenkins and uDeploy, ensuring consistent and efficient deployment across environments.

Education

Bachelor of Technology - Computer Science

Kalinga Institute of Industrial Technology University
08.2021

Skills

  • Python
  • Java
  • Bash
  • SQL
  • MySQL
  • Oracle
  • Statistical analysis
  • Predictive modeling
  • Machine learning
  • Neural networks
  • Tensorflow
  • Keras
  • Scikit-learn
  • Google Colab
  • Github
  • Jupyter
  • Talend Open Studio
  • IDMC data integration
  • Putty
  • Winscp
  • Azure
  • Jenkins
  • Udeploy
  • Autosys

Projects

Compare the performance of different machine learning and deep learning algorithms on the data set

  • Compared performance of different ML classifiers such as Logistic Regression, KNN, Decision Tree and Random Forest on one multiple class data set, and different fully connected Deep Neural Network architectures with optimizers such as SGD, Adam and RMSprop and also with dropout for any one of the architectures on other multi class data set with respect to recall, precision and f1 score.
  • Decision Tree achieved 99.9% accuracy in testing, as well as in training
  • Deep neural network with Adam optimizer achieved 52% accuracy in 12 epochs of training and validation

Design the Convolutional Neural Network and Test the Accuracy on the Data Set

  • Read the research paper on ‘Wild Animal Classifier Using CNN’ from ‘ResearchGate’.
  • I took the wildlife data set from Kaggle, followed the steps of data pre-processing, CNN model development, and training
  • Model achieved 95% training accuracy, and 92% testing accuracy

Certification

Microsoft Certified: Azure Data Scientist Associate

Timeline

Associate

Cognizant Technology Solutions
07.2021 - Current

Bachelor of Technology - Computer Science

Kalinga Institute of Industrial Technology University
HARSH ANANT