Summary
Overview
Work History
Education
Skills
Certification
Accomplishments
Publications
Timeline
Generic
Souvik Chattopadhay

Souvik Chattopadhay

Quantitative Researcher
Kolkata

Summary

Quantitative Researcher with over 5 years of expertise in data analysis and machine learning, focusing on data preprocessing, feature extraction, and statistical modeling.

Overview

6
6
years of professional experience
3
3
Certifications
1
1
Language

Work History

Data Science Intern

Cuvette
10.2025 - 01.2026
  • Wrote complex SQL queries involving JOINs, subqueries, window functions, and aggregations to extract, clean, and analyze structured datasets for downstream machine learning and visualization tasks.
  • Performed data preprocessing and feature engineering, label encoding using pandas and scikit-learn.
  • Identified and treated outliers using Cook’s Distance, zscore and Variance Inflation Factor (VIF); applied quartile-based capping techniques.
  • Implemented regularization methods (Lasso and Ridge) to mitigate overfitting and improve model generalization.
  • Built and evaluated classification models, including Logistic Regression and Decision Trees.
  • Designed and presented interactive Tableau dashboards to communicate key analytical insights.

Research Intern

GSI-Darmstadt
02.2025 - 07.2025
  • Developed a statistical analysis software (cbmroot) using Chi-Square test and Bayes theorem for automatic anomaly detection in histograms.

Consultant

WorldQuant Brain
08.2025 - 01.2025
  • As a Consultant at WorldQuant Brain, developed 10+ alpha strategies that met all platform evaluation criteria, including Sharpe ratio, Calmar ratio, turnover, and maximum drawdown etc. Alpha ideas were sourced from academic finance literature and systematically implemented on the Brain platform.
  • Implemented the Fama–French Five-Factor Asset Pricing Model on U.S. equity datasets, using features such as book value per share, outstanding shares, and time-series ranking operators. Additional alphas included implied-volatility spread signals based on the 120-day call–put volatility differential, as well as sentiment–volume divergence signals derived from social buzz versus trading volume trends.

Ph.D. Student | Senior Research Fellow

Variable Energy Cyclotron Centre
04.2021 - 01.2025
  • Analyzed large-scale experimental datasets, performing pattern analysis, multicollinearity diagnostics, and influence analysis to assess variable relationships and data stability.
  • Developed clustering algorithm on experimental particle-physics data involving particle position and momentum, using distance-based metrics. Implemented k-Nearest Neighbors (kNN) and Agglomerative Clustering, and extracted key cluster-level properties.
  • Designed and trained machine learning classification models for particle identification using TensorFlow and PyTorch, with GPU (CUDA) acceleration.
  • Conducted large-scale experimental data analysis using the ROOT framework for high-energy and nuclear physics experiments.
  • Developed prototype detector for physics experiment in Germany.

Project JRF

Indian Statistical Institute
02.2020 - 11.2020
  • Conducted multivariate analysis on India’s socio-economic conditions using census data.
  • Applied statistical methods and machine learning techniques to identify patterns and trends.
  • Contributed to three peer-reviewed research publications.

Education

M.Sc. - Physics

Jadavpur University
Kolkata
08.2019

B.Sc. - Physics

Jadavpur University
Kolkata
08.2017

Skills

SQL, Python, and C Programming languages Data manipulation and analysis

Version control systems: Git and GitLab

Machine learning libraries: TensorFlow, Keras, PyTorch Classification techniques: Sklearn, KNN, K-Means Cluster analysis methods: Agglomerative LLM transformer expertise: BERT

Data preprocessing with Pandas and NumPy Statistical analysis using SciPy Excel pivot table expertise

Data visualization tools: Matplotlib, Seaborn, Tableau Spreadsheet analysis: Excel (Pivot Chart)

Statistical analysis tools Multivariate analysis Hypothesis testing P-value and chi-square tests

AI tools: ChatGPT and Gemini

Operating systems: Linux and Windows Command line proficiency System administration

Certification

Google Data Analytics (Coursera)

Accomplishments

  • AIR 201 (2020): JEST – TIFR
  • AIR 619 (2020): GATE – Physics
  • AIR 12 (2018): CSIR-NET (LS)
  • INSPIRE Scholar (2014–2019): Govt. of India

Publications

  • Multi-scale analysis of rural and urban areas: a case study of Indian districts – European Physical Journal B (2024)
  • Strata-based quantification of distributional uncertainty in socio-economic indicators – Social Science Journal (2021)
  • Chiral magnetic effect in lattice models of tilted multi-Weyl semimetals – arXiv (2014)
  • Performance of a Real-Size, Low Resistivity Resistive Plate Chamber at GIF++ – JINST (2025)

Timeline

Data Science Intern

Cuvette
10.2025 - 01.2026

Consultant

WorldQuant Brain
08.2025 - 01.2025

Research Intern

GSI-Darmstadt
02.2025 - 07.2025

Ph.D. Student | Senior Research Fellow

Variable Energy Cyclotron Centre
04.2021 - 01.2025

Project JRF

Indian Statistical Institute
02.2020 - 11.2020

M.Sc. - Physics

Jadavpur University

B.Sc. - Physics

Jadavpur University
Souvik ChattopadhayQuantitative Researcher