Experienced Data and ML Infrastructure Engineer with a strong foundation in data science, backed by an MS in Data Science from the University Of Michigan, Ann Arbor and six years of professional experience from companies like University Of Michigan, TCS and GenWave. Proficient in building robust infrastructure and applying advanced algorithms to solve complex problems. Passionate about leveraging expertise to drive innovation and contribute effectively in a collaborative, dynamic team environment.
· Economic Dynamics of Conflict: Modeling U.S. War-GDP Impact
Gathered, and synthesize data from diverse macroeconomic data sources to analyze and model the "Impact of America's Involvement in Global Wars on U.S. GDP." Utilized LSTM, GRU, and 1D CNN for advanced multivariate time series GDP forecasting, applying sliding window techniques and hyperparameter tuning to elucidate the economic implications of global conflicts on the U.S. economy. Link
· Restaurant Radar: Machine-learning web application
Processed over 6 million unstructured Yelp reviews, establishing an AWS data pipeline to an S3 Data Lake and structuring data with Athena using SQL to overcome data challenges. Utilized BERT and fine-tuned the LLM (Llama-2-7B model) in PyTorch for nuanced aspect-based sentiment analysis and topic modeling. Designed a web app to showcase restaurants on Google Maps with sentiment analysis visuals, integrating Flask for the backend and deploying on AWS LightSail with DVC automation for continuous delivery. Link
· Price Pulse Model: Scalable House Price Prediction Model
Designed a scalable pricing model using PyTorch, achieved an RMSE of 0.15 and a cross-validation score of 0.91 with robust MLOps pipelines using MLflow, covering the entire spectrum from data preprocessing to model training, deployment, and regular retraining. Proactively tackled data drift issues to guarantee model robustness and relevance. Link
Python
· Awarded TCS- Star Quarterly for successful migration of scalable data pipelines