Summary
Overview
Work History
Education
Skills
Websites
Timeline
Generic

Tanya Sri Pati

Irving

Summary

Highly skilled Data Analyst with over 3 years of experience in data analysis, machine learning, and cloud technologies. Proficient in Python and SQL, with expertise in libraries like Pandas, NumPy, Scikit-Learn, NLTK, and BeautifulSoup. Experienced with relational databases such as MySQL and MS SQL, and NoSQL databases like MongoDB. Strong experience in building interactive dashboards and performing data visualizations using tools like Tableau, Power BI, and MS Excel. Certified AWS Cloud Practitioner with hands-on expertise in AWS services (EC2, S3, Lambda) for scalable data processing and model deployment. Proficient in ETL pipelines, data cleaning, data wrangling, and statistical modeling. Skilled in applying machine learning techniques like Regression, KNN, Decision Trees, Naive Bayes, and Random Forest for predictive analytics. Experience working with large datasets (up to 10TB) and optimizing cloud-based workflows for data processing and reporting. Strong background in data mining, data warehousing, and optimizing ETL processes. Adept at delivering high-quality, scalable analytics solutions to drive data-driven decision-making. Certified in Emotion AI-Facial Key-Points Detection, with experience in NLP and sentiment analysis of unstructured data.

Overview

4
4
years of professional experience

Work History

Data Analyst

CVS Health
07.2024 - Current
  • Applied NLP techniques using libraries (NLTK, BeautifulSoup) for text analysis and sentiment analysis, extracting meaningful insights from unstructured data.
  • Optimized complex SQL queries through stored procedures and triggers for efficient data extraction, analysis, and reporting. Implemented subqueries, joins, window functions, and CTEs (Common Table Expressions) to handle large datasets.
  • Implemented techniques such as Regression, KNN, Decision Trees, Naive Bayes, and Random Forest for predictive modeling, mainly focusing on analyzing customer behavior and market trends, providing valuable insights for strategic decisions.
  • Managed and maintained databases using MS SQL and MongoDB, executing efficient ETL (Extract, Transform, and Load) pipelines for data preprocessing.
  • Utilized Git for version control and collaborative work on Python scripts and SQL queries, ensuring consistency in data models and analysis workflows.
  • Developed ad-hoc reports in Excel for clients using multiple spreadsheets and advanced functions like VLOOKUP, Pivot tables, Power Query.
  • Designed interactive dashboards using Power BI, incorporating complex data transformations. Visualized KPI’s (Key Performance Indicators), facilitating data-driven strategies for business growth.
  • Analyzed datasets exceeding 10 terabytes on AWS, employing EC2, EBS, S3, and Lambda. Boosted data processing efficiency and reduced operational costs by optimizing cloud resource usage and automating analytics workflows, leading to more efficient and cost-effective data handling solutions.
  • Created and maintained SOP documents for analytical processes for standardization and delivering high-quality reports.

Data Analyst

Trigent Software Ltd
06.2021 - 05.2022
  • Performed data cleaning and wrangling techniques, optimizing data quality and integrity across multiple projects.
  • Utilized Python libraries (Pandas, NumPy, SciPy) and SQL for data transformation and normalization, reducing data discrepancies and increasing the efficiency of data processing workflows.
  • Executed complex SQL queries (subqueries and window functions) to extract meaningful insights from relational databases, maximizing query performance.
  • Conducted statistical analysis on large datasets, some exceeding 5 terabytes in size, to identify trends, correlations, and patterns.
  • Utilized Seaborn and Matplotlib libraries in Python for building statistical visualizations, resulting in effective communication and improved predictive model accuracy.
  • Developed Tableau dashboards and integrated Excel reports, decreasing reporting time and enabling faster data-driven decision-making for stakeholders.
  • Led a critical data migration project in MySQL, increasing database performance and normalized datasets to 3NF, resulting in a reduction in data redundancy and improved retrieval efficiency for strategic business analyses.

Education

Master of Science - Data Analytics Engineering

George Mason University
Fairfax, VA
05.2024

Bachelor of Engineering - Computer Science and Engineering

Jawaharlal Nehru Technological University
05.2022

Skills

  • Programming Skills: Python, SQL
  • Libraries: NumPy, Pandas, SciPy, Seaborn, Matplotlib, Scikit-Learn, BeautifulSoup, NLP, NLTK
  • Databases: MySQL, MS SQL, MongoDB
  • Visualization Tools: Tableau, Power BI, MS Excel
  • Certifications: AWS Certified Cloud Practitioner, Emotion AI-Facial Key-Points Detection by Coursera
  • Analytical Skills: Data Cleaning, Data Mining, Data Warehousing, Statistical Modelling, Data Wrangling, ETL, Data Visualization
  • Cloud Technologies: AWS (EC2, EBS, S3, Lambda)
  • ML Techniques: Regression, KNN, Decision Tree, Naive Bayes, Random Forest
  • Tools: Git, SSMS, SSRS, Anaconda, Visual Studio Code

Websites

Timeline

Data Analyst

CVS Health
07.2024 - Current

Data Analyst

Trigent Software Ltd
06.2021 - 05.2022

Bachelor of Engineering - Computer Science and Engineering

Jawaharlal Nehru Technological University

Master of Science - Data Analytics Engineering

George Mason University
Tanya Sri Pati