Results-driven ETL Developer with over four years of experience in designing and implementing robust data integration solutions. Proven track record in optimizing ETL processes, enhancing data quality, and reducing processing time by 50%. Proficient in SQL, Python, and various ETL tools, with a strong focus on leveraging data analytics to inform business decisions. Committed to continuous improvement and staying current with emerging technologies in the data field.
Overview
2
2
years of professional experience
Work History
Senior Associate(Data Engineer)
Morgan Stanley
BANGALORE
06.2023 - Current
Transformed an unorganized PanDQ in-house product into an organized shape to enhance the Data Quality Platform. Oversaw the management of various components, such as Rules Manager, Rules and Model Datastores, a Data Quality Engine, a Python-based ETL tool, a distributed Exception Datamart in Greenplum, and a Tableau Dashboard for reporting and analytics, with Exception management.
Designed and executed complex ETL workflows using tools like Databricks Workflows and Python, facilitating seamless data extraction, transformation, and loading across various platforms.
Proficient in SQL development, crafting and optimizing queries to support data retrieval, reporting, and advanced analytics.
Performed intricate data loads and transformations, parsing, and formatting datasets to meet analytical and business requirements.
Led the onboarding process and managed a team of five members.
Integrated data from multiple sources and formats into consolidated master data loads, enhancing accessibility and utility for stakeholders.
Consulted with multiple stakeholders to fulfill their data quality requirements while obtaining validation from the internal audit and the data governance team.
Utilized advanced analytics tools, such as Excel PowerPivot, to manipulate large volumes of structured and unstructured data sets.
Collaborating with business users globally and extensive experience in client onboarding to define requirements, analysis, development, and rule certification.
Primary focus in Data Quality, Data Analysis, ETL, Data Warehousing, Metadata Management.
Generated test case scenarios, performed unit testing, and built database tables necessary to configure the metadata for the automated process.
After comparing and analyzing tables from various environments to identify any discrepancies in the data, reports were generated and created on the basis of comparison.
Experience with JIRA and GitHub to create a productive, high-quality development environment.
Having practical experience with Putty, a solid understanding of Linux commands, and an understanding of Jenkins' CI/CD functionality.
Enthusiastic about learning business concepts, with strong interpersonal and analytical abilities.
Developed an efficient solution for easy log access and error handling across all DQ components.
Enhanced efficiency by developing a standardized setup procedure that automates the onboarding of client data models, the creation of DQ rules, and real-time verification to maintain rule integrity.
Provided technical support in resolving data processing or storage system issues, and analyzed process gaps to offer solutions.
Analyzed large datasets to identify trends, patterns, and correlations for business insights.
Recognized for Exceptional Delivery and Honored as "Rookie: Debut Talent" for Simplified onboarding procedure, cutting down onboarding time to incorporate multiple departments into PANDQ from 3 months to merely 1 week for Data Quality and Data Governance.