Team Size: 3
Project description:
- This project involves installing, upgrading, and managing integration tools (Informatica, Boomi), data orchestration tools (Apache Airflow on Astronomer and MWAA), and container orchestration platforms (Kubernetes, EKS, ECS), and providing administrative support to the project teams that use them.
- Upgrade the Informatica domain and applications from older versions (9.1, 9.5, 9.5.1, 9.6.1) to the latest version (10.2).
- Upgrade Apache Airflow, the Astronomer platform, and MWAA from earlier versions (1.10.5, 1.10.10) to the latest supported Airflow versions (2.0, 2.1).
- Upgrade the Kubernetes cluster from earlier versions (1.16, 1.17) to the latest supported versions (1.18, 1.19, 1.20).
- Identify the technologies and environments that will be impacted by the upgrades and communicate with stakeholders and teams.
- Identify points of contact and define pre- and post-upgrade steps and checks.
- Manage the Informatica upgrade and coordinate with support teams for other technologies.
- Keep stakeholders informed about the upgrade status.
- Support the client by collaborating with application, DBA, UNIX, and OS teams and delivering solutions within SLA.
Responsibilities:
Astronomer Airflow + Kubernetes
- Planning installation and upgrades of the Astronomer platform, MWAA, Apache Airflow, Kubernetes (EKS) clusters, and other dependent services.
- Performing upgrades and configuration changes during planned weekend maintenance windows.
- Creating migration plans and assisting project teams in successfully migrating their DAGs from older to newer versions of Airflow.
- Onboarding project teams on Astronomer by provisioning an Airflow instance on shared EKS clusters and providing access to requested users.
- Integrating Bitbucket with Astronomer-managed Apache Airflow to enable versioning of code and CI/CD pipelines for continuous, smooth deployments.
- Creating replica sets of Airflow components, such as the web server, to facilitate blue-green deployments.
- Creating cross-account IAM roles and VPC peering between AWS accounts to integrate various AWS services such as S3, RDS, EMR, Redshift, AWS Secrets Manager, and others with Apache Airflow instances.
- Onboarding project teams on MWAA by provisioning new MWAA instances for each team using AWS CloudFormation templates, providing user access, and integrating AWS services with MWAA.
- Integrating MWAA with ECS Fargate and CloudWatch so that Airflow can run custom OS scripts on Fargate.
- Creating MWAA disaster recovery (DR) plans and performing DR activities by backing up DAGs, variables, connections, and secrets and restoring them in the DR region.
- Configuring SMTP servers to enable DAG success or failure alerts on Airflow for each project team.
- Configuring, enabling, and monitoring alerts for each Airflow instance on a cluster using tools like Prometheus, Grafana, AWS CloudWatch, and Splunk.
- Managing Astronomer platform deployments and Airflow deployments on Kubernetes (EKS) clusters.
- Creating daily backup scripts to export Helm deployments, variables, and connections of each Airflow instance on a cluster, and uploading them to S3.
- Developing custom roles to enable DAG-level authorization in Airflow for different operations teams within a project.
- Managing monthly chargebacks for Airflow instances for each project team.
- Decommissioning Airflow instances and other AWS services integrated with Airflow.
- Troubleshooting performance issues in project teams' DAGs running on Airflow with the Kubernetes executor.
- Understanding project teams' use cases and helping them build data pipelines on Airflow, with guidance on best practices for authoring and scheduling DAGs.
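As an illustration of the daily backup scripts listed above, here is a minimal sketch of how one instance's variables and connections might be serialized and given dated S3 object keys. The function names, the `airflow-backups/...` key layout, and the sample data are hypothetical; exporting Helm deployments, reading values from a live Airflow instance, and the actual S3 upload are left out.

```python
import json
from datetime import date


def backup_key(instance: str, kind: str, day: date) -> str:
    """Build a dated object key such as airflow-backups/team-a/2021-06-01/variables.json."""
    # The key layout is an assumption made for this sketch.
    return f"airflow-backups/{instance}/{day.isoformat()}/{kind}.json"


def build_backup(instance: str, variables: dict, connections: dict, day: date) -> dict:
    """Map each object key to the JSON payload that would be uploaded for one instance."""
    payloads = {"variables": variables, "connections": connections}
    return {
        backup_key(instance, kind, day): json.dumps(data, sort_keys=True, indent=2)
        for kind, data in payloads.items()
    }


if __name__ == "__main__":
    # Hypothetical sample data; a real script would read these from the Airflow
    # instance and then upload each payload to S3.
    backup = build_backup(
        "team-a",
        {"env": "prod"},
        {"redshift_default": {"conn_type": "postgres", "host": "example.internal"}},
        date(2021, 6, 1),
    )
    for key in sorted(backup):
        print(key)
```

Keeping key construction separate from the upload step lets the naming scheme be checked without AWS access.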
Informatica + Autosys
- Planning and implementing the installation and upgrade of Informatica PowerCenter from v9.6 to v10.1.
- Performing upgrades and configuration changes during planned weekend maintenance windows.
- Creating migration plans and assisting project teams in successfully migrating their workflows from lower to higher versions of Informatica.
- Onboarding project teams to Informatica by provisioning an Informatica repository and its associated services on a shared on-premises Linux cluster, and providing access to requested users.
- Executing Informatica disaster recovery (DR) activities and end-to-end testing by switching all domains, repositories, and services to point to the DR region.
- Managing monthly chargebacks for Informatica and Autosys for each project team.
- Creating quarterly reports such as the Royalty report for Informatica and periodic access reviews.
- Decommissioning Informatica repositories and their associated services.
- Troubleshooting performance issues faced by project teams on Informatica by analyzing session logs and database sessions for each query.
- Developing daily backup check scripts to verify Informatica repository and domain backups in a cluster, and emailing a consolidated report to the support team.
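The daily backup check described in the last item could look roughly like the sketch below. It assumes the timestamp of each backup has already been collected, and it only classifies the backups and builds the report text. The function names, the 24-hour threshold, and the report layout are hypothetical, and sending the email is omitted.

```python
from datetime import datetime, timedelta


def check_backups(last_backup: dict, now: datetime,
                  max_age: timedelta = timedelta(hours=24)) -> dict:
    """Return a status of OK, STALE, or MISSING for each expected backup."""
    status = {}
    for name, taken_at in last_backup.items():
        if taken_at is None:
            status[name] = "MISSING"   # no backup file was found
        elif now - taken_at > max_age:
            status[name] = "STALE"     # newest backup is older than the threshold
        else:
            status[name] = "OK"
    return status


def format_report(status: dict) -> str:
    """Build the consolidated report body: a summary line, then one line per backup."""
    failures = sum(1 for state in status.values() if state != "OK")
    lines = [f"{name}: {state}" for name, state in sorted(status.items())]
    return "\n".join([f"Backup check: {failures} problem(s) of {len(status)}"] + lines)


if __name__ == "__main__":
    # Hypothetical sample timestamps; None marks a backup that was not found.
    now = datetime(2021, 6, 2, 6, 0)
    sample = {
        "repo_a": datetime(2021, 6, 2, 1, 0),
        "domain": datetime(2021, 5, 30, 1, 0),
        "repo_b": None,
    }
    print(format_report(check_backups(sample, now)))
```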