CARRER OBJECTIVE: To Serve the organization as an efficient DevOps Professional which will enable to contribute the acquired technical skills & enhance further technical abilities which can result in continuous learning that simulate professional and personal growth. Skilled professional with robust background in infrastructure automation, continuous integration, and cloud services. Expertise in scripting, system administration, and container orchestration, ensuring seamless deployments and efficient system operations. Strong focus on team collaboration and adaptability, consistently delivering reliable and effective solutions. Passionate about leveraging technical skills to drive operational excellence and support organizational goals.
The Lab126 DevOpsSystems Engineer will be responsible for building, maintaining, and supporting the systems. The engineer will customize and integrate third-party software while continuously focusing on improving the existing infrastructure to ensure it continues to meet the operational excellence.\
Responsible for maintaining managing services end-to-end, including identifying and evaluating potential software solutions, designing/revamping the production infrastructure, integrating the software/solutions into the infrastructure, and deploying the final product to production. Some of such activities include:\
o Infrastructure (both on-premises and AWS) patching/upgrades/updates, configuration changes, redeployment/rehosting, disk/logical volume maintenance.\
o Security and vulnerability patches/updates; OS (Windows, Linux), Software installs, upgrades, patching/updates.\
o Day-2-day support for internal tools such as requirements management, quality management system, reliability quality analysis system, document management system/wiki, component information system including User/group/cost center onboarding, and safelisting users/groups/cost centers.\
o Emulation lab, HPC platform (Multiple Instances), CI/CD system, code review system, HPC VDI, License Servers, License install/update, Infrastructure testing/verification, python/bash/shell scripting for maintenance, AWS services maintenance such a Workdocs, S3 data aging process, AWS cost evaluation for the current and any new capacity planned.\
o Data backup, recovery and verification process\
o Game Day support for disaster recovery verification/validation.\
Work closely with engineering and business teams (US India teams) to plan, design, analyze, develop, and implement systems for product development such as design simulation.\
Collaborate with US on-site for a 24/7 support plan per customer needs.\
Provide technical leadership in evaluating, integrating, and deploying system solutions\
Some of common Day-2-Day activities include:\
New user onboarding on HPC platforms\
Installing Licenses [ quick SLA - needs prompt action]\
Installing tools on the HPC platforms\
Troubleshooting user issues at times with working sessions during user s convenient time-zone\
Capacity (Server, Storage, Networking) provisioning\
New user onboarding on to Emulation Lab\
Writing RCA/Whitepaper on critical issues\
Attending weekly sync up and daily status meetings\
Taking actions on critical Vulnerability issues within SLA\
Taking actions on critical alerts related to Service end-point, instances and single point of failures\
Providing data on tickets filed and addressed on a daily/weekly basis.