Summary
Overview
Work History
Education
Skills
Leadership and collaboration
Impact and value
Patent
References
Timeline
Generic

Preetish Kumar Tripathi

Bangalore

Summary

Accomplished technical leader with 15+ years of experience in systems architecture, engineering, and site reliability engineering (SRE) across diverse industries such as ad-tech, e-commerce, fintech, telecom, taxation, and fraud analytics (MLOps). Demonstrated expertise in designing reliable and scalable systems while effectively leading high-performing engineering teams. Skilled in managing end-to-end infrastructure and application lifecycles as a Systems Engineer, DevOps Engineer, and SRE.

Overview

16
16
years of professional experience

Work History

Lead SRE

Goldcast.io
10.2022 - 11.2023
  • Built a team of 6 engineers taking care of SRE, Infosec and NOC
  • Migration of microservices to infrastructure built using IaC (Terraform/aws)
  • Built the CI/CD pipeline for the engineering team using github actions
  • Scaling infra based on load using scaling server (Built inhouse tool in Python and Bash)
  • Integration of scaling server with github actions
  • Worked on streamlining End to End Monitoring of the Goldcast Application stack using NewRelic
  • Redesigned all the dashboard based on Business metrics and Customer journey
  • GDPR complaint infra setup in AWS EU region
  • Build Rate-Limiting using cloudflare and automated it using github action so that when the Infrastructure scales, the rate limits gets modified accordingly
  • Setup on-call rotation plan/run-books for 24x7 on-call

Sr Manager SRE/DevOps

PhonePe
07.2019 - 06.2022
  • Company Overview: To get to know the scale at which PhonePe works on, Please visit https://www.phonepe.com/pulse/
  • Worked as an IC to setup the stage environment in our new Data Center and build a frontend for our homebrew cloud (Python Flask, JS, HTML, CSS)
  • Added config search engine and Black Box testing for data sources (Galera, Aerospike, Elastic Search and RabbitMQ and a few more)
  • Full data center migration within 6 months (all micro services and data stores)
  • Build a team from 2 members to 15 direct reportees
  • 2 managers with further reportee under them
  • Responsible for setting clear goals and objectives for the team and managing SRE Sprints
  • Responsible for mid-year and end of year performance evaluation
  • Migration of an entire business vertical (Insurance) from on premise DC to Azure (Terraform + Ansible)
  • Running the Fraud Analytics pipeline (MLOps) with a maximum latency of 60ms per query (The 99%ile was less than 5 ms)
  • My team managed Hadoop Clusters (including the DWH which has around 5 Petabytes of data and ingests 4 billion records per day), Galera Clusters (including the largest galera cluster in PhonePe), Elastic Search, Aerospike, RabbitMQ, Nginx, Mesos, Traefik, Homebrew Cloud and a few more critical components
  • Presently supporting 7 business units end to end (One on Azure and rest in our own DC)
  • Two more business verticals are presently being setup on Azure Mutual Funds and Soundbox)
  • Total TPV across these business verticals is upward of 500 Million USD per month
  • Worked with multiple third party auditors to meet the compliance requirement for all the business verticals
  • To get to know the scale at which PhonePe works on, Please visit https://www.phonepe.com/pulse/

VP-Architect Infrastructure and DevOps

Goods And Service Tax Network
03.2017 - 06.2019
  • Company Overview: Was heading the Infra and DevOps of Good and Service Tax Network, the technical backbone of the one of the largest and most complicated Taxation Network in the world, serving ~12 million tax payers
  • Lead a team of engineers (MSPs) taking care of Infrastructure (Network, Storage, Security, Systems), Platform (Redis, Kafka, Hadoop, Storm, JDG, Kubernetes, Docker, MySQL) and Application (around 40+ different microservices)
  • The DevOps/Infrastructure org was approximately 100 prelaunch
  • 8 months after launch, it was cut down to 50 when the infra and application stabilized
  • Was heading the Business Intelligence and Fraud Analytics team
  • Running Machine Learning Algo on the returns filed and maintaining the mlops pipeline
  • Responsible for providing insights of GST to the various departments of Government of India including PMO, Finance Secretary and Chief Economic Advisor of India
  • Was heading the Infra and DevOps of Good and Service Tax Network, the technical backbone of the one of the largest and most complicated Taxation Network in the world, serving ~12 million tax payers

Lead Dev Operations Engineer

WalmartLabs
05.2015 - 03.2017
  • Initiated a project on running cassandra clusters under docker containers
  • The orchestration is done using CHEF + Python
  • Worked on PCI check automation for our entire fleet (20000+ Hosts)
  • Worked on Inventory management system where I wrote the client which captures data from the systems at regular intervals and does a POST to our framework
  • Also wrote a couple of API's for the backend for group aggregation
  • Writing spec files for packaging DevOps tools which we build for automation purpose
  • Automating Sanity checks for our build jobs
  • Worked as Technical Duty Officer, taking part in P1/P2 incidents for our 5 ecommerce pillars
  • As a TDO, it was my responsibility to do whatever it takes to bring the site back up

Lead Dev Operations Engineer (Architect Unix/Linux Infrastructure)

Citrix R&D
10.2013 - 05.2015
  • Worked on Foreman, Cobbler, Puppet, Cloudstack, Xen, AWS, Zenoss, Proteus IPAM, Strongmail, Sendmail
  • Was managing around 7000 Baremetal hosts and 3000 VMs
  • Wrote Host Build/Decommission Automation in Perl which took care of adding/deleting the host profile to Cobbler, Registering/UnRegistering it with Foreman and Puppet, Adding/Deleting DNS entries, Adding/Deleting Zenoss entry
  • Wrote SAN auditing for our entire fleet in Perl
  • Wrote Cloudstack load testing which would create upto 100 VM's and do sanity check on them and then destroy them
  • Automating kernel compilation for custom kernels used in Test environments

Technical Lead Unix/Linux

Yahoo
10.2011 - 10.2013
  • Was Technical Lead DevOps for Advertising Platform which had a fleet of around 16000 boxes responsible for Targeted Ads
  • Worked on Data Center Consolidation project moving from 13 DC to 6 DC
  • Worked on Migrating monitoring from Nagios to Inhouse Monitoring as a Service
  • Wrote Application endpoint data monitoring framework using Perl
  • Setup log monitoring using Flume and Splunk for a component responsible for generating 50TB of compressed logs per month

Manager India

Ipaccess
07.2010 - 10.2011
  • Was hired to setup the entire Datacenter and Office from scratch in India and did it single handedly
  • The services setup included, DNS, DHCP, VPN tunnels to various centers, Iptable Firewall, NFS, Autofs, PXE boot environment, Quagga, configured switches and Windows AD
  • Stripping RHEL to run on embedded devices
  • Reducing the size of kernel by disabling features and reducing the default config values

Engineer

Red Hat Software Services
02.2008 - 07.2010
  • Worked on issues related to kernel, Filesystem, Software RAID, LVM, Multipath, QPID AMQP Messaging, Condor GRID and Performance issues for all the RHEL provided services
  • Wrote detailed knowledge bases for rhn.com (now known as access.redhat.com)

Education

B.E - Electrical Engineering

Uttar Pradesh Technical University

Skills

  • Docker
  • Kubernetes
  • Jenkins
  • Terraform
  • Ansible
  • Puppet
  • Salt
  • AWS
  • Azure
  • CloudStack
  • Xen
  • Hadoop
  • Kafka
  • Cassandra
  • Redis
  • Neo4j
  • MySQL
  • Elastic Search
  • Aerospike
  • DNS
  • DHCP
  • TCP/IP
  • GDPR
  • Perl
  • Python
  • Bash

Leadership and collaboration

  • Servant Leader: Committed to the success of associates, promoting a collaborative, inclusive work environment.
  • Effective Communicator: Builds influence through clear goal-setting, defining expectations, and consistent follow-through.
  • Talent Acquisition: Skilled in creating hiring strategies to build high-caliber engineering teams.
  • Project Management: Experienced in continuous planning, incremental execution, and fostering ongoing improvement.

Impact and value

  • Goldcast.io, Built and led a team for SRE, InfoSec, and NOC; created scalable infrastructure using IaC (Terraform/AWS), set up CI/CD pipelines, optimized application monitoring with NewRelic, and established 24x7 on-call support.
  • PhonePe, Managed a large team overseeing critical components, including a 5 PB data warehouse and fraud analytics pipeline. Led full migration of a business vertical to Azure, achieving compliance with stringent security standards.
  • Goods and Service Tax Network, Headed the infrastructure and DevOps for India's GST system, processing over 5 billion invoices and $136 billion in payments. Led a team managing around 40 microservices, BI, and fraud analytics, providing insights for government stakeholders.
  • WalmartLabs, Pioneered deployment of Cassandra clusters in Docker, automated PCI compliance for 20,000+ hosts, and served as TDO for high-priority incidents across e-commerce platforms.
  • Yahoo! and Citrix R&D, Oversaw large-scale Unix/Linux infrastructure, optimizing monitoring, log analysis, and data center consolidation.

Patent

https://patents.justia.com/inventor/preetish-kumar-tripathi

References

Available on request

Timeline

Lead SRE

Goldcast.io
10.2022 - 11.2023

Sr Manager SRE/DevOps

PhonePe
07.2019 - 06.2022

VP-Architect Infrastructure and DevOps

Goods And Service Tax Network
03.2017 - 06.2019

Lead Dev Operations Engineer

WalmartLabs
05.2015 - 03.2017

Lead Dev Operations Engineer (Architect Unix/Linux Infrastructure)

Citrix R&D
10.2013 - 05.2015

Technical Lead Unix/Linux

Yahoo
10.2011 - 10.2013

Manager India

Ipaccess
07.2010 - 10.2011

Engineer

Red Hat Software Services
02.2008 - 07.2010

B.E - Electrical Engineering

Uttar Pradesh Technical University
Preetish Kumar Tripathi