Manasseh Mmadu

Senior Site Reliability Engineer

About Me

I break things in production so you don't have to (just kidding... mostly). I spend my days wrangling Kubernetes clusters, convincing cloud providers to give us better pricing, and making sure systems stay up even when they really don't want to. I've saved companies hundreds of thousands of dollars—turns out deleting unused resources is easier than explaining to finance why we need them. When I'm not migrating global routing layers or building controllers that nobody asked for but everybody needs, you'll find me contributing to open-source projects or teaching monitoring stacks to actually tell me useful things. I believe in automation, reliability, and the occasional well-timed meme in the incident channel.

Experience

Zapier Inc., USA

Senior Site Reliability Engineer (Remote)

  • Led the zero-downtime migration of Zapier's global routing layer from Kubernetes NGINX + CloudFront to a serverless Envoy + Lambda@Edge architecture, enabling canary rollouts and region-aware routing while reducing operational complexity
  • Drove organization-wide cloud optimization initiatives that identified $400k+ in annual cost savings through AWS VPC endpoint integration, resource cleanup, and infrastructure rightsizing across multiple teams
  • Spearheaded security and reliability improvements, including enforcement of signed Git commits, zero-downtime Kubernetes upgrades, and development of reusable infrastructure patterns aligned with platform strategy

Zapier Inc., USA

Site Reliability Engineer (Remote)

  • Built Kubernetes controllers including mutating admission webhooks and Kubechecks, reducing image pull costs by $100k annually and enabling faster release processes for 5,000+ applications
  • Improved monitoring infrastructure reliability by sharding Prometheus and implementing Thanos HA with query caching, increasing Grafana and Thanos availability from 99.5% to 99.9%
  • Led platform migrations (Heroku to AWS, Elasticsearch to OpenSearch) with minimal downtime and extended SLO operators to automate observability across the organization

Deimos

Site Reliability Engineer (Remote)

  • Created and maintained cloud infrastructure on AWS, GCP, and Azure
  • Automated infrastructure provisioning using Terraform and Infrastructure as Code practices
  • Set up and maintained Kubernetes clusters using Kops, kubeadm, and managed cloud services
  • Implemented monitoring solutions using Elastic Stack and Prometheus
  • Deployed applications on Kubernetes using Helm, Kustomize, and ArgoCD
  • Built and maintained CI/CD pipelines using Azure DevOps and GitLab CI

eHealth4Everyone

Backend Engineer (Remote)

  • Developed and maintained Django applications with a focus on performance optimization and test coverage
  • Implemented background task processing using Celery for infrastructure automation with Ansible
  • Built SaaS platform for automated DHIS2 server deployment using Docker, Ansible, and RabbitMQ
  • Upgraded legacy applications from Django 1.11/Python 2 to Django 2.0/Python 3
  • Established CI/CD pipelines using GitLab CI for automated testing and deployment
  • Designed and implemented responsive web interfaces using Django templates and Bootstrap

Education

Federal University of Technology, Minna

January 2014 - November 2019

Bachelor of Engineering in Computer Engineering

  • Had some of the best classmates around, with whom I learnt the concepts of Computer Architecture, Data Structures, and many other computing topics. Also worked with Arduino for IoT devices, Raspberry Pi, and Assembly Language (: as well.
  • I was involved in a lot of projects, including a school research group focused on the advancement of SDNs and WSNs.
  • Graduated with First Class Honors
  • I was also one of the founders of the FUT Developers Circle, a developer community that mentors upcoming developers.

Projects

kubechecks

Infrastructure validation tool for Kubernetes deployments. Provides automated checks and validation for Kubernetes manifests and infrastructure configurations before deployment.

Reka

Cloud resource management tool to destroy, stop, resume, or clean up unused resources across multiple cloud providers (AWS, GCP, Azure).

Search Engine Parser

Python package to query popular search engines and scrape result titles, links, and descriptions. Supports multiple search engines with a unified interface. Widely used in the open-source community.

Signalum

Linux tool to detect and analyze WiFi and Bluetooth connections using Python. Features both CLI and GUI applications for network signal monitoring and analysis.

Gophie

Document retrieval system built with Go. Provides efficient search, streaming, and download capabilities with an ad-free interface. Includes API, web, and CLI interfaces.

Check out more on my GitHub or my open-source organization

Languages

Cloud & Infrastructure

Monitoring & Observability

CI/CD & DevOps Tools

Frameworks & Development

Certifications

Kubernetes and Cloud Native Application Development

Credential ID: Available at credential.net

Linux Kernel Internals and Development (LFD420)

Linux Foundation Training

Hobbies

Get in Touch