Available for new projects

Hi, my name is
Ajay.

I build reliable infrastructure & scalable MLOps platforms.

>

Over 6+ years working across AWS, Azure, GCP, Kubernetes, and Python. I bridge the gap between infrastructure and AI, ensuring that production systems and machine learning models scale predictably, securely, and silently.

{ }

01.About Me

I'm a DevOps & MLOps Engineer who treats infrastructure as a codebase and machine learning as a first-class production citizen. I keep Terraform modules small, well-named, and commented, so on-call engineers can reason about changes under pressure.

I usually end up owning public cloud foundations, VPC layouts, EKS/GKE clusters, and CI/CD deployment paths. Recently, my focus has been on architecting MLOps pipelines—moving ML workloads to Kubernetes and building automated model deployment pipelines that data scientists actually enjoy using.

Security is folded into my daily work—wiring IAM roles and policy checks into CI so least privilege is a default, not an afterthought.

Core Philosophy

  • Predictability over Magic: Make deployments feel boring. Standardize health checks and rollout strategies.
  • Code-Driven Ops: Wire CI/CD pipelines so code, infrastructure, and Helm changes follow identical review habits.
  • Actionable Alerts: Live in logs and metrics. Trim noisy alerts until only real problems wake people up.
  • Structural Clarity: Provide concise incident runbooks, post-incident reviews, and architecture diagrams.
  • ML Reliability: Treat model deployments like software releases. Ensure model serving is scalable, reproducible, and observable.

02.Technical Arsenal

Cloud Platforms

  • AWS (EC2, S3, RDS, EKS, Lambda, IAM)
  • GCP (Compute, GKE, Cloud SQL, IAM)
  • Microsoft Azure

Infrastructure as Code

  • Terraform
  • AWS CloudFormation
  • Ansible, Chef

Containers & Orch.

  • Kubernetes (EKS, GKE)
  • Docker
  • Helm

CI/CD & Version Control

  • GitHub Actions
  • Jenkins (Groovy Pipelines)
  • GitLab CI
  • Git, GitHub, GitLab

Languages & Linux

  • Python (Requests, boto3)
  • Go, Bash
  • Ubuntu, Amazon Linux, RHEL/CentOS

Observability & Security

  • Prometheus, Grafana, ELK Stack
  • CloudWatch, Stackdriver
  • AWS KMS, HashiCorp Vault
  • Security Groups, VPC Policies

AI & MLOps

  • MLflow, Kubeflow
  • Model Serving (Triton, Seldon)
  • SageMaker / Vertex AI
  • Vector DBs (Milvus/Pinecone)

03.Where I've Worked

DevOps/MLOps Engineer @ T-Mobile

Jun 2024 – Present | Seattle, WA

Built core AWS infrastructure using Terraform with S3 state and DynamoDB locking. Moved ML workloads from EC2 to EKS with auto-scaling to prevent job collisions. Set up GitHub Actions for model services including image builds and Helm deploys. Replaced IAM keys with IAM roles (IRSA) for least privilege.

Software & DevOps Engineer @ Treevah

Feb 2023 – Jun 2024 | Chicago, IL

Owned day-to-day AWS operations (EC2, RDS). Refactored legacy CloudFormation into modern, reusable Terraform modules. Enforced Git workflows for IaC. Wrote Python/boto3 tools for user automation, security audit checks, and migrated monolithic services to Serverless AWS Lambda.

DevOps Engineer @ Apollo Hospitals

Nov 2019 – Dec 2022 | Hyderabad, India

Administered daily operations for GCP (Compute Engine, Cloud SQL, GKE) treating infrastructure as code via Terraform. Containerized apps via GKE, configuring Cloud Build pipelines with image signing, and managed Stackdriver alerts via SLO-style dashboards to maximize performance.

04.Education & Certs

Academia

Master's in Information Technology

DePaul University | Chicago, IL

Jan 2023 – June 2024

Bachelor's in Computer Science

JNTU | India

March 2016 – March 2020

Certifications

  • AWS Certified DevOps Engineer – Professional

  • Microsoft Certified DevOps – Expert

  • Google Cloud AI Pathways

    • Intro to Generative AI Specialization
    • Intro to Large Language Models
    • Responsible AI Applying Principles