Scientific Computing & ML Platform (Novartis)
- Refactored a legacy monolith into 3 microservices using DDD, improving maintainability and system isolation.
- Overhauled CI/CD pipelines (Jenkins/Ansible), reducing execution time from 3 hours to 30 minutes.
- Standardized environments using Singularity containers and optimized service modules for idempotent operations within HPC clusters.
- Led OS migrations with zero downtime and 24/7 availability for GPU-accelerated ML workloads.
MD Projects – Reliability Engineering (Medecision)
- Managed SLIs/SLOs across distributed services with Dynatrace, Datadog, and Splunk for proactive incident detection.
- Led root cause analysis (RCA) for complex application failures and implemented structural fixes.
- Executed critical Oracle database operations and maintained high-volume ETL/Batch stability.
Cloud Migration & Data Platforms (Biogen)
- Orchestrated end-to-end migration of data workflows to AWS with VPC, EC2, Glue, Lambda, and API Gateway.
- Designed scalable cloud environments with Terraform and CloudFormation for reproducible infrastructure.
- Established GitOps practices using GitHub Actions and created CI/CD templates.