Welcome to DevOps Engineering & Cloud Infrastructure (Core Track)! 🎓
This 20-week advanced track develops practical DevOps capability across CI/CD, container orchestration, infrastructure as code, secure delivery, observability, reliability engineering, and operational excellence.
You will complete production-style labs every week, progressively building an end-to-end platform that integrates pipeline automation, Kubernetes operations, Terraform workflows, security controls, and incident response playbooks.
Your success is our priority. By graduation, you will be able to design, implement, and defend a production-ready DevOps platform aligned to modern engineering and governance expectations.
Detailed Weekly Curriculum
DevOps Foundations & CI/CD Overview
Concept Scope
- DevOps culture, CALMS principles, and value stream thinking.
- Linux administration essentials for production environments.
- Operational mindset for reliability and automation.
Hands-On Scope
- Create the Week 1 textbook folder tree (preclass-notes, lab-01..lab-04, weekly-project).
- Provision Linux + Git tooling and run first CI validation on pull request.
- Troubleshoot a failed pipeline run and document root cause + fix.
Expected Deliverables
- Week-01 repo tree and README runbook.
- CI run evidence with pass/fail logs.
- Failure remediation note with corrected commit.
Docker Fundamentals & Containerization
Concept Scope
- Git workflows, branching strategy, and commit hygiene.
- Repository governance and collaborative development patterns.
- Pull request quality controls and review standards.
Hands-On Scope
- Build a production Dockerfile and deterministic image build process.
- Run compose-based local stack with health checks and environment controls.
- Scan image vulnerabilities and remediate critical findings before publish.
Expected Deliverables
- Docker/compose package with run instructions.
- Security scan report and remediation proof.
- Tagged image artifact with runtime validation logs.
Advanced Docker & Multi-Container Applications
Concept Scope
- CI pipeline stages and quality gates.
- Build reproducibility, dependency management, and test automation.
- Pipeline failure diagnosis and remediation patterns.
Hands-On Scope
- Assemble a multi-container app (API, cache, DB) with startup dependencies.
- Configure network and persistence behavior across containers.
- Run service-failure simulation and verify recovery + data integrity.
Expected Deliverables
- Multi-container topology and manifests.
- Dependency and persistence validation logs.
- Failure simulation report with corrective actions.
Container CI/CD & Automation
Concept Scope
- Container runtime model and image lifecycle.
- Dockerfile best practices and multi-stage builds.
- Image security and deterministic builds.
Hands-On Scope
- Automate build-test-scan-publish stages for container releases.
- Implement gated promotion from staging to production tags.
- Execute rollback drill to a known-stable image.
Expected Deliverables
- CI/CD workflow with release gates.
- Promotion history and approval evidence.
- Rollback drill report with recovery timing.
Kubernetes Architecture & Core Concepts
Concept Scope
- Container networking modes and service communication.
- Volume management and data durability patterns.
- Runtime observability and debugging techniques.
Hands-On Scope
- Deploy first workloads to Kubernetes and validate cluster readiness.
- Create namespace and workload boundaries for platform/app services.
- Troubleshoot one scheduling or image-pull issue end-to-end.
Expected Deliverables
- Kubernetes baseline manifest bundle.
- Cluster readiness and deployment evidence.
- Troubleshooting report with root cause.
Kubernetes Workloads & Configuration
Concept Scope
- Kubernetes control plane and worker node responsibilities.
- Workload primitives: Deployments, ReplicaSets, and Pods.
- Scheduling behavior and resource management.
Hands-On Scope
- Implement ConfigMaps, Secrets, and probe strategy for workloads.
- Run rolling updates and verify zero-downtime behavior.
- Deploy stateful component and validate restart persistence.
Expected Deliverables
- Config and secret management manifests.
- Rolling update validation logs.
- Stateful persistence checklist and results.
Kubernetes Networking & Storage
Concept Scope
- Service discovery and traffic routing patterns.
- Ingress controller fundamentals and path/host routing.
- Network policies and segmentation controls.
Hands-On Scope
- Configure Services, Ingress, and TLS routing for app exposure.
- Provision PV/PVC workflows and validate storage behavior.
- Apply NetworkPolicies and test allowed/blocked communication paths.
Expected Deliverables
- Ingress/service/TLS configuration pack.
- Storage provisioning and durability evidence.
- NetworkPolicy validation matrix.
Advanced Kubernetes Patterns
Concept Scope
- ConfigMaps, Secrets, and secure configuration workflows.
- StatefulSets and persistent volume claims.
- Operational considerations for stateful workloads.
Hands-On Scope
- Apply HPA, anti-affinity, and PDB resilience controls.
- Run controlled failure scenarios to test self-healing.
- Tune resource requests/limits from observed metrics.
Expected Deliverables
- Resilience and scaling manifest set.
- Failure simulation incident log.
- Resource tuning report with before/after metrics.
Helm & Kubernetes Package Management
Concept Scope
- Helm chart structure and templating strategy.
- Values management across environments.
- Chart testing and release lifecycle.
Hands-On Scope
- Package workloads as Helm charts with env-specific values files.
- Publish versioned charts and deploy through release flow.
- Execute chart upgrade + rollback safety test.
Expected Deliverables
- Helm chart repo with version history.
- Environment values and release notes.
- Upgrade/rollback validation evidence.
Infrastructure as Code with Terraform
Concept Scope
- Terraform workflow, state, and dependency graph.
- Provider configuration and module decomposition.
- Idempotent infrastructure delivery principles.
Hands-On Scope
- Provision core infrastructure using modular Terraform code.
- Configure remote state backend with locking and access controls.
- Run validated plan/apply workflow with approvals.
Expected Deliverables
- Terraform module set and documentation.
- State backend/locking configuration evidence.
- Plan/apply logs with approval trace.
Advanced Terraform & Cloud Provisioning
Concept Scope
- Remote state backends and locking strategy.
- Reusable module design for platform teams.
- Environment promotion and drift management.
Hands-On Scope
- Refactor Terraform modules for multi-environment reuse.
- Run drift detection and controlled remediation workflow.
- Add policy-as-code checks to block unsafe plans.
Expected Deliverables
- Reusable module composition examples.
- Drift remediation log and impact summary.
- Policy gate outcomes and exceptions register.
GitOps & Advanced CI/CD
Concept Scope
- Core cloud services for compute, storage, and networking.
- IAM and least-privilege deployment controls.
- Reliability and scalability design decisions.
Hands-On Scope
- Deploy GitOps controller and repository-driven sync model.
- Implement PR-based promotion with approvals and policy checks.
- Trigger and observe automated drift reconciliation.
Expected Deliverables
- GitOps repo structure and app definitions.
- Promotion PR history and sync evidence.
- Drift/reconciliation incident report.
Monitoring, Logging & Observability
Concept Scope
- Release strategies: blue/green, canary, and rolling updates.
- Environment promotion and deployment approvals.
- Rollback strategy and release risk management.
Hands-On Scope
- Instrument services with metrics, logs, and traces.
- Define SLI/SLO targets and configure alerting thresholds.
- Perform outage RCA using telemetry correlation.
Expected Deliverables
- Observability dashboards and alert rules.
- SLO definition and threshold rationale.
- RCA document with corrective actions.
Advanced CI/CD Pipeline Engineering
Concept Scope
- Observability pillars and telemetry architecture.
- Prometheus/Grafana and centralized log pipelines.
- SLO/SLI/SLA model and alert tuning.
Hands-On Scope
- Build reusable multi-stage pipelines and template libraries.
- Enable artifact signing/provenance controls and approvals.
- Optimize pipeline runtime with cache/selective execution.
Expected Deliverables
- Pipeline template library and usage notes.
- Artifact signing/provenance evidence.
- Pipeline optimization benchmark report.
Advanced Deployment Strategies
Concept Scope
- Shift-left security in CI/CD pipelines.
- SAST, dependency scanning, and secret detection.
- Runtime security controls and policy enforcement.
Hands-On Scope
- Implement blue/green and canary deployment paths.
- Configure automated rollback on error threshold breach.
- Run deployment game day to validate safety controls.
Expected Deliverables
- Deployment strategy manifests and runbook.
- Rollback trigger and execution evidence.
- Game day findings and action backlog.
Service Mesh & Advanced Networking
Concept Scope
- GitOps operating model and desired state reconciliation.
- ArgoCD/Flux workflow and sync policies.
- Operational governance for platform teams.
Hands-On Scope
- Deploy service mesh and enforce mTLS between services.
- Apply retries/timeouts/circuit-breakers for unstable dependencies.
- Measure service path latency and failure propagation.
Expected Deliverables
- Service mesh policy configuration bundle.
- Traffic policy validation outputs.
- Service communication topology and latency report.
Advanced Observability & SRE Practices
Concept Scope
- SRE practices, error budgets, and reliability metrics.
- Incident command workflow and communication standards.
- Root cause analysis and corrective action planning.
Hands-On Scope
- Run incident simulation with on-call roles and timelines.
- Track MTTR and error-budget impact during response.
- Execute postmortem and implement prioritized fixes.
Expected Deliverables
- Incident response runbook updates.
- Simulation metrics and MTTR report.
- Postmortem with verified follow-up actions.
DevSecOps & Security
Concept Scope
- Cost visibility, allocation, and optimization strategies.
- Performance tuning methodology and bottleneck analysis.
- Capacity planning and scaling economics.
Hands-On Scope
- Embed SAST/dependency/container checks in CI pipelines.
- Implement secrets management and least-privilege runtime access.
- Map threat scenarios to control implementations and test closure.
Expected Deliverables
- Security pipeline execution evidence.
- Secrets/RBAC control implementation records.
- Threat-model artifact with remediation tracking.
Production Optimization & Cost Management
Concept Scope
- Architect an end-to-end DevOps platform solution.
- Integrate CI/CD, IaC, observability, and security controls.
- Prepare production-readiness evidence and governance assets.
Hands-On Scope
- Baseline cost and performance by service/environment.
- Apply rightsizing/scaling optimizations without SLO regression.
- Configure budget controls and anomaly alerting.
Expected Deliverables
- Cost baseline dashboard and optimization plan.
- Performance/cost comparison report.
- Budget guardrail and alert-response runbook.
Capstone Project & Production Deployment
Concept Scope
- Operational hardening and resilience validation.
- Executive communication of architecture trade-offs and risk posture.
- Final technical defense and roadmap recommendations.
Hands-On Scope
- Deliver complete capstone release from IaC provisioning to production deployment.
- Execute final production readiness review (security, resilience, observability, recovery).
- Present architecture and operations defense with evidence-backed decisions.
Expected Deliverables
- Capstone repository with IaC, CI/CD, and deployment assets.
- Readiness dossier with validation artifacts and sign-off checklist.
- Executive presentation deck and technical Q&A record.