Back to Full Curriculum
SE601Semester 63 (2-0-2)Major

Building Scalable Cloud Services

Scalability types (vertical, horizontal, diagonal), Stateless vs. stateful services, Load balancing algorithms (round-robin, least connections, IP hash), Layer 4 vs. Layer 7 load balancing, Health checks and graceful...

Syllabus

01

Unit 1: Scalability Patterns and Load Balancing

Scalability types (vertical, horizontal, diagonal), Stateless vs. stateful services, Load balancing algorithms (round-robin, least connections, IP hash), Layer 4 vs. Layer 7 load balancing, Health checks and graceful degradation, Auto-scaling policies (CPU/memory utilization, custom metrics, predictive scaling), Capacity planning and right-sizing.

02

Unit 2: Container Orchestration and Service Mesh

Docker containerization best practices, Kubernetes architecture (control plane, worker nodes, etcd), Pod lifecycle and controllers (Deployment, StatefulSet, DaemonSet), Kubernetes Services (ClusterIP, NodePort, LoadBalancer, ExternalName), Ingress controllers and API gateways, Service mesh patterns (Istio virtual services, traffic management, observability).

03

Unit 3: Event-Driven and Asynchronous Architectures

Message brokers (Kafka, RabbitMQ, SQS), Pub-sub patterns and fan-out/fan-in, Event sourcing and CQRS principles, Apache Kafka (topics, partitions, consumer groups, exactly-once semantics), Stream processing (Kafka Streams, KSQL), Serverless event sources (Lambda triggers, Cloud Functions events), Saga pattern for distributed transactions.

04

Unit 4: Distributed Caching and Data Stores

Caching patterns (cache-aside, write-through, read-through), Redis cluster architecture and data sharding, Memcached consistent hashing, CDN integration for static assets, NoSQL selection criteria (DynamoDB, Cassandra, MongoDB Atlas), Relational database scaling (read replicas, sharding), Multi-region database replication.

05

Unit 5: Observability, Chaos Engineering, and Cost Optimization

The three pillars (metrics, logs, traces), Prometheus/Grafana monitoring stacks, OpenTelemetry instrumentation, Distributed tracing analysis, Chaos engineering (Chaos Monkey, Gremlin), SLO/SLI/SLA definition and error budgets, FinOps practices, Reserved/spot instances, Cost allocation tagging, Multi-cloud cost comparison.