BIDA2025

Mr. Akaash Vishal Hazarika

Building Reliable Distributed Systems

Abstract:

This talk examines the critical challenges and practical solutions for constructing distributed systems that maintain reliability amid inevitable failures. We explore the fundamental tensions within the CAP theorem and how these constraints shape architectural decisions in modern distributed applications. The presentation covers essential reliability patterns including consensus algorithms (Paxos, Raft), state machine replication, and failure detection mechanisms that enable systems to maintain consistency and availability despite network partitions. We'll analyze implementation strategies such as circuit breakers, distributed transactions, and data partitioning techniques that translate theoretical concepts into robust production systems. The talk also addresses verification methodologies including chaos engineering and load testing that confirm system resilience under stress. Attendees will gain practical insights for designing distributed architectures that balance theoretical principles with engineering pragmatism, enabling systems that operate dependably at scale despite the unpredictable nature of distributed environments.