Resilience and Chaos Engineering in the Cloud


Duration: 50 mins
Nikos Katirtzis
Software Engineer, Hotels.com
Daniel Albuquerque
Software Engineer, Expedia

At Hotels.com (part of Expedia Group) we run microservices and infrastructure in production at a large scale. Where applications previously ran on fixed hosts for their lifetime, moving our services to AWS and on Kubernetes presented us with a whole new set of challenges that we must be prepared for.

Every production incident not only impacts our revenue but also our customers' trust. In an effort to build resilience into our highly performant and highly scalable services, we at Expedia Group explored processes and tools to stress and 'break' our systems on purpose and without impacting production.

In this talk, we'll talk you through the following:

  • Parallels between Resilience Engineering in tech and in other industries
  • Why resilience matters and how we can become better at that
  • Why we need Chaos Engineering and practical examples on cloud and Kubernetes
  • State of the art and current limitations

You may also be interested in

50 mins
What a Character

We will cover the concepts of representing scripts from different languages, and an introduction to Unicode. We will delve into...

50 mins
Uncovering Project Amber - Changes to the Java Language in v10 and Beyond

Evolution has always been in the Java DNA, and according to Darwin, "It is neither the strongest nor the most...

180 mins
Hands-on Cloud-native Java with MicroProfile, Kubernetes and Istio

Ever wondered what makes a cloud-native application “cloud-native”? Ever wondered what the unique challenges are and how best to address...

50 mins
Java Testing Skills with JUnit 5, Mockito 3, and More

The fundamental testing libraries in Java have undergone complete redesigns in the past few years. JUnit 5 redesigns the most...

50 mins
Deep Learning and Java

We do not usually think of Java when we think about modern Deep Learning systems. Syntactically and culturally it is...

25 mins
Building Reactive Systems with Reactor

Do you want to build reactive, efficient architectures on the JVM? Are Java threads cumbersome for you? Do you want...