Resilience and Chaos Engineering in the Cloud

Duration: 50 mins
Nikos Katirtzis
Software Engineer,
Daniel Albuquerque
Software Engineer, Expedia

At (part of Expedia Group) we run microservices and infrastructure in production at large scale. Our applications previously ran on fixed hosts for their whole lifetime, so moving our services to AWS and Kubernetes presented us with a whole new set of challenges that we must be prepared for.

Every production incident impacts not only our revenue but also our customers' trust. In an effort to build resilience into our highly performant and highly scalable services, we at Expedia Group explored processes and tools to stress and 'break' our systems on purpose, without impacting production.

In this talk, we'll walk you through the following:

  • Parallels between Resilience Engineering in tech and in other industries
  • Why resilience matters and how we can become better at it
  • Why we need Chaos Engineering, with practical examples in the cloud and on Kubernetes
  • State of the art and current limitations
