Error handling

When monitoring a system, we are primarily interested in failures. We want to know when something goes wrong, and we want to know why.

This section of the course focuses on failures, and in particular on how to capture their occurrence in our telemetry data.

Enjoy!

Exercise

The exercise for this section is located in 02_failures/00_intro