Not so long ago I wanted to read something about DevOps and processes in real organisations, so I chose Site Reliability Engineering: How Google Runs Production Systems. It nicely explains deployment, failure recovery, support and other SRE aspects from both engineering and management points of view. It is also interesting to read about the problems of huge systems and how they are solved.
The book is informative, but it’s a bit boring. Most of the cases are Google-specific or would be a problem only on very large systems.