A Kubernetes and Distributed Systems Reading List |

So, why this post? Well…

I'm (almost) done with my exams, so I'll finally have much more free time. What are some technical books you would recommend to me? 📚 I'm interested in Kubernetes, containers, distributed systems, security (although I'm a noob here), and more.
— Marko Mudrinić (@xmudrii) July 5, 2019

For those of you who don’t know Marko, he is a former GSoC student at the CNCF working on Kubernetes, and is a contributor to Kubernetes Cluster API, so I’m just going to take the idea that Marko’s a n00b with a pinch of salt the size of handfuls.

Anyway, it’s a common enough request that it’s probably worth documenting my 2p here. What follows is mostly things and authors that have interested me of. Other opinions are also available.

Books

Papers

On Cluster Orchestration

Choudhury, Diptanu Gon, and Timothy Perrett. Designing cluster schedulers for internet-scale services. Communications of the ACM 61 no. 6 (2018): 34-40. https://doi.org/10.1145/3190564
Leung, Andrew, Andrew Spyker, and Tim Bozarth. Titus: introducing containers to the Netflix cloud.. Communications of the ACM 61 no. 2 (2018): 38-45. https://doi.org/10.1145/3152529
Burns, Brendan, Brian Grant, David Oppenheimer, Eric Brewer, and John Wilkes. Borg, Omega, and Kubernetes. Communications of the ACM 59 no. 5 (2016): 50-57. https://doi.org/10.1145/2890784
Verma, Abhishek, Luis Pedrosa, Madhukar Korupolu, David Oppenheimer, Eric Tune, and John Wilkes. Large-scale cluster management at Google with Borg. In Proceedings of the Tenth European Conference on Computer Systems (EuroSys '15). ACM, 2015. https://doi.org/10.1145/2741948.2741964.

Distributed Systems

Bailis, Peter and Kyle Kingsbury. The Network is Reliable: An informal survey of real-world communications failures. ACM Queue 12 no. 7 (2014): 1-13. https://doi.org/10.1145/2643130. http://bit.ly/2JfqCuO.
DeCandia, Giuseppe, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall, and Werner Vogels. Dynamo: amazon's highly available key-value store. ACM SIGOPS operating systems review 41 no. 6 (2007): 205-220. https://doi.org/10.1145/1323293.1294281
Lamport, Leslie. Paxos made simple. ACM SIGACT News (Distributed Computing Column) 32 no. 4 (2001): 51-58

Security

Frazelle, Jessie. Research for practice: security for the modern age. Communications of the ACM 62 no. 1 (2019): 43-45. https://doi.org/10.1145/3287295. http://bit.ly/2JqTfUB.

System Architecture

Saltzer, Jerome H., David P. Reed, and David D. Clark. End-to-end arguments in system design. Technology 100 (1984): 0661

Share Tweet

A Kubernetes and Distributed Systems Reading List

Books

On Kubernetes

On distributed systems

On organizational practice

Papers

On Cluster Orchestration

Distributed Systems

Security

System Architecture