Rebooting a datacenter: A decade later

Oxide and Friends - A podcast by Oxide Computer Company

Categories:

Back in May 2014 Joyent accidentally rebooted an entire datacenter (not just the handful of node as intended!). That incident--traumatic was it was--informed many aspects of the Oxide product. Bryan and Adam were joined by members of that former Joyent team to discuss, commiserate, and--perhaps--get some things off their chests. a live show weekly on Mondays at 5p for about an hour, and recording them all; here is the recording.In addition to Bryan Cantrill and Adam Leventhal, speakers included Josh Clulow, Brian Bennett, Robert Mustacchi, and Steve Tuck.Some of the topics we hit on, in the order that we hit them:The Register: Fat-fingered admin downs entire Joyent data centerBryan's talk: Debugging Under FireOxide and Friends on the Oakland BallersThe Ur AgentJoyent post-mortemPRs needed!If we got something wrong or missed something, please file a PR! Our next show will likely be on Monday at 5p Pacific Time on our Discord server; stay tuned to our Mastodon feeds for details, or subscribe to this calendar. We'd love to have you join us, as we always love to hear from new speakers!