18th January 2016: ~nicolaw — Übergeek & CatLoaf Facilitator

19th January 2016 at 3:01pm

http://n.tfb.net/ and add a proper SSL certificate etc.

http://www.site-reliability-engineering.info/2014/04/what-is-site-reliability-engineering.html
problem solving, diagnosis, issues at scale, root cause analysis, defining and driving best practices with developers
"I took some information from one internal DSL and rewrote it into another internal DSL"
- Administer high-availability and high-QPS web frontend and RPC services backed by custom storage layers (e.g., GFS, BigTable)
- Administer application load balancers
- Administer custom Java, C++, and Python services using the standard suite of Google custom tools
- Perform thorough capacity planning, load testing, resource management, and utilization responsibilities for existing and new production services
- Perform software releases
- Provide launch reviews for new applications and releases
- Perform frequent on-call duties for critical high-availability services (three and four nines, i.e. 99.9% and 99.99% availability)
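
To put "three and four nines" into numbers, here is a rough sketch of the downtime budget each level allows. The function name `allowed_downtime_minutes` and the 365-day year are my own assumptions for illustration, not anything from the job description.

```python
def allowed_downtime_minutes(nines: int, period_days: float = 365.0) -> float:
    """Downtime budget in minutes for an availability target of N nines.

    Hypothetical helper: N nines means availability of 1 - 10**-N,
    e.g. 3 nines = 99.9%, 4 nines = 99.99%.
    """
    unavailability = 10.0 ** (-nines)          # fraction of time allowed down
    minutes_in_period = period_days * 24 * 60  # assumed period length
    return minutes_in_period * unavailability


if __name__ == "__main__":
    for n in (3, 4):
        per_year = allowed_downtime_minutes(n)
        per_month = allowed_downtime_minutes(n, period_days=30)
        print(f"{n} nines: {per_year:.1f} min/year, {per_month:.2f} min per 30 days")
```

Under these assumptions three nines leaves roughly 8.8 hours of downtime a year and four nines under an hour, which is why the on-call load for such services is described as frequent.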