18th January 2016

19th January 2016 at 3:01pm
Journal

TODO

  • Update my contact page at http://n.tfb.net/ and add a proper SSL certificate etc.

  • Update CV for Google and Velocix / Alcatel-Lucent

Google

  • http://www.site-reliability-engineering.info/2014/04/what-is-site-reliability-engineering.html
  • problem solving, diagnosis, issues at scale, root cause analysis, defining and driving best practices with developers
  • "I took some information from one internal DSL and rewrote it into another internal DSL"
  • Administer high-availability and high-QPS web frontend and RPC services backed by custom storage layers (e.g., GFS, BigTable)
    • Administer application load balancers
    • Administer custom Java, C++, and Python services using the standard suite of Google custom tools
    • Perform thorough capacity planning, load testing, resource management, and utilization responsibilities for existing and new production services
    • Perform software releases
    • Provide launch reviews for new applications and releases
    • Perform frequent oncall duties for critical high-availability services (three and four nines)
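
Note to self on "three and four nines": a rough sketch of what those targets allow in downtime per year, assuming the usual reading (99.9% and 99.99% availability) and a 365-day year. The function name and the constant are my own, made up for illustration, not taken from any Google tooling.

```python
# Illustrative only: converts an availability target ("N nines") into a
# yearly downtime budget. Assumes a 365-day year (leap years ignored).
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes


def downtime_minutes_per_year(nines: int) -> float:
    """Allowed downtime per year, in minutes, for an availability of `nines` nines."""
    unavailability = 10 ** -nines  # e.g. 3 nines -> 0.001 (99.9% available)
    return MINUTES_PER_YEAR * unavailability


if __name__ == "__main__":
    for n in (3, 4):
        minutes = downtime_minutes_per_year(n)
        print(f"{n} nines: about {minutes:.1f} minutes of downtime per year")
```

So three nines leaves roughly 8.8 hours a year and four nines under an hour, which is worth having in mind when talking about oncall for services at that level.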