- Added and configured an Open Source Chef Server, which went on to manage a dynamic EC2 fleet ranging from 100 to 300 Chef clients.
- Set up a metrics collection suite consisting of Sensu, Graphite, and Grafana (a front-end dashboard for Graphite).
- Used Rundeck to automate several monitoring and deployment tasks (e.g., restarting EC2 nodes that failed status checks).
- Configured Logstash to parse Java API logs and alert on specific thresholds.
Technologies used: Ruby, Chef, Nagios, Sensu, Rundeck, lsyncd, Graphite, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana)