This role takes care of adding collectd to any given server.
It can be used to:
-
Collect metrics about a machines, and optionally forward them to an aggregation machine.
-
Create an aggregation machine, which will receive the metrics from a given set of machine within the same datacenter. In this case too, metrics can optionally be forwarded to another service.
Simply add the collector role to your playbook.
By default, we will save all metrics to RRD files under /var/lib/collectd/rrd/${hostname}
.
Normally, we put all aggregation on a single machine. However, it is possible to decouple them, and provision separate machines for any aggregators we might have. This example demonstrate how to create a dedicated collectd aggregator.
[collectd:children]
collectd-somedc-prod
[collectd-somedc-prod]
collectd1.somedc.prod ansible_ssh_host=10.0.1.111
[... skipping ...]
[somedc-prod:children]
collectd-somedc-prod
[somedc-prod:vars]
collectd_forwarder = collectd1.somedc.prod
#
# Optional: if you are aggregating logs with logstash
# and are using ElasticSearch and Kibana, you might
# want to also keep track of your metrics through
# Kibana. If so, simply add the following to your
# global configuration - keep in mind that
# activating this will turn off any other
# forwarding you might have configured in
# favor of sending all entries to logstash
#
logstash_forwarder = collectd2.somedc.prod
collectd_forward_to_logstash = true
The following may also be added to your inventory.
collectd_interval
: at what interval in seconds to take measurements (default: 60)check_disk
: a value which we will use to select what disk to monitor (default: xvde)collectd_df_fs_type
: the file system type to monitor (Collectd 5.4 default: ext4. Collectd 5.5 default: undefined)collectd_df_device
: the dev device to monitor.collectd_df_mountpoint
: the mount point to monitor.collectd_df_IgnoreSelected
: see wiki (default: false)collectd_df_ReportByDevice
: see wiki (default: false)collectd_df_ReportInodes
: see wiki (default: false)collectd_df_ValuesAbsolute
: see wiki (default: true)collectd_df_ValuesPercentage
: see wiki (default: false)monitor_coretemp
: set to true if you want to monitor coretemp (only useful on real hardware)private_ip
: set to the IP address of thecollectd_forwarder
ifcollectd_fowarder
has multiple IPs. Example:
collectd1.somedc.prod ansible_ssh_host=10.0.1.111 private_ip=172.16.1.111
Default: xvde
check_disk: xvde
All options are optional.
The following should be set as an array. Example 1: (Single value)
collectd_df_device: '["/dev/xvda1"]'
collectd_df_mountpoint: '["/"]'
collectd_df_fs_type: '["xfs"]'
Example 2: (Multiple values)
collectd_df_device: '["/dev/xvda1","/dev/xvdb1"]'
collectd_df_mountpoint: '["/","/data"]'
collectd_df_fs_type: '["xfs","ext4"]'
You can change the rest of the defaults.
collectd_df_IgnoreSelected: false
collectd_df_ReportByDevice: false
collectd_df_ReportInodes: false
collectd_df_ValuesAbsolute: true
collectd_df_ValuesPercentage: false
It is possible for any roles to add their own custom metric collection configuration.
In myrole/templates/collectd.conf.j2
:
<Plugin "whatever">
...
</Plugin>
Then, in myrole/tasks/main.yml
:
- name: Adding collectd monitoring for myrole
template: >
src=collectd.conf.j2
dest=/etc/collectd.d/myrole.conf
notify:
- Restart collectd
tags:
- myrole
- files
- collectd
Simply follow the standard provisioning method.
When provisioning with this role, you can also add Librato and Graphite support.
As you should not add any credentials to your inventory,
it should be done by using --extra-vars
as described below.
--extra-vars="use_librato=true" \
--extra-vars="librato_email=some@email.com" \
--extra-vars="librato_token=API-KEY"
--extra-vars="use_graphite=true" \
--extra-vars="graphite_host=some.host.com" \
--extra-vars="graphite_port="