Prometheus consul注册发现

[Unit] 
Description=Consul. 
Documentation=https://www.consul.io 
After=network.target 
 
[Service] 
User=consul 
Group=consul 
ExecStart=/usr/bin/consul agent -server -bootstrap -data-dir=/data/consul/data -node=Consul -bind=0.0.0.0 -client=0.0.0.0 -ui 
ExecReload=/bin/kill -HUP $MAINPID 
Restart=always
RestartSec=5s
 
[Install] 
WantedBy=multi-user.target

创建consul用户、数据目录与配置文件目录,并配置token认证
useradd --no-create-home --shell /sbin/nologin consul
mkdir -p /data/consul/data
chown consul. /data/consul/data/
mkdir /etc/consul/
vim /etc/consul/acl.json

{ 
 "acl": { 
  "enabled": true, 
  "default_policy": "deny", 
  "enable_token_persistence": true, 
  "tokens": { 
    "master": "xxxxxx-xxxx-xxxx-xxxx-xxxxxxxxx"      //token自定义
  } 
 } 
}

启动服务并设置开机自启
systemctl daemon-reload
systemctl start consul.service
systemctl enable consul.service

修改配置文件,与consul、alertmanager连接
vim /etc/prometheus/prometheus.yml

global: 
  scrape_interval:     15s # Set the scrape interval to every 15 seconds. Default is every 1 minute. 
  evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute. 
 
# Alertmanager configuration 
alerting: 
  alertmanagers: 
  - static_configs: 
    - targets: 
       - localhost:9093          # Alertmanager 地址与端口号
 
# Load rules once and periodically evaluate them according to the global 'evaluation_interval'. 
rule_files: 
   - "/etc/prometheus/rules/*.rules"      # 报警规则文件路径
 
scrape_configs: 
  - job_name: 'prometheus' 
    static_configs: 
    - targets: ['localhost:9090'] 
  - job_name: 'Instance' 
    consul_sd_configs:            # 配置基于consul的服务发现
    - server: localhost:8500       # consul地址
      token: 8dc1eb67-1f5f-4e10-ad9d-5e58b047647c     # 自定义的token
      refresh_interval: 60s      # 刷新间隔
      services: []      # 当前需要发现哪些类型服务实例的信息,后面会做详细配置
# 所有服务会先注册到consul,prometheus通过基于Consul的服务发现监控到target

启动服务及设置开机自启
systemctl start prometheus
systemctl enable prometheus

修改Prometheus配置,让其支持基于consul的服务发现
vim /etc/prometheus/prometheus.yml

......
scrape_configs: 
......
  - job_name: 'Instance' 
    consul_sd_configs: 
    - server: localhost:8500 
      token: 8dc1eb67-1f5f-4e10-ad9d-5e58b047647c 
      refresh_interval: 60s 
      services: ['node_exporter','alertmanager_exporter','consul_exporter'] 
    relabel_configs:        # 对默认的Metadata进行自定义Relabeling
    - source_labels: [__meta_consul_service_address] 
      target_label: 'ipaddress' 
    - source_labels: [__meta_consul_service_metadata_instance] 
      target_label: 'instance' 
    - source_labels: [__meta_consul_service_metadata_group] 
      target_label: 'group' 
    - source_labels: [__meta_consul_service_metadata_environment] 
      target_label: 'environment' 
    - source_labels: [__meta_consul_service_metadata_Project] 
      target_label: 'Project' 
    - source_labels: [__meta_consul_service] 
      target_label: 'service'
...... 

注册被监控端信息到consul
curl -X PUT -H "Content-Type: application/x-www-form-urlencoded" -H "X-Consul-Token: 8dc1eb67-1f5f-4e10-ad9d-5e58b047647c" -d @./node_exporter.json http://192.168.122.11:8500/v1/agent/service/register
node_exporter.json文件内容如下:

{ 
 "name": "node_exporter", 
 "id": "prometheus_node_exporter", 
 "address": "192.168.122.11", 
 "port": 9100, 
 "meta": { 
   "instance": "prometheus", 
   "group": "demo", 
   "environment": "UAT", 
   "Project": "Mon"
 }, 
 "checks": [{ 
   "http": "http://192.168.122.11:9100/metrics", 
   "interval": "35s" 
 }] 
}
#!/bin/bash
HOST=`hostname`
machine_physics_net=$(ls /sys/class/net/ | grep -v "`ls /sys/devices/virtual/net/`")
IP=$(ip addr | grep "$machine_physics_net" | awk '/^[0-9]+: / {}; /inet.*global/ {print gensub(/(.*)\/(.*)/, "\\1", "g", $2)}')
server_name=`hostname | awk -F'.' '{print $1}'`
PUBLIC_IP=`curl icanhazip.com`

curl -X PUT -H "Content-Type: application/x-www-form-urlencoded" -H "X-Consul-Token: 8dc1eb67-1f5f-4e10-ad9d-5e58b047647c" -d '{"id": "'$HOST'","name": "'$HOST'","address": "'$IP'","port": 9100,"tags": ["linux_system"],"meta": {"factory": "google", "project": "'$server_name'", "host":"'$HOST'", "public_ip":"'$PUBLIC_IP'", "private_ip":"'$IP'", "monitoring-items": "linux_base"},  "checks": [{"http": "http://'$IP':9100/metrics", "interval": "5s"}]}'  http://172.18.5.62:8500/v1/agent/service/register > /dev/null 2>&1 && echo -e "\033[92m $HOST 注册成功 \033[0m"
image.png
最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。

推荐阅读更多精彩内容