- 16 Dec, 2019 1 commit
-
-
Raúl Sánchez authored
-
- 12 Dec, 2019 1 commit
-
-
Erik Kristensen authored
-
- 10 Dec, 2019 1 commit
-
-
GGGitBoy authored
-
- 15 Nov, 2019 1 commit
-
-
loganhz authored
-
- 11 Nov, 2019 1 commit
-
-
Frank Mai authored
-
- 22 Oct, 2019 1 commit
-
-
gitlawr authored
To make monitoring work when hardening rules are applied.
-
- 17 Oct, 2019 2 commits
- 18 Sep, 2019 1 commit
-
-
Frank Mai authored
Copy the binary to the host before wins cli prc run
-
- 14 Sep, 2019 1 commit
-
-
Frank Mai authored
**Problem:** Could not distinguish Windows images from `values.yaml` **Solution:** - Add a new field `os` below field `image` - Must indicate `os: windows` if the image is using in Windows cluster
-
- 10 Sep, 2019 1 commit
-
-
gitlawr authored
- Bump Prometheus operator image version - Sync upstream CRDs. Add PodMonitor and update the rest.(https://github.com/helm/charts/tree/master/stable/prometheus-operator/templates/prometheus-operator) - Update ClusterRole Related Upstream changes: https://github.com/helm/charts/commit/6a94c082c2daf8be72ac241397d652993ad82a0b#diff-b9f481ac97524bfb24d27f62676dc143 https://github.com/helm/charts/commit/d58247103fafd5930792bb1fd3ac240547523200#diff-23b6255dfa711af76ad99181e2fb0464
-
- 29 Aug, 2019 1 commit
-
-
michelia feng authored
-
- 27 Aug, 2019 1 commit
-
-
aiwantaozi authored
Problem: can't get windows container/node metrics Solution: 1. deploy windows node exporter 2. add add windows metrics service, endpoint, service monitor 3. add metric relabel rules to unify windows and linux expression Issue: https://github.com/rancher/rancher/issues/20513 https://github.com/rancher/rancher/issues/20076 https://github.com/rancher/rancher/issues/20122
-
- 12 Aug, 2019 1 commit
-
-
Guangbo Chen authored
-
- 23 Jul, 2019 2 commits
-
-
Guangbo Chen authored
-
Guangbo Chen authored
-
- 21 May, 2019 1 commit
-
-
Aiwantaozi authored
Problem: alertmanager 0.16.0 unable to send email notification without SMTP authentication Solution: upgrade to alertmanger 0.17.0 which solve this problem Issue: https://github.com/rancher/rancher/issues/20060
-
- 07 May, 2019 2 commits
- 01 May, 2019 2 commits
-
-
Frank Mai authored
- Embed operator as sub charts + Support to configure operator like other charts + Adjust operator default limit - Add permission to kube-state exporter - Replace localhost by 127.0.0.1 on prometheus-auth - Increase Nginx proxy buffers - Configure PVC name of Prometheus or Alertmanager + Allow to configure PVC name of Prometheus or Alertmanager via `prometheus.persistence.name` or `alertmanager.persistence.name` - Adjust Cluster Monitoring scrape logic + Don't scrape Monitoring namespace on `prometheus-io-scrape` job + The rate to scrape is using global interval, the default is 60s + Remove useless Prometheus record rules **Issue:** - https://github.com/rancher/rancher/issues/19693 - https://github.com/rancher/rancher/issues/18830 - https://github.com/rancher/rancher/issues/19243 - https://github.com/rancher/rancher/issues/19689 - https://github.com/rancher/rancher/issues/19410 - https://github.com/rancher/rancher/issues/19248 -
Frank Mai authored
-
- 12 Mar, 2019 1 commit
-
-
Frank Mai authored
-
- 26 Feb, 2019 1 commit
-
-
Frank Mai authored
**Problem:** Cannot start "rules-configmap-reloader" container with 10Mi limit resource **Solution:** Update images: - quay.io/coreos/prometheus-operator:v0.29.0 -> rancher/coreos-prometheus-operator:v0.29.0 - quay.io/coreos/prometheus-config-reloader -> rancher/coreos-prometheus-config-reloader:v0.29.0 - prom/alertmanager:v0.16.1 -> rancher/prom-alertmanager:v0.16.1 - prom/prometheus:v2.7.1 -> rancher/prom-prometheus:v2.7.1 - grafana/grafana:5.4.3 -> rancher/grafana-grafana:5.4.3 - prom/node-exporter:v0.17.0 -> rancher/prom-node-exporter:v0.17.0 - quay.io/coreos/kube-state-metrics:v1.5.0 -> rancher/coreos-kube-state-metrics:v1.5.0 **Issue:** - https://github.com/rancher/rancher/issues/17997 - https://github.com/rancher/rancher/issues/18353
-
- 25 Feb, 2019 1 commit
-
-
Frank Mai authored
**Problem:** Enable logging and monitoring in `rancher/rancher:master`, but can't see fluentd metric **Solution:** Consist label and endpoint name in `system-chart/rancher-monitoring:v0.0.2`` **Issue:** https://github.com/rancher/rancher/issues/18327 **Patch:** https://github.com/rancher/system-charts/pull/17
-
- 20 Feb, 2019 3 commits
-
-
Frank Mai authored
**Issues:** https://github.com/rancher/rancher/issues/18166
-
Frank Mai authored
-
Frank Mai authored
-
- 15 Feb, 2019 1 commit
-
-
Frank Mai authored
**Problem:** - Remote reader mode only allow `project-level` Prometheus to share the metrics from `cluster-level` Prometheus - Remote reader mode cannot save the namespace-related metrics from `cluster-level` Prometheus **Solution:** - Add `prometheus.sync.mode` to choose - Add a "federate" scrape job when deploying federation mode **Issue:** https://github.com/rancher/rancher/issues/17390
-
- 14 Feb, 2019 4 commits
-
-
Frank Mai authored
**Problem:** Cannot input like `x.y.z/k` label name into serviceSelectorLabels **Solution:** Use array instead of object as values
-
Frank Mai authored
-
Frank Mai authored
-
Frank Mai authored
**Problem:** Cannot input like `x.y.z/k` label name into nodeSelector **Solution:** Use array instead of object as values **Issue:** https://github.com/rancher/rancher/issues/17340
-
- 13 Feb, 2019 1 commit
-
-
Fyery authored
problem: We can not deploy monitoring tools in an air gap environment. Solution: Add the ability to use the private image registry when deploying monitoring tools Issue: https://github.com/rancher/rancher/issues/17842
-
- 12 Feb, 2019 2 commits
-
-
frank authored
**Problem:** - Previous charts cannot satisfy the project level monitoring deploying design - Grafana cannot be restarted after password changed - node-exporter cannot be scheduled to `controlpane` or `etcd` role nodes - Prometheus cannot be started with PVC that provided by some storage provisioner which don't respect the `SecurityContext` **Solution:** - Deploy "project level" monitoring with a permission-limit Prometheus - Remove Grafana account `Secret` and use provisioning instead of `grafana-watch` - Modify node-exporter `taints` - Add configurable `SecurityContext` for Prometheus and Alertmanager - Narrow Prometheus permission **Issue:** - https://github.com/rancher/rancher/issues/17039 - https://github.com/rancher/rancher/issues/16962 - https://github.com/rancher/rancher/issues/17030 - https://github.com/rancher/rancher/issues/17256Co-authored-by:
orangedeng <jxfa0043379@hotmail.com>
-
frank authored
-
- 10 Dec, 2018 1 commit
-
-
frank authored
**Problem:** Can't enable project level monitoring without resources quota **Solution:** Add resource quota to workload spec **Issue:** https://github.com/rancher/rancher/issues/16993
-
- 05 Dec, 2018 1 commit
-
-
Frank Mai authored
-
- 03 Dec, 2018 1 commit
-
-
frank authored
(+) Only use for Rancher 2.0 Monitoring and Alerting (+) Support Grafana to proxy with authorization bearer token to Prometheus-Auth agent (+) Support Prometheus web to proxy with authorization bearer token to Prometheus-Auth agent (+) Rich metrics for Kubernetes and Rancher Co-authored-by:aiwantaozi <michelia.feng@gmail.com> Co-authored-by:
orangedeng <jxfa0043379@hotmail.com>
-