Deploying Monitoring Services on Kubernetes
This topic describes how to use Prometheus to deploy monitoring services for a Milvus cluster on Kubernetes.
Monitor metrics with Prometheus
Metrics are indicators providing information about the running status of your system. For example, with metrics, you can understand how much memory or CPU resources are consumed by a data node in Milvus. Being aware of the performance and status of the components in your Milvus cluster makes you well-informed and hence making better decisions and adjusting resource allocation in a more timely manner.
Generally, metrics are stored in a time series database (TSDB), like Prometheus, and the metrics are recorded with a timestamp. In the case of monitoring Milvus services, you can use Prometheus to pull data from endpoints set by exporters. Prometheus then exports metrics of each Milvus component at http://<component-host>:9091/metrics.
However, you might have several replicas for one component, which makes manual configuration of Prometheus too complicated. Therefore, you can use Prometheus Operator, an extension to Kubernetes, for automated and effective management of Prometheus monitoring instances. Using Prometheus Operator saves you the trouble of manually adding metric targets and service providers.
The ServiceMonitor Custom Resource Definition (CRD) enables you to declaratively define how a dynamic set of services are monitored. It also allows selecting which services to monitor with the desired configuration using label selections. With Prometheus Operator, you can introduce conventions specifying how metrics are exposed. New services can be automatically discovered following the convention you set without the need for manual reconfiguration.
The following image illustrates Prometheus workflow.
  
     Prometheus_architecture
    Prometheus_architecture
  
Prerequisites
This tutorial uses kube-prometheus to save you the trouble of installing and manually configuring each monitoring and alerting component.
Kube-prometheus collects Kubernetes manifests, Grafana dashboards, and Prometheus rules combined with documentation and scripts.
Before deploying monitoring services, you need to create a monitoring stack by using the configuration in the kube-prometheus manifests directory.
$ git clone https://github.com/prometheus-operator/kube-prometheus.git
$ cd kube-prometheus
$ kubectl apply --server-side -f manifests/setup
$ kubectl wait \
        --for condition=Established \
        --all CustomResourceDefinition \
        --namespace=monitoring
$ kubectl apply -f manifests/
kubectl patch clusterrole prometheus-k8s --type=json -p='[{"op": "add", "path": "/rules/-", "value": {"apiGroups": [""], "resources": ["pods", "services", "endpoints"], "verbs": ["get", "watch", "list"]}}]'
To delete a stack, run kubectl delete --ignore-not-found=true -f manifests/ -f manifests/setup.
Deploy monitoring services on Kubernetes
1. Access the dashboards
Forward the Prometheus service to port 9090, and Grafana service to port 3000.
$ kubectl --namespace monitoring --address 0.0.0.0 port-forward svc/prometheus-k8s 9090
$ kubectl --namespace monitoring --address 0.0.0.0 port-forward svc/grafana 3000
2. Enable ServiceMonitor
The ServiceMonitor is not enabled for Milvus Helm by default. After installing the Prometheus Operator in the Kubernetes cluster, you can enable it by adding the parameter metrics.serviceMonitor.enabled=true.
With Helm
You can enable the ServiceMonitor by setting the parameter metrics.serviceMonitor.enabled=true as follows if you have installed Milvus Helm chart.
```
$ helm upgrade my-release milvus/milvus --set metrics.serviceMonitor.enabled=true --reuse-values
```
When the installation completes, use kubectl to check the ServiceMonitor resource.
With Milvus Operator
You can enable the ServiceMonitor as follows if you have installed Milvus using the Milvus Operator.
- Run the following command to edit the MIlvus custom resource. The following command assumes that the custom resource is named - my-release.- $ kubectl edit milvus my-release
- Edit the - spec.components.disableMetricsfield to- false.- ... spec: components: disableMetrics: false # set to true to disable metrics ...
- Save and exit the editor. 
- Wait for the operator to reconcile the changes. You can check the status of the Milvus custom resource by running the following command. - $ kubectl get milvus my-release -o yaml
The status.components.metrics.serviceMonitor.enabled field should be true.
3. Check the metrics
After enabling the ServiceMonitor, you can access the Prometheus dashboard at http://localhost:9090/.
Click on the Status tab and then Targets. You should see the targets of the Milvus components.
  
     Prometheus_targets
    Prometheus_targets
  
Click on the Graph tab and enter the expression up{job="default/my-release"} in the expression input box. You should see the metrics of the Milvus components.
  
     Prometheus_graph
    Prometheus_graph
  
4. Check the ServiceMonitor
$ kubectl get servicemonitor
NAME                           AGE
my-release-milvus              54s
What’s next
- If you have deployed monitoring services for the Milvus cluster, you might also want to learn to:
- If you are looking for information about how to scale a Milvus cluster:
- Learn scale a Milvus cluster
 
- If you are interested in upgrading the Milvus version,