Usar Google Cloud Managed Service para Prometheus para recopilar métricas configuradas de terceros y definidas por el usuario y, luego, enviarlas a Cloud Monitoring. Google Cloud Managed Service para Prometheus te permite supervisar y crear alertas en tus cargas de trabajo mediante Prometheus, sin administrar ni operar de forma manual Prometheus a gran escala.
Personaliza y mejora la recopilación de datos
De forma predeterminada, GKE crea un repositorio de Logging para almacenar registros de cada clúster. Puedes controlar qué registros y qué métricas, si las hay, se envían desde tu clúster de GKE a Cloud Logging y Cloud Monitoring.
Métricas del plano de control: Supervisa el estado de los componentes de Kubernetes mediante la recopilación de métricas para el servidor de la API, Scheduler y el Administrador de controladores de Kubernetes. Estas métricas son indicadores útiles del estado del servicio para definir objetivos de nivel de servicio (SLO).
Kube State Metrics: supervisa el estado de los objetos de Kubernetes, como Deployments, nodos y Pods.
Para supervisar las aplicaciones de terceros que se ejecutan en tus clústeres, como Postgres, MongoDB y Redis, usa exportadores de Prometheus con Google Cloud Managed Service para Prometheus.
Usa los datos que recopilas para analizar el estado de la aplicación, depurarla, solucionar problemas y realizar pruebas a medida que desarrollas, implementas y mantienes tus aplicaciones.
GKE proporciona funciones de observabilidad integradas para que puedas comenzar con rapidez:
Visualiza los datos recopilados para tus clústeres y cargas de trabajo en los paneles de observabilidad de GKE.
Puedes personalizar los paneles proporcionados para los siguientes fines:
Visualizar las métricas clave del clúster, como el uso de CPU, el uso de memoria y la cantidad de incidentes abiertos.
Visualizar clústeres por su infraestructura, cargas de trabajo o Services.
Inspeccionar espacios de nombres, nodos, cargas de trabajo, Services, Pods y contenedores.
En el caso de los Pods y los contenedores, consultar las métricas como una función de tiempo y visualiza las entradas de registro.
En la pestaña Observabilidad, puedes crear políticas de alertas recomendadas para que recibas notificaciones sobre los problemas. Para obtener más información sobre las alertas, consulta la Descripción general de alertas.
Crea SLOs para supervisar los objetivos de rendimiento del servicio mediante métricas de GKE recopiladas.
Los precios de la integración en Cloud Logging (incluidos los registros de auditoría de Cloud), Cloud Monitoring y el servicio administrado de Google Cloud para Prometheus se basan en la cantidad de registros y métricas recopilados. Consulta la página de Precios para obtener más detalles.
Las funciones que proporcionan otros servicios de Google Cloud enumeradas en Otras funciones tienen precios diferentes. Consulta la sección de precios de esas páginas de documentación para obtener más información.
¿Qué sigue?
Observa tus clústeres.
Aprende a ver paneles, organizar la información del clúster y ver los detalles de las alertas.
[[["Fácil de comprender","easyToUnderstand","thumb-up"],["Resolvió mi problema","solvedMyProblem","thumb-up"],["Otro","otherUp","thumb-up"]],[["Difícil de entender","hardToUnderstand","thumb-down"],["Información o código de muestra incorrectos","incorrectInformationOrSampleCode","thumb-down"],["Faltan la información o los ejemplos que necesito","missingTheInformationSamplesINeed","thumb-down"],["Problema de traducción","translationIssue","thumb-down"],["Otro","otherDown","thumb-down"]],["Última actualización: 2024-11-21 (UTC)"],[],[],null,["[Autopilot](/kubernetes-engine/docs/concepts/autopilot-overview) [Standard](/kubernetes-engine/docs/concepts/choose-cluster-mode)\n\n*** ** * ** ***\n\nThis page describes how you can understand the health of your applications and\nmaintain application availability and reliability.\n\nDefault observability features\n\nBy default, GKE clusters are configured to do the following:\n\n- Send [system logs, audit logs, and application logs](/kubernetes-engine/docs/concepts/about-logs#what_logs) to Cloud Logging.\n- Send [system metrics](/kubernetes-engine/docs/how-to/configure-metrics#system-metrics) to Cloud Monitoring.\n- Use [Google Cloud Managed Service for Prometheus](/stackdriver/docs/managed-prometheus) to collect configured third-party and user-defined metrics and then send them to Cloud Monitoring. Google Cloud Managed Service for Prometheus lets you monitor and alert on your workloads using Prometheus, without manually managing and operating Prometheus at scale.\n\nCustomize and enhance data collection\n\nBy default, GKE creates a Logging repository for\nstoring logs for each cluster. You can control which logs and which metrics, if\nany, are sent from your GKE cluster to Cloud Logging and\nCloud Monitoring.\n\nYou can also control whether to enable\n[Google Cloud Managed Service for Prometheus](/stackdriver/docs/managed-prometheus).\n\nFor GKE Autopilot clusters, you cannot disable\nthe Cloud Monitoring and Cloud Logging integration.\n\nAdditional observability metrics\n\nYou can collect additional observability metrics by enabling one or more\n[observability metrics packages](/kubernetes-engine/docs/how-to/configure-metrics).\n\n- [Control plane metrics](/kubernetes-engine/docs/how-to/control-plane-metrics): Monitor the health of Kubernetes components by collecting metrics for the Kubernetes API server, Scheduler, and Controller Manager. These metrics are useful signals of service health for defining service level objectives (SLOs).\n- [Kube state metrics](/kubernetes-engine/docs/how-to/kube-state-metrics): Monitor the health of Kubernetes objects such as Deployments, Nodes, and Pods.\n- [cAdvisor/Kubelet metrics](/kubernetes-engine/docs/how-to/cadvisor-kubelet-metrics): Monitor the health of containers and the kubelet.\n\nThird-party and user-defined metrics\n\nTo monitor third-party applications running on your clusters such as Postgres,\nMongoDB, and Redis, use\n[Prometheus exporters](/stackdriver/docs/managed-prometheus/exporters/introduction)\nwith Google Cloud Managed Service for Prometheus.\n\nYou can also\n[write custom exporters](https://prometheus.io/docs/instrumenting/writing_exporters/)\nto monitor other signals of health and performance.\n\nUse collected data\n\nUse the data you collect to analyze application health, debug, troubleshoot,\nand test as you develop, deploy, and maintain your applications.\n\nGKE provides built-in observability features to get you started\nquickly:\n\n- View collected data for your clusters and workloads on in\n GKE\n [observability dashboards](/kubernetes-engine/docs/how-to/view-observability-metrics).\n You can customize the provided dashboards for the following purposes:\n\n - View key cluster metrics, such as CPU utilization, memory utilization, and the number of open incidents.\n - View clusters by their infrastructure, workloads, or Services.\n - Inspect namespaces, Nodes, workloads, Services, Pods, and containers.\n - For Pods and containers, view metrics as a function of time and view log entries.\n\n You can also [create your own dashboards](/monitoring/charts/dashboards) or\n [import Grafana dashboards](/monitoring/dashboards/import-grafana-dashboards)\n to meet your needs.\n | **Note:** The provided GKE dashboards only display information for GKE clusters running on Google Cloud. They don't display information for GKE clusters running anywhere else, for example using on-premises or bare-metal servers.\n- From the **Observability** tab, you can create recommended alert policies so\n that you are notified about issues. To learn more about alerting, see the\n [Alerting overview](/monitoring/alerts).\n\n- [Create SLOs](/stackdriver/docs/solutions/slo-monitoring/ui/create-slo) to\n monitor your service performance goals using collected GKE\n metrics.\n\n- Use GKE playbooks to troubleshoot common issues such as\n [unschedulable Pods](/kubernetes-engine/docs/troubleshooting/deployed-workloads#PodUnschedulable)\n and\n [containers that repeatedly crash after restart](/kubernetes-engine/docs/troubleshooting/deployed-workloads#CrashLoopBackOff).\n\n- Explore and analyze your data with tools such as\n [Logs Explorer](/logging/docs/view/logs-explorer-interface),\n [Metrics Explorer](/monitoring/charts/metrics-explorer) and\n [Error Reporting](/error-reporting/docs/grouping-errors).\n\n- Review GKE\n [audit logs](/kubernetes-engine/docs/how-to/audit-logging) that record\n administrative activities and accesses as part of\n [Cloud Audit Logs](/logging/docs/audit).\n [Audit log policy](/kubernetes-engine/docs/concepts/audit-policy) determines\n which events are recorded and whether a log entry belongs to an Admin Activity\n log or a Data Access log.\n\nOther features\n\nGKE integrates with other Google Cloud services to help you\nmonitor and manage your clusters and workloads.\n\n- Use the\n [security posture dashboard](/kubernetes-engine/docs/concepts/about-security-posture-dashboard)\n to identify security concerns based on our standards and industry best\n practices.\n\n- View\n [insights and recommendations](/kubernetes-engine/docs/how-to/optimize-with-recommenders)\n to optimize your clusters.\n\n- Use\n [network policy logging](/kubernetes-engine/docs/how-to/network-policy-logging)\n to help you troubleshoot issues with Kubernetes network policies. If you use\n [GKE Dataplane V2](/kubernetes-engine/docs/concepts/dataplane-v2), then\n network policy logging is built-in.\n\nPricing\n\nPricing for integration with Cloud Logging (including\nCloud Audit Logs), Cloud Monitoring, and Google Cloud Managed Service for Prometheus is\nbased on the amount of logs and metrics collected. See the\n[Pricing](/stackdriver/pricing) page for details.\n\nFeatures provided by other Google Cloud services listed in\n[Other features](#other-features) have separate pricing. See the Pricing section\nof those documentation pages for more information.\n\nWhat's next\n\n- [Observe your clusters](/kubernetes-engine/docs/how-to/view-observability-metrics).\n Learn how to view dashboards, organize cluster information, and view alerting\n details.\n\n- [Enable verbose, OS-level audit logging](/kubernetes-engine/docs/how-to/linux-auditd-logging)\n on GKE cluster nodes and how to export logs to\n Cloud Logging.\n\n- For more information about how to use observability features to troubleshoot\n GKE, see\n [Introduction to GKE troubleshooting](/kubernetes-engine/docs/troubleshooting/introduction)."]]