Services

CloudWatch metrics supports the following services and lets you pick from a wide array of available metrics and statistics. Metrics in bold text are included in the default configuration. The statistics for all metrics are Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, and p99.9.

AWS/ACMPrivateCA

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_acmprivateca_info | | |
aws_acmprivateca_crlgenerated | CRLGenerated | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of Certificate Revocation Lists (CRLs) generated. Used to ensure the regular creation of revocation lists for certificate management.
aws_acmprivateca_failure | Failure | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of failures in Private CA operations. Useful for identifying issues in certificate issuance or other operations.
aws_acmprivateca_misconfigured_crlbucket | MisconfiguredCRLBucket | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of instances where the CRL bucket is misconfigured. Useful for ensuring proper configuration and access to the CRL storage bucket.
aws_acmprivateca_success | Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of successful operations within the ACM Private CA. Useful for monitoring operational efficiency and successful certificate issuances.
aws_acmprivateca_time | Time | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the time taken for various operations in ACM Private CA, helping to monitor performance and identify any slowdowns in certificate processing.

AWS/AmazonMQ

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_amazonmq_info | | |
aws_amazonmq_ack_rate | AckRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the acknowledgment rate of messages, ensuring efficient message processing and acknowledgment.
aws_amazonmq_burst_balance | BurstBalance | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the balance of burst credits, monitoring if the broker can handle sudden spikes in traffic.
aws_amazonmq_channel_count | ChannelCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of active channels, indicating resource usage and load on the broker.
aws_amazonmq_confirm_rate | ConfirmRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the rate at which messages are confirmed, ensuring message delivery guarantees.
aws_amazonmq_connection_count | ConnectionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of active connections, helping monitor broker usage and possible overloading.
aws_amazonmq_consumer_count | ConsumerCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of consumers connected, useful for understanding broker demand and throughput.
aws_amazonmq_cpu_credit_balance | CpuCreditBalance | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the remaining CPU credits, important for ensuring the broker has enough processing power to handle workload.
aws_amazonmq_cpu_utilization | CpuUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the percentage of CPU usage, helping identify potential performance bottlenecks.
aws_amazonmq_current_connections_count | CurrentConnectionsCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Shows the number of currently connected clients, useful for tracking session loads.
aws_amazonmq_dequeue_count | DequeueCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of messages dequeued, which helps gauge message consumption activity.
aws_amazonmq_dispatch_count | DispatchCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of messages dispatched to consumers, helping monitor message flow.
aws_amazonmq_enqueue_count | EnqueueCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of messages enqueued, giving insights into the volume of messages entering the system.
aws_amazonmq_enqueue_time | EnqueueTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the time taken to enqueue messages, used to monitor latency and performance.
aws_amazonmq_established_connections_count | EstablishedConnectionsCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of successfully established connections, used to monitor system stability.
aws_amazonmq_exchange_count | ExchangeCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of exchanges, useful for analyzing message routing activity.
aws_amazonmq_expired_count | ExpiredCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of messages that have expired without being consumed, useful for monitoring failed message deliveries.
aws_amazonmq_heap_usage | HeapUsage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the heap memory usage of the broker, useful for detecting memory-related performance issues.
aws_amazonmq_in_flight_count | InFlightCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of messages currently in transit, helping to ensure the broker isn’t overwhelmed by unacknowledged messages.
aws_amazonmq_inactive_durable_topic_subscribers_count | InactiveDurableTopicSubscribersCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors inactive durable subscribers, useful for tracking unused resources or inefficient topic subscriptions.
aws_amazonmq_job_scheduler_store_percent_usage | JobSchedulerStorePercentUsage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the percentage of the job scheduler store usage, important for capacity planning and performance.
aws_amazonmq_journal_files_for_fast_recovery | JournalFilesForFastRecovery | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of journal files available for fast recovery, ensuring quick system recovery.
aws_amazonmq_journal_files_for_full_recovery | JournalFilesForFullRecovery | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks journal files required for full recovery, ensuring data durability and integrity during failures.
aws_amazonmq_memory_usage | MemoryUsage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the memory usage of the broker, ensuring the broker has adequate memory for message processing.
aws_amazonmq_message_count | MessageCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of messages in the broker, providing insights into message load and storage.
aws_amazonmq_message_ready_count | MessageReadyCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of messages ready for delivery, helping gauge the efficiency of message consumption.
aws_amazonmq_message_unacknowledged_count | MessageUnacknowledgedCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks unacknowledged messages, useful for detecting potential message delivery problems.
aws_amazonmq_network_in | NetworkIn | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the incoming network traffic, useful for tracking data ingestion and throughput.
aws_amazonmq_network_out | NetworkOut | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the outgoing network traffic, helping monitor data egress and bandwidth usage.
aws_amazonmq_open_transaction_count | OpenTransactionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of open transactions, useful for identifying resource contention or potential system stalls.
aws_amazonmq_producer_count | ProducerCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of producers, useful for understanding message production activity in the system.
aws_amazonmq_publish_rate | PublishRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the rate at which messages are being published, providing insights into message inflow.
aws_amazonmq_queue_count | QueueCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of active queues, useful for analyzing message distribution across queues.
aws_amazonmq_queue_size | QueueSize | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the size of the message queues, helping gauge message backlog and system load.
aws_amazonmq_rabbit_mqdisk_free | RabbitMQDiskFree | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the available disk space for RabbitMQ, ensuring that there’s enough storage for message persistence.
aws_amazonmq_rabbit_mqdisk_free_limit | RabbitMQDiskFreeLimit | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the disk free space threshold, alerting when approaching critical limits to avoid disruptions.
aws_amazonmq_rabbit_mqfd_used | RabbitMQFdUsed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of file descriptors used by RabbitMQ, ensuring system resources are not exhausted.
aws_amazonmq_rabbit_mqmem_limit | RabbitMQMemLimit | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the memory usage limit for RabbitMQ, ensuring the broker doesn’t run out of memory.
aws_amazonmq_rabbit_mqmem_used | RabbitMQMemUsed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the memory currently in use by RabbitMQ, useful for monitoring resource efficiency.
aws_amazonmq_receive_count | ReceiveCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of received messages, helping monitor message inflow and processing rates.
aws_amazonmq_store_percent_usage | StorePercentUsage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the percentage of the store usage, ensuring sufficient capacity for message persistence.
aws_amazonmq_system_cpu_utilization | SystemCpuUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the CPU usage of the underlying system, helping to detect potential CPU bottlenecks.
aws_amazonmq_temp_percent_usage | TempPercentUsage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the percentage usage of temporary storage, useful for avoiding storage exhaustion during peak loads.
aws_amazonmq_total_consumer_count | TotalConsumerCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of consumers, helping assess the overall load and activity on the broker.
aws_amazonmq_total_dequeue_count | TotalDequeueCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total number of dequeued messages, useful for analyzing message consumption rates.
aws_amazonmq_total_enqueue_count | TotalEnqueueCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of enqueued messages, providing insights into message production volumes.
aws_amazonmq_total_message_count | TotalMessageCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total count of messages in the system, giving an overview of the message load.
aws_amazonmq_total_producer_count | TotalProducerCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of producers, useful for understanding message inflow activity.
aws_amazonmq_volume_read_ops | VolumeReadOps | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of read operations on the broker’s volume, helping monitor disk I/O performance.
aws_amazonmq_volume_write_ops | VolumeWriteOps | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of write operations on the broker’s volume, useful for detecting disk I/O bottlenecks.
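
Names such as aws_amazonmq_rabbit_mqdisk_free can look surprising next to the CloudWatch name RabbitMQDiskFree. The rows above appear to follow one pattern: the namespace is lowercased with "/" replaced by "_", and the CloudWatch metric name is split wherever a lowercase letter or digit is followed by an uppercase letter. The sketch below reproduces that pattern as inferred from the tables only; the function name `to_metric_name` is hypothetical, and this is not taken from the exporter's source.

```python
import re

# Assumption: these rules are inferred from the metric tables in this page,
# not from the exporter's actual implementation.
_BOUNDARY = re.compile(r"([a-z0-9])([A-Z])")  # lowercase/digit followed by uppercase
_INVALID = re.compile(r"[^a-z0-9_]")          # anything that is not a-z, 0-9 or "_"


def to_metric_name(namespace: str, cloudwatch_metric: str) -> str:
    """Build the metric name shown in the Metric column (hypothetical helper)."""
    # Namespace: lowercase only, no word splitting ("AWS/AmazonMQ" -> "aws_amazonmq").
    prefix = _INVALID.sub("_", namespace.lower())
    # Metric: insert "_" at each lowercase/digit -> uppercase boundary, then lowercase.
    split = _BOUNDARY.sub(r"\1_\2", cloudwatch_metric)
    suffix = _INVALID.sub("_", split.lower())
    return f"{prefix}_{suffix}"


print(to_metric_name("AWS/AmazonMQ", "RabbitMQDiskFree"))   # aws_amazonmq_rabbit_mqdisk_free
print(to_metric_name("AWS/ACMPrivateCA", "CRLGenerated"))   # aws_acmprivateca_crlgenerated
```

Because a run of capitals is never split, "RabbitMQDisk" becomes "rabbit_mqdisk" rather than "rabbit_mq_disk". Treat the table as authoritative if a name ever disagrees with this sketch.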

AWS/ApiGateway

  • Scrape Intervals: 5 minutes
  • Out-of-the-box dashboard included
Metric | CloudWatch Metric | Statistics | Description
aws_apigateway_info | | |
aws_apigateway_4xx | 4xx | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of 4xx client errors, used to track issues related to invalid requests from clients.
aws_apigateway_5xx | 5xx | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of 5xx server errors, used to monitor API Gateway or backend server issues.
aws_apigateway_count | Count | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total number of API requests, providing insights into traffic volume.
aws_apigateway_integration_latency | IntegrationLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the latency between API Gateway and the backend integration, useful for diagnosing performance issues in backend services.
aws_apigateway_latency | Latency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks overall API latency, including both API Gateway processing and backend integration latency, helping to monitor user experience.
aws_apigateway_4_xxerror | 4XXError | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_apigateway_5_xxerror | 5XXError | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_apigateway_cache_hit_count | CacheHitCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_apigateway_cache_miss_count | CacheMissCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_apigateway_client_error | ClientError | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_apigateway_connect_count | ConnectCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_apigateway_data_processed | DataProcessed | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_apigateway_execution_error | ExecutionError | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_apigateway_integration_error | IntegrationError | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_apigateway_message_count | MessageCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
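
The 4xx, 5xx, and Count rows above are counts, so a common way to read them together is as an error rate: the Sum of errors divided by the Sum of requests over the same interval. The sketch below is only an illustration of that arithmetic. The sample values are made up, the helper `error_rate` is hypothetical, and the dictionary keys simply reuse the names from the Metric column; how each statistic is exposed as a series is not described in this section.

```python
def error_rate(error_sum: float, count_sum: float) -> float:
    """Return errors as a fraction of requests, or 0.0 when there was no traffic."""
    if count_sum <= 0:
        return 0.0
    return error_sum / count_sum


# Hypothetical Sum values for a single 5-minute scrape interval.
samples = {
    "aws_apigateway_count": 1200.0,
    "aws_apigateway_4xx": 60.0,
    "aws_apigateway_5xx": 18.0,
}

client_error_rate = error_rate(samples["aws_apigateway_4xx"], samples["aws_apigateway_count"])
server_error_rate = error_rate(samples["aws_apigateway_5xx"], samples["aws_apigateway_count"])

print(f"4xx rate: {client_error_rate:.2%}")  # 4xx rate: 5.00%
print(f"5xx rate: {server_error_rate:.2%}")  # 5xx rate: 1.50%
```

Guarding against a zero request count avoids a division error during quiet intervals, which matters when the same calculation feeds an alert.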

AWS/AppStream

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_appstream_info | | |
aws_appstream_actual_capacity | ActualCapacity | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the actual number of available instances for streaming, used to ensure enough resources are deployed.
aws_appstream_available_capacity | AvailableCapacity | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of instances available for use but not currently in use, helping to gauge spare capacity for handling future demand.
aws_appstream_capacity_utilization | CapacityUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the percentage of capacity utilization, useful for optimizing resource allocation and ensuring cost-effective usage.
aws_appstream_desired_capacity | DesiredCapacity | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Represents the desired number of instances based on scaling policies, helping to monitor scaling efficiency and capacity planning.
aws_appstream_in_use_capacity | InUseCapacity | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of instances currently in use, helping to monitor active workload and resource consumption.
aws_appstream_insufficient_capacity_error | InsufficientCapacityError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of times a capacity request failed due to insufficient resources, indicating capacity shortages or bottlenecks.
aws_appstream_pending_capacity | PendingCapacity | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors instances that are in the process of being provisioned, helping to track the status of scaling events.
aws_appstream_running_capacity | RunningCapacity | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of running instances, providing insights into the active resources currently being used to support users.
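
The capacity metrics above are related to one another. As a rough worked example, and assuming the relationships AWS documents for AppStream fleets (ActualCapacity = AvailableCapacity + InUseCapacity, and CapacityUtilization = InUseCapacity / ActualCapacity × 100), the sketch below derives utilization from hypothetical fleet values. The function name and the numbers are illustrative assumptions, not values from this page.

```python
def capacity_utilization(in_use: float, available: float) -> float:
    """Percentage of fleet instances in use (assumed formula, see lead-in)."""
    actual = in_use + available  # assumed: ActualCapacity = Available + InUse
    if actual <= 0:
        return 0.0  # empty fleet: report 0 rather than divide by zero
    return in_use / actual * 100.0


# Hypothetical fleet: 6 instances streaming, 2 idle and ready for users.
utilization = capacity_utilization(in_use=6, available=2)
print(utilization)  # 75.0
```

Comparing a value computed this way with aws_appstream_capacity_utilization can be a quick sanity check that the individual capacity series are being collected for the same fleet.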

AWS/AppSync

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_appsync_info | | |
aws_appsync_4_xxerror | 4XXError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors client-side (4xx) errors in requests, useful for tracking invalid requests made by clients.
aws_appsync_5_xxerror | 5XXError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks server-side (5xx) errors, helping to detect issues in the API or the server infrastructure.
aws_appsync_active_connections | ActiveConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of active WebSocket connections, useful for understanding the real-time activity on the AppSync API.
aws_appsync_active_subscriptions | ActiveSubscriptions | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of active subscriptions, helping to monitor usage and engagement with subscription-based real-time data services.
aws_appsync_connect_client_error | ConnectClientError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors errors encountered by clients while trying to establish connections, indicating issues in the client-side configuration or request.
aws_appsync_connect_server_error | ConnectServerError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks server-side errors during the connection process, helping to identify server-side failures or misconfigurations during connection attempts.
aws_appsync_connect_success | ConnectSuccess | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the successful WebSocket connection attempts, useful for monitoring overall connection success rates.
aws_appsync_connection_duration | ConnectionDuration | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the duration of WebSocket connections, helping to gauge session longevity and user engagement.
aws_appsync_disconnect_client_error | DisconnectClientError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks errors that occur when clients try to disconnect, useful for monitoring client-side disconnection issues.
aws_appsync_disconnect_server_error | DisconnectServerError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors server-side errors during disconnection, helping to detect issues in properly closing WebSocket connections.
aws_appsync_disconnect_success | DisconnectSuccess | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures successful disconnections from WebSocket connections, useful for ensuring smooth session terminations.
aws_appsync_latency | Latency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the time taken to process requests, useful for monitoring API performance and identifying latency issues.
aws_appsync_publish_data_message_client_error | PublishDataMessageClientError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors client-side errors during data message publishing, used to detect issues with client-side data transmission.
aws_appsync_publish_data_message_server_error | PublishDataMessageServerError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks server-side errors during data message publishing, helping to identify issues in server-side message handling or transmission.
aws_appsync_publish_data_message_size | PublishDataMessageSize | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the size of data messages being published, useful for tracking payload sizes and ensuring efficient message transmission.
aws_appsync_publish_data_message_success | PublishDataMessageSuccess | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks successful data message publications, helping to monitor overall message delivery success.
aws_appsync_requests | Requests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total number of requests processed by AppSync, providing insights into traffic and API usage.
aws_appsync_subscribe_client_error | SubscribeClientError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors client-side errors during subscription attempts, useful for tracking issues in subscribing to real-time data feeds.
aws_appsync_subscribe_server_error | SubscribeServerError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks server-side errors during subscription attempts, helping to identify server failures when clients try to subscribe.
aws_appsync_subscribe_success | SubscribeSuccess | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures successful subscription attempts, useful for monitoring subscription adoption and engagement rates.
aws_appsync_tokens_consumed | TokensConsumed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of tokens consumed by requests, useful for managing API rate limits and monitoring user activity.
aws_appsync_unsubscribe_client_error | UnsubscribeClientError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors client-side errors during unsubscription attempts, used to detect issues when clients try to unsubscribe from data feeds.
aws_appsync_unsubscribe_server_error | UnsubscribeServerError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks server-side errors during unsubscription attempts, useful for identifying server-side issues when clients try to unsubscribe.
aws_appsync_unsubscribe_success | UnsubscribeSuccess | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures successful unsubscription attempts, ensuring smooth termination of real-time data subscriptions.

AWS/ApplicationELB

  • Scrape Intervals: 5 minutes
  • Out-of-the-box dashboard included
Metric | CloudWatch Metric | Statistics | Description
aws_applicationelb_info | | |
aws_applicationelb_active_connection_count | ActiveConnectionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of active connections, useful for understanding current load on the load balancer.
aws_applicationelb_client_tlsnegotiation_error_count | ClientTLSNegotiationErrorCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of failed TLS negotiations between clients and the load balancer, used to detect TLS handshake issues.
aws_applicationelb_consumed_lcus | ConsumedLCUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of Load Balancer Capacity Units (LCUs) used, helping to track resource consumption and cost.
aws_applicationelb_elbauth_error | ELBAuthError | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks errors during authentication processes, useful for monitoring failures in authentication workflows.
aws_applicationelb_elbauth_failure | ELBAuthFailure | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors failed authentication attempts, helping detect potential security issues or configuration problems.
aws_applicationelb_elbauth_latency | ELBAuthLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the latency of authentication requests, useful for identifying delays in authentication workflows.
aws_applicationelb_elbauth_refresh_token_success | ELBAuthRefreshTokenSuccess | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks successful refresh token requests, useful for monitoring token refresh operations.
aws_applicationelb_elbauth_success | ELBAuthSuccess | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures successful authentication requests, useful for monitoring authentication performance.
aws_applicationelb_elbauth_user_claims_size_exceeded | ELBAuthUserClaimsSizeExceeded | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors instances where user claims exceed the allowed size, which can help in tuning authentication configurations.
aws_applicationelb_httpcode_elb_3_xx_count | HTTPCode_ELB_3XX_Count | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of 3xx HTTP responses, which indicate redirection, useful for monitoring redirects on the load balancer.
aws_applicationelb_httpcode_elb_4_xx_count | HTTPCode_ELB_4XX_Count | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of 4xx client error responses, useful for detecting invalid client requests.
aws_applicationelb_httpcode_elb_5_xx_count | HTTPCode_ELB_5XX_Count | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of 5xx server error responses, helping identify backend issues.
aws_applicationelb_httpcode_target_2_xx_count | HTTPCode_Target_2XX_Count | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of successful 2xx responses from targets, useful for tracking successful request handling.
aws_applicationelb_httpcode_target_3_xx_count | HTTPCode_Target_3XX_Count | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of 3xx redirects from target servers, useful for understanding traffic redirection by targets.
aws_applicationelb_httpcode_target_4_xx_count | HTTPCode_Target_4XX_Count | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks 4xx client errors returned by target servers, helping identify configuration or client-side issues.
aws_applicationelb_httpcode_target_5_xx_count | HTTPCode_Target_5XX_Count | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of 5xx errors returned by target servers, useful for identifying server-side issues.
aws_applicationelb_ipv6_processed_bytes | IPv6ProcessedBytes | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of bytes processed over IPv6, useful for tracking IPv6 traffic volume.
aws_applicationelb_ipv6_request_count | IPv6RequestCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of IPv6 requests, providing insights into IPv6 usage and adoption.
aws_applicationelb_new_connection_count | NewConnectionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of new connections established, helping understand connection initiation patterns.
aws_applicationelb_processed_bytes | ProcessedBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total amount of data processed by the load balancer, useful for tracking overall throughput.
aws_applicationelb_rejected_connection_count | RejectedConnectionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of connections rejected by the load balancer, useful for identifying capacity or configuration issues.
aws_applicationelb_request_count | RequestCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total number of requests handled by the load balancer, useful for monitoring traffic volume.
aws_applicationelb_rule_evaluations | RuleEvaluations | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of rule evaluations on the load balancer, helping to monitor rule complexity and processing time.
aws_applicationelb_target_connection_error_count | TargetConnectionErrorCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of connection errors to target servers, useful for identifying connectivity issues between the load balancer and targets.
aws_applicationelb_target_response_time | TargetResponseTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the response time of target servers, helping to track backend performance and latency.
aws_applicationelb_target_tlsnegotiation_error_count | TargetTLSNegotiationErrorCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks failed TLS negotiations between the load balancer and target servers, useful for detecting SSL/TLS issues with backend services.
aws_applicationelb_anomalous_host_count | AnomalousHostCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_desync_mitigation_mode_non_compliant_request_count | DesyncMitigationMode_NonCompliant_Request_Count | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_dropped_invalid_header_request_count | DroppedInvalidHeaderRequestCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_forwarded_invalid_header_request_count | ForwardedInvalidHeaderRequestCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_grpc_request_count | GrpcRequestCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_httpcode_elb_500_count | HTTPCode_ELB_500_Count | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_httpcode_elb_502_count | HTTPCode_ELB_502_Count | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_httpcode_elb_503_count | HTTPCode_ELB_503_Count | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_httpcode_elb_504_count | HTTPCode_ELB_504_Count | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_http_fixed_response_count | HTTP_Fixed_Response_Count | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_http_redirect_count | HTTP_Redirect_Count | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_http_redirect_url_limit_exceeded_count | HTTP_Redirect_Url_Limit_Exceeded_Count | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_healthy_host_count | HealthyHostCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_healthy_state_dns | HealthyStateDNS | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_healthy_state_routing | HealthyStateRouting | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_lambda_internal_error | LambdaInternalError | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_lambda_target_processed_bytes | LambdaTargetProcessedBytes | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_lambda_user_error | LambdaUserError | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_mitigated_host_count | MitigatedHostCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_non_sticky_request_count | NonStickyRequestCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_request_count_per_target | RequestCountPerTarget | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_standard_processed_bytes | StandardProcessedBytes | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_un_healthy_host_count | UnHealthyHostCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_unhealthy_routing_request_count | UnhealthyRoutingRequestCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_applicationelb_unhealthy_state_dnsUnhealthyStateDNSMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_applicationelb_unhealthy_state_routingUnhealthyStateRoutingMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9

AWS/Athena

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_athena_info
aws_athena_engine_execution_timeEngineExecutionTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the time taken by the query engine to execute a query, helping to monitor query performance and identify execution bottlenecks.
aws_athena_processed_bytesProcessedBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of data processed by the query engine, useful for understanding query cost and efficiency.
aws_athena_query_planning_timeQueryPlanningTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the time taken to plan and prepare the query for execution, helping identify delays during the query planning phase.
aws_athena_query_queue_timeQueryQueueTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the time a query spends in the queue before execution, useful for monitoring system load and query prioritization issues.
aws_athena_service_processing_timeServiceProcessingTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the time taken by Athena’s internal services to process a query, helping to identify processing delays within the service.
aws_athena_total_execution_timeTotalExecutionTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total time from query submission to completion, providing a comprehensive view of query performance and potential bottlenecks.
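
The Statistics column mixes two kinds of values: the five basic statistics (Average, Maximum, Minimum, Sum, SampleCount) and percentiles (p50 through p99.9). The CloudWatch API treats these separately: basic statistics go in the `Statistics` parameter and percentiles in `ExtendedStatistics`. The following is a minimal sketch of separating one table cell into the two groups. The helper name `split_statistics` is hypothetical and not part of any Grafana or AWS library.

```python
# The five basic CloudWatch statistics. Anything else in the Statistics
# column of the tables (p50 ... p99.9) is treated as a percentile.
BASIC_STATISTICS = {"Average", "Maximum", "Minimum", "Sum", "SampleCount"}


def split_statistics(cell):
    """Split a comma-separated Statistics cell into (basic, percentiles).

    The original order of the names is preserved within each group.
    """
    names = [name.strip() for name in cell.split(",") if name.strip()]
    basic = [name for name in names if name in BASIC_STATISTICS]
    percentiles = [name for name in names if name not in BASIC_STATISTICS]
    return basic, percentiles


if __name__ == "__main__":
    cell = "Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9"
    basic, percentiles = split_statistics(cell)
    print("Statistics:", basic)
    print("ExtendedStatistics:", percentiles)
```

The same helper works for rows that list the statistics in a different order, such as the rows that start with Sum or Maximum.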

AWS/AutoScaling

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_autoscaling_info
aws_autoscaling_group_and_warm_pool_desired_capacityGroupAndWarmPoolDesiredCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the desired capacity of both the Auto Scaling group and the warm pool, used to ensure adequate resources are provisioned.
aws_autoscaling_group_and_warm_pool_total_capacityGroupAndWarmPoolTotalCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total capacity of the Auto Scaling group and warm pool, providing an overview of the available resources.
aws_autoscaling_group_desired_capacityGroupDesiredCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the desired number of instances in the Auto Scaling group, useful for capacity planning and scaling decisions.
aws_autoscaling_group_in_service_capacityGroupInServiceCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of instances currently in service, helping to monitor the active workload.
aws_autoscaling_group_in_service_instancesGroupInServiceInstancesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the actual number of instances currently running in the group, useful for managing resource availability.
aws_autoscaling_group_max_sizeGroupMaxSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the maximum size of the Auto Scaling group, helping ensure the group does not exceed the defined limit.
aws_autoscaling_group_min_sizeGroupMinSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the minimum size of the Auto Scaling group, ensuring a baseline level of capacity is maintained.
aws_autoscaling_group_pending_capacityGroupPendingCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the capacity of instances that are pending launch, useful for understanding the state of scaling events.
aws_autoscaling_group_pending_instancesGroupPendingInstancesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of instances that are pending launch, helping monitor scaling processes in progress.
aws_autoscaling_group_standby_capacityGroupStandbyCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the capacity of instances in standby mode, useful for tracking inactive but available resources.
aws_autoscaling_group_standby_instancesGroupStandbyInstancesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of instances in standby mode, helping assess resource availability for scaling.
aws_autoscaling_group_terminating_capacityGroupTerminatingCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the capacity of instances being terminated, helping to monitor scaling down activities.
aws_autoscaling_group_terminating_instancesGroupTerminatingInstancesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of instances being terminated, useful for understanding scaling down operations.
aws_autoscaling_group_total_capacityGroupTotalCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total capacity of the Auto Scaling group, providing a complete view of resources available for scaling.
aws_autoscaling_group_total_instancesGroupTotalInstancesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of instances in the Auto Scaling group, helping to monitor overall resource allocation.
aws_autoscaling_predictive_scaling_capacity_forecastPredictiveScalingCapacityForecastAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Provides forecasted capacity based on predictive scaling, helping to plan for future resource needs.
aws_autoscaling_predictive_scaling_load_forecastPredictiveScalingLoadForecastAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks forecasted load on the Auto Scaling group, helping to ensure capacity meets future demand.
aws_autoscaling_predictive_scaling_metric_pair_correlationPredictiveScalingMetricPairCorrelationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the correlation between metric pairs for predictive scaling, useful for improving prediction accuracy.
aws_autoscaling_warm_pool_desired_capacityWarmPoolDesiredCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the desired capacity of the warm pool, helping to ensure the pool has sufficient resources for quick scaling.
aws_autoscaling_warm_pool_min_sizeWarmPoolMinSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the minimum size of the warm pool, ensuring a baseline level of resources for rapid scaling.
aws_autoscaling_warm_pool_pending_capacityWarmPoolPendingCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the capacity of instances pending in the warm pool, useful for understanding warm pool availability.
aws_autoscaling_warm_pool_terminating_capacityWarmPoolTerminatingCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the capacity of instances being terminated in the warm pool, helping to track scaling down activities.
aws_autoscaling_warm_pool_total_capacityWarmPoolTotalCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total capacity of the warm pool, providing a complete view of available resources for quick scaling.
aws_autoscaling_warm_pool_warmed_capacityWarmPoolWarmedCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the capacity of warmed instances in the warm pool, useful for tracking resources that are ready for immediate use.

AWS/Backup

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_backup_info
aws_backup_number_of_backup_jobs_abortedNumberOfBackupJobsAbortedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of backup jobs that were aborted, useful for monitoring failed or incomplete backup operations.
aws_backup_number_of_backup_jobs_completedNumberOfBackupJobsCompletedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of backup jobs successfully completed, useful for tracking the effectiveness of backup operations.
aws_backup_number_of_backup_jobs_createdNumberOfBackupJobsCreatedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of backup jobs initiated, helping to monitor backup frequency and schedule adherence.
aws_backup_number_of_backup_jobs_expiredNumberOfBackupJobsExpiredAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of backup jobs that have expired, useful for ensuring data retention policies are followed.
aws_backup_number_of_backup_jobs_failedNumberOfBackupJobsFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of backup jobs that have failed, useful for identifying errors in the backup process.
aws_backup_number_of_backup_jobs_pendingNumberOfBackupJobsPendingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of backup jobs currently in a pending state, helping monitor delays or scheduling issues.
aws_backup_number_of_backup_jobs_runningNumberOfBackupJobsRunningAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of backup jobs that are currently running, useful for tracking ongoing backup processes.
aws_backup_number_of_copy_jobs_completedNumberOfCopyJobsCompletedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of copy jobs successfully completed, helping track backup data replication across regions or storage tiers.
aws_backup_number_of_copy_jobs_createdNumberOfCopyJobsCreatedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of initiated copy jobs, useful for monitoring data replication schedules.
aws_backup_number_of_copy_jobs_failedNumberOfCopyJobsFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of failed copy jobs, helping to detect issues with backup replication processes.
aws_backup_number_of_copy_jobs_runningNumberOfCopyJobsRunningAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of copy jobs currently in progress, useful for monitoring ongoing replication activities.
aws_backup_number_of_recovery_points_coldNumberOfRecoveryPointsColdAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of cold (archived) recovery points, useful for tracking long-term storage of backup data.
aws_backup_number_of_recovery_points_completedNumberOfRecoveryPointsCompletedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of recovery points successfully created, helping to ensure that data can be restored when needed.
aws_backup_number_of_recovery_points_deletingNumberOfRecoveryPointsDeletingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of recovery points being deleted, useful for tracking clean-up or retention policy actions.
aws_backup_number_of_recovery_points_expiredNumberOfRecoveryPointsExpiredAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of expired recovery points, useful for ensuring compliance with retention policies.
aws_backup_number_of_recovery_points_partialNumberOfRecoveryPointsPartialAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of incomplete (partial) recovery points, helping to identify issues with backup integrity or storage capacity.
aws_backup_number_of_restore_jobs_completedNumberOfRestoreJobsCompletedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of successful restore jobs, useful for tracking data recovery operations.
aws_backup_number_of_restore_jobs_failedNumberOfRestoreJobsFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of restore jobs that have failed, useful for identifying problems in the recovery process.
aws_backup_number_of_restore_jobs_pendingNumberOfRestoreJobsPendingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of restore jobs that are pending, useful for monitoring delays in data recovery.
aws_backup_number_of_restore_jobs_runningNumberOfRestoreJobsRunningAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of restore jobs currently in progress, helping to track ongoing recovery processes.

AWS/Bedrock

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_bedrock_input_token_countInputTokenCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_bedrock_invocation_client_errorsInvocationClientErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_bedrock_invocation_latencyInvocationLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_bedrock_invocation_server_errorsInvocationServerErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_bedrock_invocation_throttlesInvocationThrottlesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_bedrock_invocationsInvocationsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_bedrock_legacy_model_invocationsLegacyModelInvocationsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_bedrock_output_image_countOutputImageCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_bedrock_output_token_countOutputTokenCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9

AWS/Billing

Note: this service only publishes metrics in specific AWS regions. Any job configured with this service gathers data only from the us-east-1 region.

  • Scrape Intervals: 5 minutes
  • Out-of-the-box dashboard included
MetricCloudwatch MetricStatisticsDescription
aws_billing_estimated_chargesEstimatedChargesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the estimated charges for your AWS account, providing insights into overall AWS cost and usage. This is useful for budget monitoring and cost management over time, helping to identify cost spikes or unusual charges.

AWS/Cassandra

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_cassandra_info
aws_cassandra_account_max_readsAccountMaxReadsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the maximum number of read requests for the account, helping monitor and manage read activity and limits.
aws_cassandra_account_max_table_level_readsAccountMaxTableLevelReadsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the maximum number of reads at the table level, useful for understanding read distribution across tables.
aws_cassandra_account_max_table_level_writesAccountMaxTableLevelWritesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the maximum number of write operations at the table level, helping identify write-heavy tables.
aws_cassandra_account_max_writesAccountMaxWritesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the maximum number of write requests for the account, useful for managing overall write throughput.
aws_cassandra_account_provisioned_read_capacity_utilizationAccountProvisionedReadCapacityUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the utilization of provisioned read capacity, helping ensure optimal read capacity allocation.
aws_cassandra_account_provisioned_write_capacity_utilizationAccountProvisionedWriteCapacityUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the utilization of provisioned write capacity, ensuring efficient use of write resources.
aws_cassandra_conditional_check_failed_requestsConditionalCheckFailedRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of failed conditional checks, useful for monitoring logical errors during write operations.
aws_cassandra_consumed_read_capacity_unitsConsumedReadCapacityUnitsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of read capacity units consumed, helping monitor read activity and optimize capacity.
aws_cassandra_consumed_write_capacity_unitsConsumedWriteCapacityUnitsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of write capacity units consumed, providing insights into write operations and capacity optimization.
aws_cassandra_max_provisioned_table_read_capacity_utilizationMaxProvisionedTableReadCapacityUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the maximum utilization of provisioned read capacity at the table level, helping manage read resources per table.
aws_cassandra_max_provisioned_table_write_capacity_utilizationMaxProvisionedTableWriteCapacityUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the maximum utilization of provisioned write capacity at the table level, ensuring efficient use of write resources per table.
aws_cassandra_returned_item_countReturnedItemCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total number of items returned by read operations, useful for understanding query efficiency.
aws_cassandra_returned_item_count_by_selectReturnedItemCountBySelectAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of items returned by select queries, helping optimize query results and performance.
aws_cassandra_successful_request_countSuccessfulRequestCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of successful requests, providing insights into the operational success rate of read and write operations.
aws_cassandra_successful_request_latencySuccessfulRequestLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the latency of successful requests, helping to optimize performance and identify bottlenecks.
aws_cassandra_system_errorsSystemErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of system-related errors, useful for identifying and addressing infrastructure or service issues.
aws_cassandra_user_errorsUserErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of user-related errors, helping identify application-level issues or misconfigurations.

AWS/CertificateManager

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_certificatemanager_info
aws_certificatemanager_days_to_expiryDaysToExpiryAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of days remaining until an SSL/TLS certificate expires. This metric is useful for monitoring certificate lifecycles and ensuring that certificates are renewed before expiration to avoid service disruptions.
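
As an illustration of how DaysToExpiry might be used to catch certificates before they lapse, the sketch below flags certificates whose latest value is at or below a threshold. The input mapping, the function name `expiring_certificates`, and the 30-day default are assumptions made for this example; in practice the values would come from whatever query returns the latest sample per certificate.

```python
def expiring_certificates(days_to_expiry, threshold_days=30):
    """Return (certificate, days) pairs at or below the threshold, soonest first.

    days_to_expiry: mapping of a certificate identifier to its most recent
    DaysToExpiry value (hypothetical input shape for this sketch).
    threshold_days: renewal warning window; 30 is an arbitrary default.
    """
    flagged = [
        (certificate, days)
        for certificate, days in days_to_expiry.items()
        if days <= threshold_days
    ]
    # Sort by remaining days so the most urgent certificate comes first.
    return sorted(flagged, key=lambda item: item[1])


if __name__ == "__main__":
    latest = {"cert-a": 10, "cert-b": 45, "cert-c": 3}
    for certificate, days in expiring_certificates(latest):
        print(f"{certificate} expires in {days} days")
```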

AWS/CloudFront

  • Scrape Intervals: 5 minutes
  • Out-of-the-box dashboard included
MetricCloudwatch MetricStatisticsDescription
aws_cloudfront_info
aws_cloudfront_4xx_error_rate4xxErrorRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the rate of 4xx client-side errors, helping to monitor user request issues.
aws_cloudfront_5xx_error_rate5xxErrorRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the rate of 5xx server-side errors, useful for detecting backend or CloudFront issues.
aws_cloudfront_bytes_downloadedBytesDownloadedSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total bytes downloaded via CloudFront, useful for monitoring bandwidth usage.
aws_cloudfront_bytes_uploadedBytesUploadedSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the amount of data uploaded to CloudFront, helping track upload activity.
aws_cloudfront_requestsRequestsSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of requests processed by CloudFront, providing insight into traffic volume.
aws_cloudfront_total_error_rateTotalErrorRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the combined rate of all error responses (both 4xx and 5xx), helping monitor service reliability.
aws_cloudfront_401_error_rate401ErrorRateMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_403_error_rate403ErrorRateMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_404_error_rate404ErrorRateMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_502_error_rate502ErrorRateMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_503_error_rate503ErrorRateMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_504_error_rate504ErrorRateMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_cache_hit_rateCacheHitRateMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_function_compute_utilizationFunctionComputeUtilizationMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_function_execution_errorsFunctionExecutionErrorsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_function_invocationsFunctionInvocationsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_function_throttlesFunctionThrottlesMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_function_validation_errorsFunctionValidationErrorsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_lambda_execution_errorLambdaExecutionErrorMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_lambda_limit_exceeded_errorsLambdaLimitExceededErrorsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_lambda_validation_errorLambdaValidationErrorMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_cloudfront_origin_latencyOriginLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9

AWS/Cognito

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_cognito_info
aws_cognito_account_take_over_riskAccountTakeOverRiskAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the risk of account takeover attempts, useful for detecting malicious login attempts.
aws_cognito_compromised_credentials_riskCompromisedCredentialsRiskAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the risk of compromised credentials, helping to detect and mitigate security threats.
aws_cognito_federation_successesFederationSuccessesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful federated sign-ins, useful for monitoring third-party identity provider usage.
aws_cognito_federation_throttlesFederationThrottlesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of throttled federation sign-in attempts, useful for identifying rate-limiting issues.
aws_cognito_no_riskNoRiskAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of requests in which Amazon Cognito identified no risk.
aws_cognito_override_blockOverrideBlockAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks requests that Amazon Cognito blocked because of the developer-provided configuration, useful for auditing how blocking rules are applied.
aws_cognito_riskRiskAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks general login risk events, helping to monitor suspicious activity.
aws_cognito_sign_in_successesSignInSuccessesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful sign-ins, helping to monitor user authentication success.
aws_cognito_sign_in_throttlesSignInThrottlesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of throttled sign-in attempts, useful for detecting excessive login activity or rate-limiting.
aws_cognito_sign_up_successesSignUpSuccessesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks successful user sign-ups, providing insight into account creation trends.
aws_cognito_sign_up_throttlesSignUpThrottlesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures throttled sign-up attempts, useful for identifying potential rate-limiting or abuse during account creation.
aws_cognito_token_refresh_successesTokenRefreshSuccessesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful token refreshes, useful for monitoring user session continuity.
aws_cognito_token_refresh_throttlesTokenRefreshThrottlesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of throttled token refresh requests, helping identify rate-limiting or session issues.

AWS/DDoSProtection

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_ddosprotection_info
aws_ddosprotection_ddo_sattack_bits_per_secondDDoSAttackBitsPerSecondAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the volume of a DDoS attack in terms of data transfer per second, useful for detecting bandwidth-based attacks.
aws_ddosprotection_ddo_sattack_packets_per_secondDDoSAttackPacketsPerSecondAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of packets involved in a DDoS attack per second, helping to identify packet flood attacks.
aws_ddosprotection_ddo_sattack_requests_per_secondDDoSAttackRequestsPerSecondAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of requests in a DDoS attack per second, useful for identifying application-layer DDoS attacks.
aws_ddosprotection_ddo_sdetectedDDoSDetectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Indicates whether a DDoS event is underway for a protected resource; the value is non-zero during an event, which makes it a natural signal to alert on.
aws_ddosprotection_volume_bits_per_secondVolumeBitsPerSecondAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the data transfer volume per second during a DDoS attack, helping to understand the scale of the attack.
aws_ddosprotection_volume_packets_per_secondVolumePacketsPerSecondAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the volume of packets per second, useful for tracking the size of DDoS attacks in terms of packet rate.
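
Names such as `aws_ddosprotection_ddo_sattack_bits_per_second` look odd but follow a consistent pattern across the rows in this list. The namespace appears to be lowercased with `/` replaced by `_`. The CloudWatch metric name appears to get an underscore wherever a lowercase letter or digit is followed by an uppercase letter, and is then lowercased. The sketch below reproduces that pattern. It is inferred from the table rows only, not taken from a documented rule, and the function name `prometheus_name` is hypothetical.

```python
import re

# An underscore is inserted between a lowercase letter or digit and the
# uppercase letter that follows it (for example "oS" becomes "o_S").
_BOUNDARY = re.compile(r"([a-z0-9])([A-Z])")
# Any run of characters other than lowercase letters and digits becomes "_".
_NON_ALNUM = re.compile(r"[^a-z0-9]+")


def prometheus_name(namespace, cloudwatch_metric):
    """Derive the metric name shown in the Metric column (inferred pattern)."""
    # The namespace is only lowercased; it is not split on case boundaries.
    prefix = _NON_ALNUM.sub("_", namespace.lower())
    split = _BOUNDARY.sub(r"\1_\2", cloudwatch_metric)
    suffix = _NON_ALNUM.sub("_", split.lower())
    return f"{prefix}_{suffix}"


if __name__ == "__main__":
    print(prometheus_name("AWS/DDoSProtection", "DDoSAttackBitsPerSecond"))
    print(prometheus_name("AWS/ApplicationELB", "HTTPCode_ELB_500_Count"))
    print(prometheus_name("AWS/CloudFront", "4xxErrorRate"))
```

Because runs of uppercase letters are never split, acronyms fuse with the word that follows them, which is why `DDoSAttack` becomes `ddo_sattack` and `CDCChanges` becomes `cdcchanges`.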

AWS/DMS

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_dms_info
aws_dms_cdcchanges_disk_sourceCDCChangesDiskSourceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of rows accumulating on disk and waiting to be committed from the source during Change Data Capture (CDC) operations, useful for spotting a backlog of changes that has spilled to disk.
aws_dms_cdcchanges_disk_targetCDCChangesDiskTargetAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of rows accumulating on disk and waiting to be committed to the target during CDC, useful for spotting a target-side apply backlog.
aws_dms_cdcchanges_memory_sourceCDCChangesMemorySourceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of rows accumulating in memory and waiting to be committed from the source during CDC operations, helping monitor how many changes are buffered on the source side.
aws_dms_cdcchanges_memory_targetCDCChangesMemoryTargetAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of rows accumulating in memory and waiting to be committed to the target during CDC operations, useful for monitoring how many changes are buffered on the target side.
aws_dms_cdcincoming_changesCDCIncomingChangesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of incoming changes during CDC operations, helping to monitor the rate of data changes.
aws_dms_cdclatency_sourceCDCLatencySourceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks latency on the source side during CDC operations, helping to identify performance issues with data changes.
aws_dms_cdclatency_targetCDCLatencyTargetAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the latency on the target side during CDC operations, useful for tracking potential bottlenecks.
aws_dms_cdcthroughput_bandwidth_sourceCDCThroughputBandwidthSourceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the source bandwidth usage during CDC operations, helping to monitor network usage.
aws_dms_cdcthroughput_bandwidth_targetCDCThroughputBandwidthTargetAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the target bandwidth usage during CDC, useful for tracking data transfer rates.
aws_dms_cdcthroughput_rows_sourceCDCThroughputRowsSourceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of rows processed from the source during CDC operations, useful for monitoring data throughput.
aws_dms_cdcthroughput_rows_targetCDCThroughputRowsTargetAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of rows written to the target during CDC, helping to ensure data is migrated efficiently.
aws_dms_cpuutilizationCPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the CPU usage of DMS instances, helping to ensure that the system has enough resources to perform migrations.
aws_dms_free_storage_spaceFreeStorageSpaceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of free storage available on the DMS instance, useful for preventing storage exhaustion during migrations.
aws_dms_freeable_memoryFreeableMemoryAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the available memory on the DMS instance, useful for ensuring that enough memory is available for operations.
aws_dms_full_load_throughput_bandwidth_sourceFullLoadThroughputBandwidthSourceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks bandwidth usage during full load operations on the source, useful for monitoring network utilization.
aws_dms_full_load_throughput_bandwidth_targetFullLoadThroughputBandwidthTargetAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors bandwidth usage during full load operations on the target, helping track data transfer efficiency.
aws_dms_full_load_throughput_rows_sourceFullLoadThroughputRowsSourceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of rows processed from the source during full load migrations, helping to monitor data throughput.
aws_dms_full_load_throughput_rows_targetFullLoadThroughputRowsTargetAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of rows loaded to the target during full load operations, helping to ensure migration progress.
aws_dms_network_receive_throughputNetworkReceiveThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the network receive rate, helping to monitor inbound network performance during migrations.
aws_dms_network_transmit_throughputNetworkTransmitThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the network transmit rate, useful for monitoring outbound network performance.
aws_dms_read_iopsReadIOPSAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of read operations per second, helping to monitor disk read performance.
aws_dms_read_latencyReadLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the latency of read operations, helping to identify performance issues in disk reads.
aws_dms_read_throughputReadThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the throughput of read operations, useful for tracking how much data is being read during migrations.
aws_dms_swap_usageSwapUsageAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of swap space used, helping monitor memory performance.
aws_dms_write_iopsWriteIOPSAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of write operations per second, useful for monitoring disk write performance.
aws_dms_write_latencyWriteLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency of write operations, helping identify performance issues during data writes.
aws_dms_write_throughputWriteThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the throughput of write operations, helping to understand the speed of data writes during migration operations.
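
The two CDC latency metrics are most useful when read together. As a rough sketch, assuming both values are in seconds and that the target latency includes any delay already present on the source, the difference between them approximates the time spent applying changes to the target. The function name and the 60-second threshold are hypothetical and should be tuned per migration task.

```python
def locate_cdc_lag(source_latency_s: float,
                   target_latency_s: float,
                   threshold_s: float = 60.0) -> str:
    """Return "source", "target", or "ok" for a pair of CDC latency samples.

    Assumption: target latency includes source latency, so the extra delay
    added while applying changes is the (non-negative) difference.
    """
    apply_lag_s = max(target_latency_s - source_latency_s, 0.0)
    if source_latency_s >= threshold_s:
        # Changes are already late when they are captured from the source.
        return "source"
    if apply_lag_s >= threshold_s:
        # Capture is keeping up, but applying to the target is falling behind.
        return "target"
    return "ok"
```

A sample with 5 seconds of source latency and 300 seconds of target latency would be classified as a target-side delay.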

AWS/DX

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_dx_info
aws_dx_connection_bps_egress | ConnectionBpsEgress | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the egress bandwidth (bits per second) for Direct Connect connections, helping monitor outbound data transfer.
aws_dx_connection_bps_ingress | ConnectionBpsIngress | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the ingress bandwidth (bits per second), providing insights into inbound data transfer rates.
aws_dx_connection_crcerror_count | ConnectionCRCErrorCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks CRC errors on the connection, useful for identifying data integrity issues or hardware problems.
aws_dx_connection_encryption_state | ConnectionEncryptionState | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the encryption state of Direct Connect connections, helping ensure secure data transfer.
aws_dx_connection_error_count | ConnectionErrorCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of errors on the Direct Connect connection, useful for diagnosing connectivity issues.
aws_dx_connection_light_level_rx | ConnectionLightLevelRx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the received light level, helping monitor the health of fiber optic connections.
aws_dx_connection_light_level_tx | ConnectionLightLevelTx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the transmitted light level, helping ensure proper signal strength in fiber optic connections.
aws_dx_connection_pps_egress | ConnectionPpsEgress | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of packets per second being transmitted (egress), useful for tracking network traffic patterns.
aws_dx_connection_pps_ingress | ConnectionPpsIngress | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of packets per second being received (ingress), useful for understanding inbound traffic load.
aws_dx_connection_state | ConnectionState | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the operational state of Direct Connect connections, helping to detect connection status changes.
aws_dx_virtual_interface_bps_egress | VirtualInterfaceBpsEgress | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the outbound bandwidth usage for virtual interfaces, helping track the data flow from virtual interfaces.
aws_dx_virtual_interface_bps_ingress | VirtualInterfaceBpsIngress | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors inbound bandwidth usage for virtual interfaces, providing insight into data ingress through virtual interfaces.
aws_dx_virtual_interface_pps_egress | VirtualInterfacePpsEgress | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of outbound packets per second for virtual interfaces, helping monitor packet-based traffic.
aws_dx_virtual_interface_pps_ingress | VirtualInterfacePpsIngress | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of inbound packets per second for virtual interfaces, useful for monitoring packet-level ingress.
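
Because the bandwidth metrics are reported in bits per second, they can be compared directly with the port speed of the connection. A minimal sketch follows; the function name is hypothetical and the port speed is an input you supply, since it is not one of the metrics listed here.

```python
def link_utilization(bps: float, port_speed_gbps: float) -> float:
    """Return the fraction of a connection's port speed in use (0.0 to 1.0).

    bps is a sample of a bits-per-second metric such as ConnectionBpsEgress;
    port_speed_gbps is the provisioned port speed (an assumed input).
    """
    if port_speed_gbps <= 0:
        raise ValueError("port_speed_gbps must be positive")
    return bps / (port_speed_gbps * 1e9)
```

For instance, 2.5 Gbit/s of egress on a 10 Gbit/s port is a utilization of 0.25.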

AWS/DocDB

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_docdb_info
aws_docdb_backup_retention_period_storage_used | BackupRetentionPeriodStorageUsed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the amount of storage used for backup retention, helping manage backup costs and storage.
aws_docdb_buffer_cache_hit_ratio | BufferCacheHitRatio | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the cache hit ratio, helping to ensure data is being effectively cached.
aws_docdb_cpuutilization | CPUUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the CPU usage of the database, useful for monitoring resource consumption.
aws_docdb_change_stream_log_size | ChangeStreamLogSize | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the size of the change stream log, helping monitor the volume of changes being processed.
aws_docdb_dbcluster_replica_lag_maximum | DBClusterReplicaLagMaximum | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the maximum replication lag between the primary and replica nodes in the cluster.
aws_docdb_dbcluster_replica_lag_minimum | DBClusterReplicaLagMinimum | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the minimum replication lag, helping ensure data replication is kept in sync.
aws_docdb_dbinstance_replica_lag | DBInstanceReplicaLag | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors replication lag at the instance level, useful for tracking data consistency across instances.
aws_docdb_database_connections | DatabaseConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of active connections to the database, helping monitor connection load.
aws_docdb_database_connections_max | DatabaseConnectionsMax | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the maximum number of open connections observed on an instance within a one-minute period, helping to spot short connection spikes before they exhaust the connection limit.
aws_docdb_database_cursors | DatabaseCursors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of database cursors in use, helping monitor query processing.
aws_docdb_database_cursors_max | DatabaseCursorsMax | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the maximum number of open cursors observed on an instance within a one-minute period, useful for managing resource limits.
aws_docdb_database_cursors_timed_out | DatabaseCursorsTimedOut | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks cursors that have timed out, helping identify performance issues.
aws_docdb_disk_queue_depth | DiskQueueDepth | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the depth of the disk I/O queue, useful for monitoring disk performance.
aws_docdb_documents_deleted | DocumentsDeleted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of documents deleted, helping to monitor data deletion operations.
aws_docdb_documents_inserted | DocumentsInserted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of documents inserted, helping to track data growth in the database.
aws_docdb_documents_returned | DocumentsReturned | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of documents returned by queries, useful for monitoring query performance.
aws_docdb_documents_updated | DocumentsUpdated | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of documents updated, helping track changes in the database.
aws_docdb_engine_uptime | EngineUptime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total uptime of the database engine, useful for tracking availability.
aws_docdb_free_local_storage | FreeLocalStorage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the amount of free storage on the database node, helping to prevent storage exhaustion.
aws_docdb_freeable_memory | FreeableMemory | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the amount of free memory, useful for ensuring sufficient memory availability.
aws_docdb_network_receive_throughput | NetworkReceiveThroughput | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of data being received by the database, useful for tracking inbound network usage.
aws_docdb_network_throughput | NetworkThroughput | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors overall network throughput, helping track both inbound and outbound traffic.
aws_docdb_network_transmit_throughput | NetworkTransmitThroughput | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of data being transmitted from the database, helping track outbound traffic.
aws_docdb_opcounters_command | OpcountersCommand | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of database commands executed, useful for monitoring operational throughput.
aws_docdb_opcounters_delete | OpcountersDelete | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of delete operations, useful for tracking data modifications.
aws_docdb_opcounters_getmore | OpcountersGetmore | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of getMore operations, useful for monitoring pagination in queries.
aws_docdb_opcounters_insert | OpcountersInsert | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of insert operations, helping monitor data insert performance.
aws_docdb_opcounters_query | OpcountersQuery | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of queries executed, useful for tracking query load.
aws_docdb_opcounters_update | OpcountersUpdate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of update operations, helping monitor data modifications in the database.
aws_docdb_read_iops | ReadIOPS | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of input/output operations per second for reads, helping to monitor read performance.
aws_docdb_read_latency | ReadLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the latency of read operations, helping to identify performance issues with data retrieval.
aws_docdb_read_throughput | ReadThroughput | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the rate of data being read from the database, useful for tracking read performance.
aws_docdb_snapshot_storage_used | SnapshotStorageUsed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the amount of storage used for database snapshots, helping manage backup storage costs.
aws_docdb_swap_usage | SwapUsage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the amount of swap space used, helping track memory efficiency.
aws_docdb_total_backup_storage_billed | TotalBackupStorageBilled | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the amount of backup storage billed, useful for understanding backup costs.
aws_docdb_volume_bytes_used | VolumeBytesUsed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of storage volume in use, helping track database storage usage.
aws_docdb_volume_read_iops | VolumeReadIOPs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of read input/output operations per second on the storage volume, useful for monitoring storage performance.
aws_docdb_volume_write_iops | VolumeWriteIOPs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of write I/O operations per second, helping monitor write performance on the storage volume.
aws_docdb_write_iops | WriteIOPS | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of write operations per second, useful for tracking write throughput.
aws_docdb_write_latency | WriteLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the latency of write operations, helping to identify performance bottlenecks during data insertion or updates.
aws_docdb_write_throughput | WriteThroughput | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the rate at which data is written to the database, useful for understanding write performance.
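
The Opcounters metrics are easier to interpret as a workload mix than as separate counts. The following is an illustrative sketch only: the function name and the dictionary keys are hypothetical, and the counts are assumed to be values you have already read for the same time window.

```python
from typing import Dict


def operation_mix(opcounters: Dict[str, float]) -> Dict[str, float]:
    """Return each operation type's share of the total operation count."""
    total = sum(opcounters.values())
    if total == 0:
        # No operations in the window: report a zero share for every type.
        return {op: 0.0 for op in opcounters}
    return {op: count / total for op, count in opcounters.items()}
```

With counts of 10 inserts, 70 queries, 15 updates, and 5 deletes, the query share is 0.7, which indicates a read-heavy workload.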

AWS/DynamoDB

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_dynamodb_info
aws_dynamodb_account_max_reads | AccountMaxReads | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the maximum number of read capacity units that the account can use, helping track the account-level read quota.
aws_dynamodb_account_max_table_level_reads | AccountMaxTableLevelReads | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the maximum number of read capacity units that a single table or global secondary index in the account can use.
aws_dynamodb_account_max_table_level_writes | AccountMaxTableLevelWrites | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the maximum number of write capacity units that a single table or global secondary index in the account can use.
aws_dynamodb_account_max_writes | AccountMaxWrites | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the maximum number of write capacity units that the account can use, helping track the account-level write quota.
aws_dynamodb_account_provisioned_read_capacity_utilization | AccountProvisionedReadCapacityUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the utilization of the provisioned read capacity, helping ensure sufficient read capacity allocation.
aws_dynamodb_account_provisioned_write_capacity_utilization | AccountProvisionedWriteCapacityUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the utilization of the provisioned write capacity, useful for efficient capacity management.
aws_dynamodb_age_of_oldest_unreplicated_record | AgeOfOldestUnreplicatedRecord | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the age of the oldest unreplicated record, helping track replication lag.
aws_dynamodb_conditional_check_failed_requests | ConditionalCheckFailedRequests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of failed conditional checks, useful for identifying logical issues during write operations.
aws_dynamodb_consumed_change_data_capture_units | ConsumedChangeDataCaptureUnits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of consumed Change Data Capture units, helping monitor CDC-based operations.
aws_dynamodb_consumed_read_capacity_units | ConsumedReadCapacityUnits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total read capacity units consumed, helping track and optimize read operations.
aws_dynamodb_consumed_write_capacity_units | ConsumedWriteCapacityUnits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total write capacity units consumed, useful for monitoring and optimizing write operations.
aws_dynamodb_failed_to_replicate_record_count | FailedToReplicateRecordCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of records that failed to replicate, useful for identifying replication issues.
aws_dynamodb_max_provisioned_table_read_capacity_utilization | MaxProvisionedTableReadCapacityUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the maximum utilization of the provisioned read capacity at the table level, useful for understanding table-specific read activity.
aws_dynamodb_max_provisioned_table_write_capacity_utilization | MaxProvisionedTableWriteCapacityUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the maximum utilization of provisioned write capacity at the table level, helping optimize write capacity.
aws_dynamodb_on_demand_max_read_request_units | OnDemandMaxReadRequestUnits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the maximum number of read request units in on-demand mode, useful for managing scaling costs.
aws_dynamodb_on_demand_max_write_request_units | OnDemandMaxWriteRequestUnits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the maximum number of write request units in on-demand mode, helping optimize scaling and cost management.
aws_dynamodb_online_index_consumed_write_capacity | OnlineIndexConsumedWriteCapacity | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the write capacity consumed by online index builds, useful for tracking index creation overhead.
aws_dynamodb_online_index_percentage_progress | OnlineIndexPercentageProgress | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the progress of online index creation, useful for understanding index build status.
aws_dynamodb_online_index_throttle_events | OnlineIndexThrottleEvents | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks throttle events during online index creation, useful for detecting capacity constraints.
aws_dynamodb_pending_replication_count | PendingReplicationCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of records pending replication, useful for tracking replication progress.
aws_dynamodb_provisioned_read_capacity_units | ProvisionedReadCapacityUnits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total provisioned read capacity units, useful for managing resource allocation.
aws_dynamodb_provisioned_write_capacity_units | ProvisionedWriteCapacityUnits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total provisioned write capacity units, helping ensure proper capacity allocation.
aws_dynamodb_read_throttle_events | ReadThrottleEvents | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of throttled read requests, useful for identifying capacity limitations.
aws_dynamodb_replication_latency | ReplicationLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the replication latency, helping ensure timely data consistency across replicas.
aws_dynamodb_returned_bytes | ReturnedBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of bytes returned by GetRecords operations on DynamoDB Streams, useful for tracking stream read volume.
aws_dynamodb_returned_item_count | ReturnedItemCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total number of items returned by read operations, useful for monitoring query performance.
aws_dynamodb_returned_records_count | ReturnedRecordsCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of stream records returned by GetRecords operations on DynamoDB Streams, useful for understanding stream consumer load.
aws_dynamodb_successful_request_latency | SuccessfulRequestLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the latency of successful requests, useful for optimizing request performance.
aws_dynamodb_system_errors | SystemErrors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks system-level errors, helping identify infrastructure or platform issues.
aws_dynamodb_throttled_put_record_count | ThrottledPutRecordCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of records throttled by the Kinesis data stream because of insufficient Kinesis Data Streams capacity, useful when streaming table changes to Kinesis.
aws_dynamodb_throttled_requests | ThrottledRequests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of throttled requests, helping to identify capacity limitations or traffic spikes.
aws_dynamodb_time_to_live_deleted_item_count | TimeToLiveDeletedItemCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of items deleted due to Time to Live (TTL) expiration, useful for managing automatic data deletion.
aws_dynamodb_transaction_conflict | TransactionConflict | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of transaction conflicts, helping to optimize transaction performance.
aws_dynamodb_user_errors | UserErrors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks user-level errors, helping identify application issues.
aws_dynamodb_write_throttle_events | WriteThrottleEvents | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of throttled write requests, useful for identifying capacity constraints during write operations.
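
Consumed capacity is reported as a total for each period, while provisioned capacity is a per-second figure, so the two need to be put on the same scale before comparing them. A minimal sketch, assuming you have the Sum of ConsumedReadCapacityUnits for one period and that the period matches the 5-minute (300-second) scrape interval listed above; the function name is hypothetical.

```python
def read_capacity_utilization(consumed_sum: float,
                              provisioned_units: float,
                              period_s: int = 300) -> float:
    """Return consumed read capacity as a fraction of provisioned capacity.

    consumed_sum: Sum of ConsumedReadCapacityUnits over one period.
    provisioned_units: ProvisionedReadCapacityUnits (units per second).
    period_s: period length in seconds (assumed to be 300 here).
    """
    if provisioned_units <= 0 or period_s <= 0:
        raise ValueError("provisioned_units and period_s must be positive")
    consumed_per_second = consumed_sum / period_s
    return consumed_per_second / provisioned_units
```

For example, 45,000 units consumed in 300 seconds is 150 units per second; against 200 provisioned units that is a utilization of 0.75. The same arithmetic applies to the write capacity metrics.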

AWS/EBS

  • Scrape Intervals: 5 minutes
  • Out-of-the-box dashboard included
Metric | CloudWatch Metric | Statistics | Description
aws_ebs_info
aws_ebs_volume_read_bytes | VolumeReadBytes | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total bytes read from the EBS volume, useful for monitoring data retrieval activity.
aws_ebs_volume_write_bytes | VolumeWriteBytes | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total bytes written to the EBS volume, helping monitor data write operations.
aws_ebs_volume_read_ops | VolumeReadOps | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of read operations on the EBS volume, useful for tracking read performance.
aws_ebs_volume_write_ops | VolumeWriteOps | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of write operations on the EBS volume, helping to monitor write throughput.
aws_ebs_volume_total_read_time | VolumeTotalReadTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total time spent on read operations, useful for understanding read latency.
aws_ebs_volume_total_write_time | VolumeTotalWriteTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total time spent on write operations, helping to understand write latency.
aws_ebs_volume_idle_time | VolumeIdleTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of idle time for the EBS volume, useful for understanding periods of inactivity.
aws_ebs_volume_queue_length | VolumeQueueLength | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the length of the queue for I/O requests on the EBS volume, helping to identify potential performance bottlenecks.
aws_ebs_volume_throughput_percentage | VolumeThroughputPercentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the throughput percentage of the EBS volume, useful for ensuring optimal performance.
aws_ebs_volume_consumed_read_write_ops | VolumeConsumedReadWriteOps | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of read and write operations consumed, helping track IOPS utilization.
aws_ebs_burst_balance | BurstBalance | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the balance of burst credits available for burstable performance EBS volumes, helping manage performance spikes.
aws_ebs_byte_balance_percent | EBSByteBalance% | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_iobalance_percent | EBSIOBalance% | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_read_bytes | EBSReadBytes | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_read_ops | EBSReadOps | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_write_bytes | EBSWriteBytes | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_write_ops | EBSWriteOps | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_enable_copied_image_deprecation_completed | EnableCopiedImageDeprecationCompleted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_enable_copied_image_deprecation_failed | EnableCopiedImageDeprecationFailed | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_enable_image_deprecation_completed | EnableImageDeprecationCompleted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_enable_image_deprecation_failed | EnableImageDeprecationFailed | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_fast_snapshot_restore_credits_balance | FastSnapshotRestoreCreditsBalance | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_fast_snapshot_restore_credits_bucket_size | FastSnapshotRestoreCreditsBucketSize | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_images_copied_region_completed | ImagesCopiedRegionCompleted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_images_copied_region_deregister_completed | ImagesCopiedRegionDeregisterCompleted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_images_copied_region_deregistered_failed | ImagesCopiedRegionDeregisteredFailed | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_images_copied_region_failed | ImagesCopiedRegionFailed | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_images_copied_region_started | ImagesCopiedRegionStarted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_images_create_completed | ImagesCreateCompleted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_images_create_failed | ImagesCreateFailed | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_images_create_started | ImagesCreateStarted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_images_deregister_completed | ImagesDeregisterCompleted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_images_deregister_failed | ImagesDeregisterFailed | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_resources_targeted | ResourcesTargeted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshot_copy_bytes_transferred | SnapshotCopyBytesTransferred | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_copied_account_completed | SnapshotsCopiedAccountCompleted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_copied_account_delete_completed | SnapshotsCopiedAccountDeleteCompleted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_copied_account_delete_failed | SnapshotsCopiedAccountDeleteFailed | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_copied_account_failed | SnapshotsCopiedAccountFailed | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_copied_account_started | SnapshotsCopiedAccountStarted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_copied_region_completed | SnapshotsCopiedRegionCompleted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_copied_region_delete_completed | SnapshotsCopiedRegionDeleteCompleted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_copied_region_delete_failed | SnapshotsCopiedRegionDeleteFailed | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_copied_region_failed | SnapshotsCopiedRegionFailed | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_copied_region_started | SnapshotsCopiedRegionStarted | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_create_completedSnapshotsCreateCompletedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_create_failedSnapshotsCreateFailedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_create_startedSnapshotsCreateStartedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_delete_completedSnapshotsDeleteCompletedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_delete_failedSnapshotsDeleteFailedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_snapshots_shared_completedSnapshotsSharedCompletedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_volume_avg_read_latencyVolumeAvgReadLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_volume_avg_write_latencyVolumeAvgWriteLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_volume_iopsexceeded_checkVolumeIOPSExceededCheckMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_volume_stalled_iocheckVolumeStalledIOCheckMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ebs_volume_throughput_exceeded_checkVolumeThroughputExceededCheckMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9

AWS/EC2

  • Scrape Intervals: 5 minutes
  • Out of the box dashboard included
MetricCloudwatch MetricStatisticsDescription
aws_ec2_info
aws_ec2_cpuutilizationCPUUtilizationMaximum, Average, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the percentage of allocated compute capacity in use on the EC2 instance, useful for monitoring CPU load.
aws_ec2_network_inNetworkInAverage, Sum, Maximum, Minimum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the amount of data received by the EC2 instance, useful for monitoring inbound traffic.
aws_ec2_network_outNetworkOutAverage, Sum, Maximum, Minimum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the amount of data sent from the EC2 instance, helping track outbound traffic.
aws_ec2_network_packets_inNetworkPacketsInSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of network packets received, useful for understanding inbound network traffic patterns.
aws_ec2_network_packets_outNetworkPacketsOutSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of network packets sent, helping monitor outbound network activity.
aws_ec2_disk_read_bytesDiskReadBytesSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of bytes read from the instance’s storage, useful for tracking data retrieval performance.
aws_ec2_disk_write_bytesDiskWriteBytesSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of bytes written to the instance’s storage, helping to track storage write operations.
aws_ec2_disk_read_opsDiskReadOpsSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of read operations on the instance’s storage, useful for monitoring storage performance.
aws_ec2_disk_write_opsDiskWriteOpsSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of write operations on the instance’s storage, helping track write activity.
aws_ec2_status_check_failedStatusCheckFailedSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks whether the EC2 instance has failed the instance or system status checks, useful for identifying potential issues.
aws_ec2_status_check_failed_instanceStatusCheckFailed_InstanceSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Monitors whether the instance has failed the instance-level status checks, helping to detect internal instance issues.
aws_ec2_status_check_failed_systemStatusCheckFailed_SystemSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks failures in the system-level status checks, useful for identifying infrastructure issues impacting the instance.
aws_ec2_ebsiobalance_percentEBSIOBalance%Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ec2_ebsbyte_balance_percentEBSByteBalance%Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ec2_ebsread_opsEBSReadOpsSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of read operations on attached EBS volumes, useful for tracking storage read performance.
aws_ec2_ebswrite_opsEBSWriteOpsSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of write operations on attached EBS volumes, helping to monitor storage write activity.
aws_ec2_ebsread_bytesEBSReadBytesSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of bytes read from attached EBS volumes, useful for monitoring data retrieval performance.
aws_ec2_ebswrite_bytesEBSWriteBytesSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of bytes written to attached EBS volumes, helping to monitor data write performance.
aws_ec2_cpucredit_balanceCPUCreditBalanceMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ec2_cpucredit_usageCPUCreditUsageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ec2_cpusurplus_credit_balanceCPUSurplusCreditBalanceMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ec2_cpusurplus_credits_chargedCPUSurplusCreditsChargedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ec2_dedicated_host_cpuutilizationDedicatedHostCPUUtilizationMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ec2_metadata_no_tokenMetadataNoTokenMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ec2_status_check_failed_attached_ebsStatusCheckFailed_AttachedEBSMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
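
The Metric column in these tables appears to follow a regular pattern derived from the namespace and the CloudWatch metric name. The following Python sketch reproduces that pattern for illustration only. The function name `to_metric_name` is hypothetical, and the rules are inferred from the rows on this page rather than taken from the exporter's implementation. It does not handle rows where a leading namespace fragment is dropped from the metric name (for example, the `ESReporting...` metrics under AWS/ES).

```python
import re


def to_metric_name(namespace: str, cloudwatch_metric: str) -> str:
    """Approximate the metric naming pattern seen in the tables.

    Inferred from the rows on this page; not the official implementation.
    """
    # "AWS/EC2" -> "aws_ec2"
    prefix = namespace.lower().replace("/", "_")
    # "EBSIOBalance%" -> "EBSIOBalance_percent"
    name = cloudwatch_metric.replace("%", "_percent")
    # Insert "_" where a lowercase letter or digit is followed by an uppercase
    # letter. Runs of capitals stay together, so "CPUUtilization" is not split.
    name = re.sub(r"([a-z0-9])([A-Z])", r"\1_\2", name)
    # Replace remaining separators such as "." with "_"
    name = re.sub(r"[^A-Za-z0-9_]", "_", name)
    return f"{prefix}_{name.lower()}"


if __name__ == "__main__":
    for ns, metric in [
        ("AWS/EC2", "CPUUtilization"),
        ("AWS/EC2", "NetworkPacketsIn"),
        ("AWS/EC2", "EBSIOBalance%"),
        ("AWS/EBS", "VolumeStalledIOCheck"),
    ]:
        print(f"{metric} -> {to_metric_name(ns, metric)}")
```

A mapping like this can help when searching for the series that corresponds to a metric you already know from the CloudWatch console. Treat the table as the source of truth whenever the sketch and the table disagree.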

AWS/EC2Spot

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_ec2spot_info
aws_ec2spot_available_instance_pools_countAvailableInstancePoolsCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of instance pools available for Spot requests, useful for tracking availability.
aws_ec2spot_bids_submitted_for_capacityBidsSubmittedForCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ec2spot_eligible_instance_pool_countEligibleInstancePoolCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of eligible instance pools for Spot requests, useful for understanding Spot market options.
aws_ec2spot_fulfilled_capacityFulfilledCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the capacity fulfilled by Spot instances, helping monitor the success rate of Spot requests.
aws_ec2spot_max_percent_capacity_allocationMaxPercentCapacityAllocationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the maximum percent of capacity allocated, useful for understanding the allocation of Spot instances.
aws_ec2spot_pending_capacityPendingCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the pending Spot instance capacity, helping monitor Spot instance provisioning.
aws_ec2spot_percent_capacity_allocationPercentCapacityAllocationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the percentage of capacity allocated to Spot instances, useful for managing resource allocation.
aws_ec2spot_target_capacityTargetCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the target capacity for Spot instances, useful for monitoring Spot instance request goals.
aws_ec2spot_terminating_capacityTerminatingCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the capacity being terminated in Spot instances, helping track Spot instance lifecycle management.

AWS/ECR

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_ecr_repository_pull_countRepositoryPullCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of pulls from an ECR repository, useful for tracking container image usage.

AWS/ECS

  • Scrape Intervals: 5 minutes
  • Out of the box dashboard included
MetricCloudwatch MetricStatisticsDescription
aws_ecs_info
aws_ecs_cpureservationCPUReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the CPU reserved for ECS tasks, helping monitor resource reservation.
aws_ecs_cpuutilizationCPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the CPU utilization of ECS tasks, useful for tracking resource usage.
aws_ecs_gpureservationGPUReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks GPU reservation for ECS tasks, helping manage GPU resources.
aws_ecs_memory_reservationMemoryReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the memory reserved for ECS tasks, helping track memory resource allocation.
aws_ecs_memory_utilizationMemoryUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the memory utilization of ECS tasks, useful for monitoring memory resource consumption.

AWS/EFS

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_efs_info
aws_efs_burst_credit_balanceBurstCreditBalanceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the balance of burst credits for EFS, useful for managing performance bursts.
aws_efs_client_connectionsClientConnectionsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of client connections to EFS, useful for understanding file system usage.
aws_efs_data_read_iobytesDataReadIOBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the amount of data read from EFS, helping track read performance.
aws_efs_data_write_iobytesDataWriteIOBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of data written to EFS, helping monitor write performance.
aws_efs_metadata_iobytesMetadataIOBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the metadata operations on EFS, useful for tracking metadata-related I/O.
aws_efs_metered_iobytesMeteredIOBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of metered I/O operations, helping manage performance limits.
aws_efs_percent_iolimitPercentIOLimitAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the percentage of the I/O limit reached, useful for performance management.
aws_efs_permitted_throughputPermittedThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the allowed throughput for EFS, helping monitor throughput limits.
aws_efs_storage_bytesStorageBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total storage used by EFS, useful for managing storage capacity.
aws_efs_total_iobytesTotalIOBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total I/O operations, helping monitor overall file system performance.

AWS/ELB

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_elb_info
aws_elb_backend_connection_errorsBackendConnectionErrorsSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of connection errors between ELB and the backend instances, useful for identifying connection issues.
aws_elb_healthy_host_countHealthyHostCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of healthy backend instances, helping track instance health.
aws_elb_httpcode_backend_2_xxHTTPCode_Backend_2XXSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks successful responses (2XX) from the backend, useful for monitoring backend application performance.
aws_elb_httpcode_backend_3_xxHTTPCode_Backend_3XXSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Measures redirection responses (3XX) from the backend, helping monitor routing performance.
aws_elb_httpcode_backend_4_xxHTTPCode_Backend_4XXSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks client errors (4XX) from the backend, useful for identifying issues with client requests.
aws_elb_httpcode_backend_5_xxHTTPCode_Backend_5XXSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Monitors server errors (5XX) from the backend, helping track server-side issues.
aws_elb_httpcode_elb_4_xxHTTPCode_ELB_4XXSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Measures client errors (4XX) at the ELB level, useful for tracking errors handled by the ELB.
aws_elb_httpcode_elb_5_xxHTTPCode_ELB_5XXSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks server errors (5XX) at the ELB level, helping monitor ELB server-side performance.
aws_elb_latencyLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the latency of requests through the ELB, useful for tracking response times.
aws_elb_request_countRequestCountSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of requests handled by the ELB, useful for monitoring traffic levels.
aws_elb_spillover_countSpilloverCountSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of requests that were rejected due to lack of available resources.
aws_elb_surge_queue_lengthSurgeQueueLengthMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the length of the request queue, useful for monitoring traffic surges.
aws_elb_un_healthy_host_countUnHealthyHostCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of unhealthy backend instances, helping identify infrastructure issues.
aws_elb_estimated_albactive_connection_countEstimatedALBActiveConnectionCountMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_elb_estimated_albconsumed_lcusEstimatedALBConsumedLCUsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_elb_estimated_albnew_connection_countEstimatedALBNewConnectionCountMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_elb_estimated_processed_bytesEstimatedProcessedBytesMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9

AWS/ES

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_es_info
aws_es_2xx2xxAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks successful requests to the Elasticsearch service
aws_es_3xx3xxAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks redirection requests to the Elasticsearch service
aws_es_4xx4xxAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks client error responses from the Elasticsearch service
aws_es_5xx5xxAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks server error responses from the Elasticsearch service
aws_es_adanomaly_detectors_index_status_redADAnomalyDetectorsIndexStatus.redAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_adanomaly_detectors_index_status_index_existsADAnomalyDetectorsIndexStatusIndexExistsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks whether the anomaly detection index exists or not
aws_es_adanomaly_results_index_status_redADAnomalyResultsIndexStatus.redAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_adanomaly_results_index_status_index_existsADAnomalyResultsIndexStatusIndexExistsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks whether the anomaly results index exists or not
aws_es_adexecute_failure_countADExecuteFailureCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of times anomaly detection execution has failed
aws_es_adexecute_request_countADExecuteRequestCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of anomaly detection execution requests
aws_es_adhcexecute_failure_countADHCExecuteFailureCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of high cardinality anomaly detection execution failures
aws_es_adhcexecute_request_countADHCExecuteRequestCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of high cardinality anomaly detection execution requests
aws_es_admodels_checkpoint_index_status_redADModelsCheckpointIndexStatus.redAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_admodels_checkpoint_index_status_index_existsADModelsCheckpointIndexStatusIndexExistsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks whether the model checkpoint index exists
aws_es_adplugin_unhealthyADPluginUnhealthyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Indicates if the anomaly detection plugin is in an unhealthy state
aws_es_alerting_degradedAlertingDegradedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Indicates if the alerting feature is in a degraded state
aws_es_alerting_index_existsAlertingIndexExistsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks whether the alerting index exists
aws_es_alerting_index_status_greenAlertingIndexStatus.greenAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_alerting_index_status_redAlertingIndexStatus.redAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_alerting_index_status_yellowAlertingIndexStatus.yellowAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_alerting_nodes_not_on_scheduleAlertingNodesNotOnScheduleAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of nodes not on schedule for alerting
aws_es_alerting_nodes_on_scheduleAlertingNodesOnScheduleAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of nodes on schedule for alerting
aws_es_alerting_scheduled_job_enabledAlertingScheduledJobEnabledAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Indicates if alerting scheduled jobs are enabled
aws_es_asynchronous_search_cancelledAsynchronousSearchCancelledAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of asynchronous search requests that were canceled
aws_es_asynchronous_search_completion_rateAsynchronousSearchCompletionRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the rate of successful asynchronous search completions
aws_es_asynchronous_search_failure_rateAsynchronousSearchFailureRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the rate of failed asynchronous search requests
aws_es_asynchronous_search_initialized_rateAsynchronousSearchInitializedRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the rate of initialized asynchronous search requests
aws_es_asynchronous_search_max_running_timeAsynchronousSearchMaxRunningTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the maximum time taken by asynchronous search requests
aws_es_asynchronous_search_persist_failed_rateAsynchronousSearchPersistFailedRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the rate of failed attempts to persist asynchronous search results
aws_es_asynchronous_search_persist_rateAsynchronousSearchPersistRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the rate of successful attempts to persist asynchronous search results
aws_es_asynchronous_search_rejectedAsynchronousSearchRejectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of asynchronous search requests that were rejected
aws_es_asynchronous_search_running_currentAsynchronousSearchRunningCurrentAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of currently running asynchronous search requests
aws_es_asynchronous_search_store_healthAsynchronousSearchStoreHealthAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the health of the store for asynchronous search
aws_es_asynchronous_search_store_sizeAsynchronousSearchStoreSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the size of the asynchronous search store
aws_es_asynchronous_search_stored_response_countAsynchronousSearchStoredResponseCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of responses stored for asynchronous search
aws_es_asynchronous_search_submission_rateAsynchronousSearchSubmissionRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_auto_follow_leader_call_failureAutoFollowLeaderCallFailureAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failures when trying to call the leader for cross-cluster replication
aws_es_auto_follow_num_failed_start_replicationAutoFollowNumFailedStartReplicationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failed attempts to start cross-cluster replication
aws_es_auto_follow_num_success_start_replicationAutoFollowNumSuccessStartReplicationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful attempts to start cross-cluster replication
aws_es_auto_tune_changes_history_heap_sizeAutoTuneChangesHistoryHeapSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the heap size usage history for auto-tune changes
aws_es_auto_tune_changes_history_jvmyoung_gen_argsAutoTuneChangesHistoryJVMYoungGenArgsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks JVM young generation arguments for auto-tune changes
aws_es_auto_tune_failedAutoTuneFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failed auto-tune attempts
aws_es_auto_tune_succeededAutoTuneSucceededAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful auto-tune attempts
aws_es_auto_tune_valueAutoTuneValueAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the value of auto-tune changes
aws_es_automated_snapshot_failureAutomatedSnapshotFailureAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failures in automated snapshots
aws_es_avg_point_in_time_alive_timeAvgPointInTimeAliveTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the average lifetime of point-in-time snapshots
aws_es_burst_balanceBurstBalanceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the burst balance for the service
aws_es_cpucredit_balanceCPUCreditBalanceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the balance of CPU credits for the nodes
aws_es_cpuutilizationCPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the CPU utilization of the nodes
aws_es_cluster_index_writes_blockedClusterIndexWritesBlockedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks whether index writes are blocked at the cluster level
aws_es_cluster_status_greenClusterStatus.greenAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_cluster_status_redClusterStatus.redAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_cluster_status_yellowClusterStatus.yellowAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_cluster_used_spaceClusterUsedSpaceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of used storage space in the cluster
aws_es_cold_storage_space_utilizationColdStorageSpaceUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the storage utilization of cold data
aws_es_cold_to_warm_migration_failure_countColdToWarmMigrationFailureCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failures during migration from cold to warm storage
aws_es_cold_to_warm_migration_latencyColdToWarmMigrationLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency of migration from cold to warm storage
aws_es_cold_to_warm_migration_queue_sizeColdToWarmMigrationQueueSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the queue size for migration from cold to warm storage
aws_es_cold_to_warm_migration_success_countColdToWarmMigrationSuccessCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful migrations from cold to warm storage
aws_es_coordinating_write_rejectedCoordinatingWriteRejectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of rejected coordinating node write requests
aws_es_cross_cluster_inbound_replication_requestsCrossClusterInboundReplicationRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of inbound replication requests for cross-cluster replication
aws_es_cross_cluster_inbound_requestsCrossClusterInboundRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of inbound requests for cross-cluster replication
aws_es_cross_cluster_outbound_connectionsCrossClusterOutboundConnectionsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of outbound connections for cross-cluster replication
aws_es_cross_cluster_outbound_replication_requestsCrossClusterOutboundReplicationRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of outbound replication requests for cross-cluster replication
aws_es_cross_cluster_outbound_requestsCrossClusterOutboundRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of outbound requests for cross-cluster replication
aws_es_current_point_in_timeCurrentPointInTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the current point in time (snapshot) available in Elasticsearch
aws_es_data_nodesDataNodesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of data nodes in the Elasticsearch cluster
aws_es_data_nodes_shards_activeDataNodesShards.activeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_data_nodes_shards_initializingDataNodesShards.initializingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_data_nodes_shards_relocatingDataNodesShards.relocatingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_data_nodes_shards_unassignedDataNodesShards.unassignedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_deleted_documentsDeletedDocumentsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of deleted documents from the Elasticsearch cluster
aws_es_disk_queue_depthDiskQueueDepthAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the depth of the disk queue
aws_es_reporting_failed_request_sys_err_countESReportingFailedRequestSysErrCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failed reporting requests due to system errors
aws_es_reporting_failed_request_user_err_countESReportingFailedRequestUserErrCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failed reporting requests due to user errors
aws_es_reporting_request_countESReportingRequestCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of reporting requests submitted to Elasticsearch
aws_es_reporting_success_countESReportingSuccessCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful reporting requests
aws_es_elasticsearch_requestsElasticsearchRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of requests to Elasticsearch
aws_es_follower_check_pointFollowerCheckPointAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the checkpoint of a follower node in cross-cluster replication
aws_es_free_storage_spaceFreeStorageSpaceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the available storage space in the Elasticsearch cluster
aws_es_has_active_point_in_timeHasActivePointInTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Indicates whether there is an active point-in-time snapshot
aws_es_has_used_point_in_timeHasUsedPointInTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Indicates whether the point-in-time snapshot has been used
aws_es_hot_storage_space_utilizationHotStorageSpaceUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the storage utilization of hot data
aws_es_hot_to_warm_migration_failure_countHotToWarmMigrationFailureCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failures during migration from hot to warm storage
aws_es_hot_to_warm_migration_force_merge_latencyHotToWarmMigrationForceMergeLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency of force merging during migration from hot to warm storage
aws_es_hot_to_warm_migration_processing_latencyHotToWarmMigrationProcessingLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency of processing migration from hot to warm storage
aws_es_hot_to_warm_migration_queue_sizeHotToWarmMigrationQueueSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the queue size for migration from hot to warm storage
aws_es_hot_to_warm_migration_snapshot_latencyHotToWarmMigrationSnapshotLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency of snapshotting during migration from hot to warm storage
aws_es_hot_to_warm_migration_success_countHotToWarmMigrationSuccessCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful migrations from hot to warm storage
aws_es_hot_to_warm_migration_success_latencyHotToWarmMigrationSuccessLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency of successful migrations from hot to warm storage
aws_es_indexing_latencyIndexingLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_indexing_rateIndexingRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_invalid_host_header_requestsInvalidHostHeaderRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of requests with invalid host headers
aws_es_iops_throttleIopsThrottleAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks throttling of input/output operations
aws_es_jvmgcold_collection_countJVMGCOldCollectionCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of garbage collection events in the old generation of JVM
aws_es_jvmgcold_collection_timeJVMGCOldCollectionTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the time spent in garbage collection in the old generation of JVM
aws_es_jvmgcyoung_collection_countJVMGCYoungCollectionCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of garbage collection events in the young generation of JVM
aws_es_jvmgcyoung_collection_timeJVMGCYoungCollectionTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the time spent in garbage collection in the young generation of JVM
aws_es_jvmmemory_pressureJVMMemoryPressureAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks memory pressure on the JVM used by Elasticsearch
aws_es_kmskey_errorKMSKeyErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_kmskey_inaccessibleKMSKeyInaccessibleAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of times a KMS key is inaccessible for the Elasticsearch cluster
aws_es_knncache_capacity_reachedKNNCacheCapacityReachedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks when the KNN cache capacity is reached
aws_es_knncircuit_breaker_triggeredKNNCircuitBreakerTriggeredAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks when the KNN circuit breaker is triggered
aws_es_knneviction_countKNNEvictionCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of evictions from the KNN cache
aws_es_knngraph_index_errorsKNNGraphIndexErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks errors during KNN graph indexing
aws_es_knngraph_index_requestsKNNGraphIndexRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of KNN graph index requests
aws_es_knngraph_memory_usageKNNGraphMemoryUsageAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks memory usage by the KNN graph
aws_es_knngraph_query_errorsKNNGraphQueryErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks errors during KNN graph queries
aws_es_knngraph_query_requestsKNNGraphQueryRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of KNN graph query requests
aws_es_knnhit_countKNNHitCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of hits returned by KNN queries
aws_es_knnload_exception_countKNNLoadExceptionCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of exceptions during KNN data load operations
aws_es_knnload_success_countKNNLoadSuccessCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful KNN data load operations
aws_es_knnmiss_countKNNMissCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of KNN cache misses
aws_es_knnquery_requestsKNNQueryRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of KNN queries
aws_es_knnscript_compilation_errorsKNNScriptCompilationErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of errors during KNN script compilation
aws_es_knnscript_compilationsKNNScriptCompilationsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of KNN script compilations
aws_es_knnscript_query_errorsKNNScriptQueryErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks errors during KNN script queries
aws_es_knnscript_query_requestsKNNScriptQueryRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of KNN script queries
aws_es_knntotal_load_timeKNNTotalLoadTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total load time for KNN operations
aws_es_kibana_concurrent_connectionsKibanaConcurrentConnectionsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of concurrent Kibana connections
aws_es_kibana_healthy_nodesKibanaHealthyNodesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of healthy Kibana nodes
aws_es_kibana_heap_totalKibanaHeapTotalAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total heap size of Kibana
aws_es_kibana_heap_usedKibanaHeapUsedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the heap size used by Kibana
aws_es_kibana_heap_utilizationKibanaHeapUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the heap utilization of Kibana
aws_es_kibana_os1_minute_loadKibanaOS1MinuteLoadAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the 1-minute load average of the Kibana node’s operating system
aws_es_kibana_reporting_failed_request_sys_err_countKibanaReportingFailedRequestSysErrCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failed Kibana reporting requests due to system errors
aws_es_kibana_reporting_failed_request_user_err_countKibanaReportingFailedRequestUserErrCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failed Kibana reporting requests due to user errors
aws_es_kibana_reporting_request_countKibanaReportingRequestCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of Kibana reporting requests
aws_es_kibana_reporting_success_countKibanaReportingSuccessCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful Kibana reporting requests
aws_es_kibana_request_totalKibanaRequestTotalAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of requests sent to Kibana
aws_es_kibana_response_times_max_in_millisKibanaResponseTimesMaxInMillisAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the maximum response time of Kibana requests in milliseconds
aws_es_ltrfeature_memory_usage_in_bytesLTRFeatureMemoryUsageInBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks memory usage by LTR features in bytes
aws_es_ltrfeatureset_memory_usage_in_bytesLTRFeaturesetMemoryUsageInBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks memory usage by LTR feature sets in bytes
aws_es_ltrmemory_usageLTRMemoryUsageAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks overall memory usage by LTR features
aws_es_ltrmodel_memory_usage_in_bytesLTRModelMemoryUsageInBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks memory usage by LTR models in bytes
aws_es_ltrrequest_error_countLTRRequestErrorCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of errors in LTR requests
aws_es_ltrrequest_total_countLTRRequestTotalCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of LTR requests
aws_es_ltrstatus_redLTRStatus.redAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_leader_check_pointLeaderCheckPointAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the checkpoint of the leader node in cross-cluster replication
aws_es_master_cpucredit_balanceMasterCPUCreditBalanceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_master_cpuutilizationMasterCPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_master_free_storage_spaceMasterFreeStorageSpaceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_master_jvmmemory_pressureMasterJVMMemoryPressureAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_master_old_gen_jvmmemory_pressureMasterOldGenJVMMemoryPressureAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_master_reachable_from_nodeMasterReachableFromNodeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_master_sys_memory_utilizationMasterSysMemoryUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_max_provisioned_throughputMaxProvisionedThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_nodesNodesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_old_gen_jvmmemory_pressureOldGenJVMMemoryPressureAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_open_search_dashboards_concurrent_connectionsOpenSearchDashboardsConcurrentConnectionsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_open_search_dashboards_healthy_nodeOpenSearchDashboardsHealthyNodeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_open_search_dashboards_healthy_nodesOpenSearchDashboardsHealthyNodesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_open_search_dashboards_heap_totalOpenSearchDashboardsHeapTotalAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_open_search_dashboards_heap_usedOpenSearchDashboardsHeapUsedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_open_search_dashboards_heap_utilizationOpenSearchDashboardsHeapUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_open_search_dashboards_os1_minute_loadOpenSearchDashboardsOS1MinuteLoadAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_open_search_dashboards_request_totalOpenSearchDashboardsRequestTotalAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_open_search_dashboards_response_times_max_in_millisOpenSearchDashboardsResponseTimesMaxInMillisAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_open_search_requestsOpenSearchRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_opensearch_dashboards_reporting_failed_request_sys_err_countOpensearchDashboardsReportingFailedRequestSysErrCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_opensearch_dashboards_reporting_failed_request_user_err_countOpensearchDashboardsReportingFailedRequestUserErrCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_opensearch_dashboards_reporting_request_countOpensearchDashboardsReportingRequestCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_opensearch_dashboards_reporting_success_countOpensearchDashboardsReportingSuccessCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_pplfailed_request_count_by_cus_errPPLFailedRequestCountByCusErrAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of PPL failed requests due to customer errors
aws_es_pplfailed_request_count_by_sys_errPPLFailedRequestCountBySysErrAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of PPL failed requests due to system errors
aws_es_pplrequest_countPPLRequestCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of PPL requests
aws_es_primary_write_rejectedPrimaryWriteRejectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of rejected primary write requests
aws_es_read_iopsReadIOPSAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks input/output operations per second for reads
aws_es_read_iopsmicro_burstingReadIOPSMicroBurstingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks micro-bursting of input/output operations for reads
aws_es_read_latencyReadLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency of read operations in the Elasticsearch cluster
aws_es_read_throughputReadThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the throughput of read operations
aws_es_read_throughput_micro_burstingReadThroughputMicroBurstingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks micro-bursting of read throughput
aws_es_remote_storage_used_spaceRemoteStorageUsedSpaceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of used space in remote storage
aws_es_remote_storage_write_rejectedRemoteStorageWriteRejectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of rejected write operations in remote storage
aws_es_replica_write_rejectedReplicaWriteRejectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of rejected replica write requests
aws_es_replication_num_bootstrapping_indicesReplicationNumBootstrappingIndicesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of indices in the bootstrapping state for replication
aws_es_replication_num_failed_indicesReplicationNumFailedIndicesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failed replication indices
aws_es_replication_num_paused_indicesReplicationNumPausedIndicesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of paused replication indices
aws_es_replication_num_syncing_indicesReplicationNumSyncingIndicesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of replication indices currently syncing
aws_es_replication_rateReplicationRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the rate of replication in Elasticsearch
aws_es_sqldefault_cursor_request_countSQLDefaultCursorRequestCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of default SQL cursor requests
aws_es_sqlfailed_request_count_by_cus_errSQLFailedRequestCountByCusErrAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of SQL failed requests due to customer errors
aws_es_sqlfailed_request_count_by_sys_errSQLFailedRequestCountBySysErrAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of SQL failed requests due to system errors
aws_es_sqlrequest_countSQLRequestCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of SQL requests
aws_es_sqlunhealthySQLUnhealthyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks whether the SQL plugin is in an unhealthy state
aws_es_search_latencySearchLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency of search operations in the Elasticsearch cluster
aws_es_search_rateSearchRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the rate of search operations
aws_es_search_shard_task_cancelledSearchShardTaskCancelledAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of search shard tasks that were canceled
aws_es_search_task_cancelledSearchTaskCancelledAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of canceled search tasks
aws_es_searchable_documentsSearchableDocumentsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of searchable documents
aws_es_segment_countSegmentCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of segments in the Elasticsearch cluster
aws_es_shards_activeShards.activeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_shards_active_primaryShards.activePrimaryAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_shards_delayed_unassignedShards.delayedUnassignedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_shards_initializingShards.initializingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_shards_relocatingShards.relocatingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_shards_unassignedShards.unassignedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_sys_memory_utilizationSysMemoryUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks system memory utilization
aws_es_threadpool_bulk_queueThreadpoolBulkQueueAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the size of the bulk thread pool queue
aws_es_threadpool_bulk_rejectedThreadpoolBulkRejectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of bulk thread pool tasks that were rejected
aws_es_threadpool_bulk_threadsThreadpoolBulkThreadsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of active threads in the bulk thread pool
aws_es_threadpool_force_merge_queueThreadpoolForce_mergeQueueAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the size of the force merge thread pool queue
aws_es_threadpool_force_merge_rejectedThreadpoolForce_mergeRejectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of force merge thread pool tasks that were rejected
aws_es_threadpool_force_merge_threadsThreadpoolForce_mergeThreadsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of active threads in the force merge thread pool
aws_es_threadpool_index_queueThreadpoolIndexQueueAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the size of the index thread pool queue
aws_es_threadpool_index_rejectedThreadpoolIndexRejectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of index thread pool tasks that were rejected
aws_es_threadpool_index_threadsThreadpoolIndexThreadsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of active threads in the index thread pool
aws_es_threadpool_search_queueThreadpoolSearchQueueAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the size of the search thread pool queue
aws_es_threadpool_search_rejectedThreadpoolSearchRejectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of search thread pool tasks that were rejected
aws_es_threadpool_search_threadsThreadpoolSearchThreadsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of active threads in the search thread pool
aws_es_threadpool_write_queueThreadpoolWriteQueueAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the size of the write thread pool queue
aws_es_threadpool_write_rejectedThreadpoolWriteRejectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of write thread pool tasks that were rejected
aws_es_threadpool_write_threadsThreadpoolWriteThreadsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of active threads in the write thread pool
aws_es_threadpoolsql_worker_queueThreadpoolsql-workerQueueAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_threadpoolsql_worker_rejectedThreadpoolsql-workerRejectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_threadpoolsql_worker_threadsThreadpoolsql-workerThreadsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_throughput_throttleThroughputThrottleAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks throttling of throughput in the Elasticsearch cluster
aws_es_total_point_in_timeTotalPointInTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of point-in-time snapshots
aws_es_warm_cpuutilizationWarmCPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the CPU utilization of warm data nodes
aws_es_warm_free_storage_spaceWarmFreeStorageSpaceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the available storage space in warm data nodes
aws_es_warm_jvmgcold_collection_countWarmJVMGCOldCollectionCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of garbage collection events in the old generation of JVM on warm data nodes
aws_es_warm_jvmgcyoung_collection_countWarmJVMGCYoungCollectionCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of garbage collection events in the young generation of JVM on warm data nodes
aws_es_warm_jvmgcyoung_collection_timeWarmJVMGCYoungCollectionTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the time spent in garbage collection in the young generation of JVM on warm data nodes
aws_es_warm_jvmmemory_pressureWarmJVMMemoryPressureAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks memory pressure on warm data nodes
aws_es_warm_old_gen_jvmmemory_pressureWarmOldGenJVMMemoryPressureAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks old generation JVM memory pressure on warm data nodes
aws_es_warm_search_latencyWarmSearchLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency of search operations on warm data nodes
aws_es_warm_search_rateWarmSearchRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the rate of search operations on warm data nodes
aws_es_warm_searchable_documentsWarmSearchableDocumentsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of searchable documents on warm data nodes
aws_es_warm_storage_space_utilizationWarmStorageSpaceUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks storage space utilization on warm data nodes
aws_es_warm_sys_memory_utilizationWarmSysMemoryUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks system memory utilization on warm data nodes
aws_es_warm_threadpool_search_queueWarmThreadpoolSearchQueueAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the size of the search thread pool queue on warm data nodes
aws_es_warm_threadpool_search_rejectedWarmThreadpoolSearchRejectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of search thread pool tasks that were rejected on warm data nodes
aws_es_warm_threadpool_search_threadsWarmThreadpoolSearchThreadsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of active threads in the search thread pool on warm data nodes
aws_es_warm_to_cold_migration_failure_countWarmToColdMigrationFailureCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failures during migration from warm to cold storage
aws_es_warm_to_cold_migration_latencyWarmToColdMigrationLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency of migration from warm to cold storage
aws_es_warm_to_cold_migration_queue_sizeWarmToColdMigrationQueueSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the queue size for migration from warm to cold storage
aws_es_warm_to_cold_migration_success_countWarmToColdMigrationSuccessCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful migrations from warm to cold storage
aws_es_warm_to_hot_migration_queue_sizeWarmToHotMigrationQueueSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the queue size for migration from warm to hot storage
aws_es_write_iopsWriteIOPSAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_es_write_iopsmicro_burstingWriteIOPSMicroBurstingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks micro-bursting of input/output operations for writes
aws_es_write_latencyWriteLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency of write operations in the Elasticsearch cluster
aws_es_write_throughputWriteThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the throughput of write operations
aws_es_write_throughput_micro_burstingWriteThroughputMicroBurstingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks micro-bursting of write throughput
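
The exported metric names in the table above appear to follow a regular pattern derived from the CloudWatch namespace and metric name. The following Python sketch reproduces that pattern for the rows shown here. It is inferred from the table, not taken from the exporter's source, and the function names `to_snake` and `exported_name` are hypothetical helpers introduced for illustration.

```python
import re


def to_snake(name: str) -> str:
    """Return a lowercase snake_case form of a CloudWatch-style name."""
    # Insert "_" wherever a lowercase letter or digit is followed by an
    # uppercase letter. Runs of capitals such as "JVMGC" stay together.
    split = re.sub(r"([a-z0-9])([A-Z])", r"\1_\2", name)
    # Replace separators such as ".", "-" and "/" with "_".
    cleaned = re.sub(r"[^A-Za-z0-9]+", "_", split)
    return cleaned.lower().strip("_")


def exported_name(namespace: str, cloudwatch_metric: str) -> str:
    """Build the exported name, e.g. ("AWS/ES", "DataNodes") -> "aws_es_data_nodes"."""
    # Lowercase first so "ElastiCache" is not split into "elasti_cache".
    prefix = to_snake(namespace.lower())
    service = prefix.split("_")[-1]
    metric = to_snake(cloudwatch_metric)
    # Assumption: a leading repeat of the service name is dropped, which is
    # how "ESReportingRequestCount" becomes "reporting_request_count".
    if metric.startswith(service):
        metric = metric[len(service):].lstrip("_")
    return f"{prefix}_{metric}"


if __name__ == "__main__":
    print(exported_name("AWS/ES", "JVMGCOldCollectionCount"))    # -> aws_es_jvmgcold_collection_count
    print(exported_name("AWS/ES", "Threadpoolsql-workerQueue"))  # -> aws_es_threadpoolsql_worker_queue
    print(exported_name("AWS/ES", "ESReportingRequestCount"))    # -> aws_es_reporting_request_count
```

This explains why acronym runs are not separated in names such as aws_es_jvmgcold_collection_count and aws_es_read_iopsmicro_bursting: an underscore is only inserted at a lowercase-to-uppercase or digit-to-uppercase boundary. The prefix-stripping step is a simplification and may not match the exporter for every namespace.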

AWS/ElastiCache

  • Scrape Intervals: 5 minutes
| Metric | CloudWatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_elasticache_info | | | |
| aws_elasticache_active_defrag_hits | ActiveDefragHits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of active defragmentation hits in ElastiCache |
| aws_elasticache_authentication_failures | AuthenticationFailures | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of failed authentication attempts |
| aws_elasticache_bytes_read_from_disk | BytesReadFromDisk | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of bytes read from disk in the ElastiCache cluster |
| aws_elasticache_bytes_read_into_memcached | BytesReadIntoMemcached | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of bytes read into Memcached |
| aws_elasticache_bytes_used_for_cache | BytesUsedForCache | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total amount of memory used for cache |
| aws_elasticache_bytes_used_for_cache_items | BytesUsedForCacheItems | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the memory used by items in cache |
| aws_elasticache_bytes_used_for_hash | BytesUsedForHash | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks memory used for hash tables in the cache |
| aws_elasticache_bytes_used_for_memory_db | BytesUsedForMemoryDB | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors memory usage for MemoryDB in ElastiCache |
| aws_elasticache_bytes_written_out_from_memcached | BytesWrittenOutFromMemcached | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of bytes written out from Memcached |
| aws_elasticache_bytes_written_to_disk | BytesWrittenToDisk | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of bytes written to disk in the ElastiCache cluster |
| aws_elasticache_cpucredit_balance | CPUCreditBalance | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the balance of CPU credits for burstable instance types |
| aws_elasticache_cpucredit_usage | CPUCreditUsage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors CPU credit usage for burstable instance types |
| aws_elasticache_cpuutilization | CPUUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the CPU utilization of the ElastiCache instance |
| aws_elasticache_cache_hit_rate | CacheHitRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the cache hit rate, indicating how often requested data is found in cache |
| aws_elasticache_cache_hits | CacheHits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total number of cache hits |
| aws_elasticache_cache_misses | CacheMisses | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of cache misses, when requested data is not found in cache |
| aws_elasticache_cas_badval | CasBadval | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of CAS requests where the supplied CAS value did not match the stored value |
| aws_elasticache_cas_hits | CasHits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of successful CAS operations |
| aws_elasticache_cas_misses | CasMisses | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of CAS requests where the key was not found |
| aws_elasticache_channel_authorization_failures | ChannelAuthorizationFailures | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of channel authorization failures |
| aws_elasticache_cluster_based_cmds | ClusterBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of cluster-based commands executed |
| aws_elasticache_cluster_based_cmds_latency | ClusterBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the latency of cluster-based commands |
| aws_elasticache_cmd_config_get | CmdConfigGet | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of configuration GET commands executed |
| aws_elasticache_cmd_config_set | CmdConfigSet | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of configuration SET commands executed |
| aws_elasticache_cmd_flush | CmdFlush | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of flush commands executed in the ElastiCache cluster |
| aws_elasticache_cmd_get | CmdGet | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of GET commands executed in the cache |
| aws_elasticache_cmd_set | CmdSet | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of SET commands executed in the cache |
| aws_elasticache_cmd_touch | CmdTouch | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of touch commands executed in the cache |
| aws_elasticache_command_authorization_failures | CommandAuthorizationFailures | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of command authorization failures in the ElastiCache cluster |
| aws_elasticache_curr_config | CurrConfig | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the current number of configurations stored |
| aws_elasticache_curr_connections | CurrConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the current number of open connections to the ElastiCache instance |
| aws_elasticache_curr_items | CurrItems | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the current number of items in the cache |
| aws_elasticache_curr_volatile_items | CurrVolatileItems | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of volatile items in the cache |
| aws_elasticache_db0_average_ttl | DB0AverageTTL | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the average time-to-live (TTL) of items in the cache |
| aws_elasticache_database_capacity_usage_counted_for_evict_percentage | DatabaseCapacityUsageCountedForEvictPercentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_database_capacity_usage_percentage | DatabaseCapacityUsagePercentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the overall percentage of database capacity usage |
| aws_elasticache_database_memory_usage_counted_for_evict_percentage | DatabaseMemoryUsageCountedForEvictPercentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_database_memory_usage_percentage | DatabaseMemoryUsagePercentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the overall memory usage percentage in the ElastiCache cluster |
| aws_elasticache_decr_hits | DecrHits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of successful DECR (decrement) operations |
| aws_elasticache_decr_misses | DecrMisses | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of DECR requests where the key was not found |
| aws_elasticache_delete_hits | DeleteHits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of successful DELETE operations |
| aws_elasticache_delete_misses | DeleteMisses | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of DELETE requests where the key was not found |
| aws_elasticache_elasti_cache_processing_units | ElastiCacheProcessingUnits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_engine_cpuutilization | EngineCPUUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the CPU utilization of the ElastiCache engine |
| aws_elasticache_eval_based_cmds | EvalBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of EVAL-based commands executed in the cache |
| aws_elasticache_eval_based_cmds_ecpus | EvalBasedCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_eval_based_cmds_latency | EvalBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the latency of EVAL-based commands in the cache |
| aws_elasticache_evicted_unfetched | EvictedUnfetched | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of items evicted before being fetched |
| aws_elasticache_evictions | Evictions | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of evictions in the cache |
| aws_elasticache_expired_unfetched | ExpiredUnfetched | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of items that expired before being fetched |
| aws_elasticache_freeable_memory | FreeableMemory | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the amount of free memory available in the ElastiCache cluster |
| aws_elasticache_geo_spatial_based_cmds | GeoSpatialBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of geospatial commands executed |
| aws_elasticache_geo_spatial_based_cmds_ecpus | GeoSpatialBasedCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_geo_spatial_based_cmds_latency | GeoSpatialBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the latency of geospatial commands |
| aws_elasticache_get_hits | GetHits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of successful GET operations in the cache |
| aws_elasticache_get_misses | GetMisses | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of GET requests where the key was not found |
| aws_elasticache_get_type_cmds | GetTypeCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of GET-type commands executed |
| aws_elasticache_get_type_cmds_ecpus | GetTypeCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_get_type_cmds_latency | GetTypeCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the latency of GET-type commands executed |
| aws_elasticache_global_datastore_replication_lag | GlobalDatastoreReplicationLag | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_hash_based_cmds | HashBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_hash_based_cmds_ecpus | HashBasedCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_hash_based_cmds_latency | HashBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_hyper_log_log_based_cmds | HyperLogLogBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_hyper_log_log_based_cmds_ecpus | HyperLogLogBasedCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_hyper_log_log_based_cmds_latency | HyperLogLogBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_iam_authentication_expirations | IamAuthenticationExpirations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_iam_authentication_throttling | IamAuthenticationThrottling | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_incr_hits | IncrHits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_incr_misses | IncrMisses | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_is_master | IsMaster | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_is_primary | IsPrimary | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_json_based_cmds | JsonBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_json_based_cmds_ecpus | JsonBasedCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_json_based_cmds_latency | JsonBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_json_based_get_cmds | JsonBasedGetCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_json_based_get_cmds_ecpus | JsonBasedGetCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_json_based_set_cmds | JsonBasedSetCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_json_based_set_cmds_ecpus | JsonBasedSetCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_key_authorization_failures | KeyAuthorizationFailures | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_key_based_cmds | KeyBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_key_based_cmds_ecpus | KeyBasedCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_key_based_cmds_latency | KeyBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_keys_tracked | KeysTracked | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_keyspace_hits | KeyspaceHits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_keyspace_misses | KeyspaceMisses | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_list_based_cmds | ListBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_list_based_cmds_ecpus | ListBasedCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_list_based_cmds_latency | ListBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_master_link_health_status | MasterLinkHealthStatus | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_max_replication_throughput | MaxReplicationThroughput | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_memory_fragmentation_ratio | MemoryFragmentationRatio | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_bandwidth_in_allowance_exceeded | NetworkBandwidthInAllowanceExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_bandwidth_out_allowance_exceeded | NetworkBandwidthOutAllowanceExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_bytes_in | NetworkBytesIn | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_bytes_out | NetworkBytesOut | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_conntrack_allowance_exceeded | NetworkConntrackAllowanceExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_link_local_allowance_exceeded | NetworkLinkLocalAllowanceExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_max_bytes_in | NetworkMaxBytesIn | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_max_bytes_out_ | NetworkMaxBytesOut | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_max_packets_in | NetworkMaxPacketsIn | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_max_packets_out | NetworkMaxPacketsOut | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_packets_in | NetworkPacketsIn | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_packets_out | NetworkPacketsOut | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_network_packets_per_second_allowance_exceeded | NetworkPacketsPerSecondAllowanceExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_new_connections | NewConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_new_items | NewItems | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_non_key_type_cmds | NonKeyTypeCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_non_key_type_cmds_ecpus | NonKeyTypeCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_num_items_read_from_disk | NumItemsReadFromDisk | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_num_items_written_to_disk | NumItemsWrittenToDisk | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_primary_link_health_status | PrimaryLinkHealthStatus | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_pub_sub_based_cmds | PubSubBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_pub_sub_based_cmds_ecpus | PubSubBasedCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_pub_sub_based_cmds_latency | PubSubBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_reclaimed | Reclaimed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_replication_bytes | ReplicationBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_replication_delayed_write_commands | ReplicationDelayedWriteCommands | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_replication_lag | ReplicationLag | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_save_in_progress | SaveInProgress | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_search_based_cmds | SearchBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_search_based_get_cmds | SearchBasedGetCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_search_based_set_cmds | SearchBasedSetCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_search_number_of_indexed_keys | SearchNumberOfIndexedKeys | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_search_number_of_indexes | SearchNumberOfIndexes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_search_total_index_size | SearchTotalIndexSize | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_set_based_cmds | SetBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_set_based_cmds_ecpus | SetBasedCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_set_based_cmds_latency | SetBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_set_type_cmds | SetTypeCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_set_type_cmds_ecpus | SetTypeCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_set_type_cmds_latency | SetTypeCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_slabs_moved | SlabsMoved | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_sorted_set_based_cmds | SortedSetBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_sorted_set_based_cmds_ecpus | SortedSetBasedCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_sorted_set_based_cmds_latency | SortedSetBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_stream_based_cmds | StreamBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_stream_based_cmds_ecpus | StreamBasedCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_stream_based_cmds_latency | StreamBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_string_based_cmds | StringBasedCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_string_based_cmds_ecpus | StringBasedCmdsECPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_string_based_cmds_latency | StringBasedCmdsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_successful_read_request_latency | SuccessfulReadRequestLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_successful_write_request_latency | SuccessfulWriteRequestLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_swap_usage | SwapUsage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_throttled_cmds | ThrottledCmds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_total_cmds_count | TotalCmdsCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_touch_hits | TouchHits | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_touch_misses | TouchMisses | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_traffic_management_active | TrafficManagementActive | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticache_unused_memory | UnusedMemory | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
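
As a rough illustration of how two of the ElastiCache counters relate to a third, the ratio that CacheHitRate describes can be approximated from CacheHits and CacheMisses measured over the same interval. This is a minimal sketch: the function name `cache_hit_ratio` and the choice to return `None` for an interval with no lookups are assumptions made for the example, not part of the CloudWatch integration.

```python
from typing import Optional


def cache_hit_ratio(hits: float, misses: float) -> Optional[float]:
    """Return hits / (hits + misses) as a fraction between 0 and 1.

    Returns None when there were no lookups in the interval, because the
    ratio is undefined in that case (an assumption of this sketch).
    """
    total = hits + misses
    if total == 0:
        return None
    return hits / total
```

The function returns a fraction rather than a percentage, so multiply by 100 before comparing the result against a percentage value.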

AWS/ElasticBeanstalk

  • Scrape Intervals: 5 minutes
| Metric | CloudWatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_elasticbeanstalk_info | | | |
| aws_elasticbeanstalk_application_latency_p10 | ApplicationLatencyP10 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the 10th percentile application latency for Elastic Beanstalk |
| aws_elasticbeanstalk_application_latency_p50 | ApplicationLatencyP50 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the median (50th percentile) application latency |
| aws_elasticbeanstalk_application_latency_p75 | ApplicationLatencyP75 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the 75th percentile latency of requests in Elastic Beanstalk |
| aws_elasticbeanstalk_application_latency_p85 | ApplicationLatencyP85 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the 85th percentile latency for Elastic Beanstalk applications |
| aws_elasticbeanstalk_application_latency_p90 | ApplicationLatencyP90 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the 90th percentile application latency |
| aws_elasticbeanstalk_application_latency_p95 | ApplicationLatencyP95 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the 95th percentile latency for Elastic Beanstalk applications |
| aws_elasticbeanstalk_application_latency_p99 | ApplicationLatencyP99 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the 99th percentile application latency |
| aws_elasticbeanstalk_application_latency_p99_9 | ApplicationLatencyP99.9 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticbeanstalk_application_requests2xx | ApplicationRequests2xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of successful application requests with 2xx status codes |
| aws_elasticbeanstalk_application_requests3xx | ApplicationRequests3xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of application requests with 3xx (redirection) status codes |
| aws_elasticbeanstalk_application_requests4xx | ApplicationRequests4xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of client error requests with 4xx status codes |
| aws_elasticbeanstalk_application_requests5xx | ApplicationRequests5xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of server error requests with 5xx status codes |
| aws_elasticbeanstalk_application_requests_total | ApplicationRequestsTotal | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of application requests received |
| aws_elasticbeanstalk_cpuidle | CPUIdle | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the idle CPU time of instances within Elastic Beanstalk |
| aws_elasticbeanstalk_cpuiowait | CPUIowait | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the CPU time spent waiting for I/O operations to complete |
| aws_elasticbeanstalk_cpuirq | CPUIrq | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the time spent on interrupt requests (IRQ) on the CPU |
| aws_elasticbeanstalk_cpunice | CPUNice | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the CPU time spent on user processes that have been “niced” |
| aws_elasticbeanstalk_cpusoftirq | CPUSoftirq | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors CPU time used for soft interrupt requests |
| aws_elasticbeanstalk_cpusystem | CPUSystem | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the amount of CPU time spent executing system-level tasks |
| aws_elasticbeanstalk_cpuuser | CPUUser | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of CPU time spent executing user processes |
| aws_elasticbeanstalk_environment_health | EnvironmentHealth | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the overall health status of the Elastic Beanstalk environment |
| aws_elasticbeanstalk_instance_health | InstanceHealth | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the health status of individual instances in Elastic Beanstalk |
| aws_elasticbeanstalk_instances_degraded | InstancesDegraded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of instances with degraded health |
| aws_elasticbeanstalk_instances_info | InstancesInfo | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_elasticbeanstalk_instances_no_data | InstancesNoData | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of instances reporting no data |
| aws_elasticbeanstalk_instances_ok | InstancesOk | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of healthy instances in the environment |
| aws_elasticbeanstalk_instances_pending | InstancesPending | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of instances in a pending state |
| aws_elasticbeanstalk_instances_severe | InstancesSevere | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of instances with severe health problems |
| aws_elasticbeanstalk_instances_unknown | InstancesUnknown | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of instances with unknown health status |
| aws_elasticbeanstalk_instances_warning | InstancesWarning | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of instances in warning status |
| aws_elasticbeanstalk_load_average1min | LoadAverage1min | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the system load average over the last 1 minute |
| aws_elasticbeanstalk_load_average5min | LoadAverage5min | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the system load average over the last 5 minutes |
| aws_elasticbeanstalk_root_filesystem_util | RootFilesystemUtil | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the usage of the root file system |
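
The metric names in these tables appear to follow a regular pattern: the CloudWatch namespace is lowercased with `/` replaced by `_`, and the CloudWatch metric name is converted to snake case, with an underscore inserted only where a lowercase letter or digit is followed by an uppercase letter. The sketch below reproduces that pattern as inferred from the listed rows. The function name `prometheus_metric_name` is hypothetical, and the rule is an assumption drawn from the tables, not the integration's documented implementation.

```python
import re


def prometheus_metric_name(namespace: str, cloudwatch_metric: str) -> str:
    """Build a name like the ones in the Metric column (inferred rule, not official)."""
    # Namespace: "AWS/ElastiCache" -> "aws_elasticache".
    prefix = namespace.replace("/", "_").lower()
    # Insert "_" where a lowercase letter or digit is followed by an uppercase
    # letter, so runs of capitals such as "CPU" or "HDFS" stay joined.
    snake = re.sub(r"([a-z0-9])([A-Z])", r"\1_\2", cloudwatch_metric)
    # Spaces and dots in the CloudWatch name show up as "_" in the tables.
    snake = re.sub(r"[ .]", "_", snake).lower()
    return f"{prefix}_{snake}"
```

For instance, `prometheus_metric_name("AWS/ElasticBeanstalk", "ApplicationLatencyP99.9")` yields `aws_elasticbeanstalk_application_latency_p99_9`, matching the row listed for that metric.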

AWS/ElasticMapReduce

  • Scrape Intervals: 5 minutes
| Metric | CloudWatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_elasticmapreduce_info | | | |
| aws_elasticmapreduce_apps_completed | AppsCompleted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of applications that have successfully completed |
| aws_elasticmapreduce_apps_failed | AppsFailed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of applications that have failed |
| aws_elasticmapreduce_apps_killed | AppsKilled | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of applications that were terminated or killed |
| aws_elasticmapreduce_apps_pending | AppsPending | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of applications that are in the pending state |
| aws_elasticmapreduce_apps_running | AppsRunning | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of applications currently running |
| aws_elasticmapreduce_apps_submitted | AppsSubmitted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total number of applications that have been submitted |
| aws_elasticmapreduce_backup_failed | BackupFailed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of backup attempts that failed |
| aws_elasticmapreduce_capacity_remaining_gb | CapacityRemainingGB | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the remaining storage capacity in gigabytes within the cluster |
| aws_elasticmapreduce_cluster_status | Cluster Status | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the overall status of the Elastic MapReduce cluster |
| aws_elasticmapreduce_container_allocated | ContainerAllocated | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of containers allocated for running tasks |
| aws_elasticmapreduce_container_pending | ContainerPending | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of containers pending allocation |
| aws_elasticmapreduce_container_pending_ratio | ContainerPendingRatio | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the ratio of pending containers to total containers |
| aws_elasticmapreduce_container_reserved | ContainerReserved | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of containers reserved for future tasks |
| aws_elasticmapreduce_core_nodes_pending | CoreNodesPending | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of core nodes that are pending |
| aws_elasticmapreduce_core_nodes_running | CoreNodesRunning | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of core nodes that are currently running |
| aws_elasticmapreduce_corrupt_blocks | CorruptBlocks | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of blocks that are identified as corrupt |
| aws_elasticmapreduce_dfs_pending_replication_blocks | DfsPendingReplicationBlocks | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of HDFS blocks that are pending replication |
| aws_elasticmapreduce_hbase | HBase | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the health and activity of the HBase database in the cluster |
| aws_elasticmapreduce_hdfsbytes_read | HDFSBytesRead | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of bytes read from HDFS in the cluster |
| aws_elasticmapreduce_hdfsbytes_written | HDFSBytesWritten | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of bytes written to HDFS |
| aws_elasticmapreduce_hdfsutilization | HDFSUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the utilization of HDFS in the cluster |
| aws_elasticmapreduce_hbase_backup_failed | HbaseBackupFailed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of failed backups for HBase in the cluster |
| aws_elasticmapreduce_io | IO | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors input/output (I/O) operations in the cluster |
| aws_elasticmapreduce_is_idle | IsIdle | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks if the cluster or a node is currently idle |
| aws_elasticmapreduce_jobs_failed | JobsFailed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of failed jobs in the cluster |
| aws_elasticmapreduce_jobs_running | JobsRunning | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of currently running jobs |
aws_elasticmapreduce_live_data_nodesLiveDataNodesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of live data nodes in the cluster
aws_elasticmapreduce_live_task_trackersLiveTaskTrackersAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of live task trackers
aws_elasticmapreduce_mractive_nodesMRActiveNodesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of active MapReduce nodes in the cluster
aws_elasticmapreduce_mrdecommissioned_nodesMRDecommissionedNodesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of decommissioned MapReduce nodes
aws_elasticmapreduce_mrlost_nodesMRLostNodesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of lost MapReduce nodes in the cluster
aws_elasticmapreduce_mrrebooted_nodesMRRebootedNodesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of rebooted MapReduce nodes
aws_elasticmapreduce_mrtotal_nodesMRTotalNodesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of MapReduce nodes
aws_elasticmapreduce_mrunhealthy_nodesMRUnhealthyNodesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of unhealthy MapReduce nodes
aws_elasticmapreduce_map_reduceMap/ReduceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_elasticmapreduce_map_slots_openMapSlotsOpenAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of open Map slots in the cluster
aws_elasticmapreduce_map_tasks_remainingMapTasksRemainingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of remaining Map tasks
aws_elasticmapreduce_map_tasks_runningMapTasksRunningAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of Map tasks currently running
aws_elasticmapreduce_memory_allocated_mbMemoryAllocatedMBAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the memory allocated in MB in the cluster
aws_elasticmapreduce_memory_available_mbMemoryAvailableMBAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the available memory in MB in the cluster
aws_elasticmapreduce_memory_reserved_mbMemoryReservedMBAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the memory reserved for future tasks in MB
aws_elasticmapreduce_memory_total_mbMemoryTotalMBAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total memory available in MB in the cluster
aws_elasticmapreduce_missing_blocksMissingBlocksAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of missing HDFS blocks in the cluster
aws_elasticmapreduce_most_recent_backup_durationMostRecentBackupDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the duration of the most recent backup
aws_elasticmapreduce_node_statusNode StatusAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the overall status of the nodes in the cluster
aws_elasticmapreduce_pending_deletion_blocksPendingDeletionBlocksAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of HDFS blocks pending deletion
aws_elasticmapreduce_reduce_slots_openReduceSlotsOpenAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of open Reduce slots in the cluster
aws_elasticmapreduce_reduce_tasks_remainingReduceTasksRemainingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of remaining Reduce tasks
aws_elasticmapreduce_reduce_tasks_runningReduceTasksRunningAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of currently running Reduce tasks
aws_elasticmapreduce_remaining_map_tasks_per_slotRemainingMapTasksPerSlotAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the remaining Map tasks per slot
aws_elasticmapreduce_s3_bytes_readS3BytesReadAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of bytes read from S3 during the cluster operation
aws_elasticmapreduce_s3_bytes_writtenS3BytesWrittenAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of bytes written to S3 during the cluster operation
aws_elasticmapreduce_task_nodes_pendingTaskNodesPendingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of task nodes that are pending allocation
aws_elasticmapreduce_task_nodes_runningTaskNodesRunningAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of running task nodes in the cluster
aws_elasticmapreduce_time_since_last_successful_backupTimeSinceLastSuccessfulBackupAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the time elapsed since the last successful backup
aws_elasticmapreduce_total_loadTotalLoadAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total computational load on the cluster
aws_elasticmapreduce_under_replicated_blocksUnderReplicatedBlocksAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of under-replicated HDFS blocks in the cluster
aws_elasticmapreduce_yarnmemory_available_percentageYARNMemoryAvailablePercentageAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the percentage of available YARN memory in the cluster
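
The metric names in the first column appear to follow a regular pattern relative to the CloudWatch names: the namespace is lowercased with `/` replaced by `_`, and the metric name is split wherever a lowercase letter or digit is followed by an uppercase letter, with other punctuation replaced by `_`. The Python sketch below reproduces that pattern for rows on this page. The helper name `prometheus_name` is hypothetical, and the rule is inferred from the table rows rather than taken from the exporter's source, so treat it as an approximation.

```python
import re


def prometheus_name(namespace: str, cloudwatch_metric: str) -> str:
    """Approximate the metric name shown in the first table column.

    Hypothetical helper: the rule is inferred from the rows on this page,
    not taken from the exporter's implementation.
    """
    # Namespace: lowercase it and turn separators such as "/" into "_".
    prefix = re.sub(r"[^a-z0-9]+", "_", namespace.lower())
    # Metric: insert "_" where a lowercase letter or digit is followed by an
    # uppercase letter, so runs of capitals such as "HDFS" stay together.
    split = re.sub(r"([a-z0-9])([A-Z])", r"\1_\2", cloudwatch_metric)
    # Replace remaining punctuation (".", "/", spaces) with "_" and lowercase.
    suffix = re.sub(r"[^A-Za-z0-9]+", "_", split).lower()
    return f"{prefix}_{suffix}"


print(prometheus_name("AWS/ElasticMapReduce", "HDFSBytesRead"))
print(prometheus_name("AWS/ElasticMapReduce", "Map/Reduce"))
print(prometheus_name("AWS/ElasticMapReduce", "S3BytesRead"))
```

Going from the CloudWatch name to the table name this way can help when searching for a metric you already know from the AWS console. The reverse direction is lossy (for example, `hdfsbytes_read` no longer records where `HDFS` ends), so look up the CloudWatch name in the table instead of reconstructing it.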

AWS/Events

  • Scrape Intervals: 5 minutes
| Metric | Cloudwatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_events_info | | | |
| aws_events_dead_letter_invocations | DeadLetterInvocations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of times a message is sent to the dead letter queue |
| aws_events_ | Events | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_events_failed_invocations | FailedInvocations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of invocation failures |
| aws_events_ingestionto_invocation_complete_latency | IngestiontoInvocationCompleteLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the latency from event ingestion to invocation completion |
| aws_events_ingestionto_invocation_start_latency | IngestiontoInvocationStartLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the latency from event ingestion to invocation start |
| aws_events_invocation_attempts | InvocationAttempts | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of invocation attempts |
| aws_events_invocations | Invocations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of invocations |
| aws_events_invocations_created | InvocationsCreated | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of invocations created |
| aws_events_invocations_failed_to_be_sent_to_dlq | InvocationsFailedToBeSentToDlq | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of invocations that failed to be sent to the dead letter queue |
| aws_events_invocations_sent_to_dlq | InvocationsSentToDlq | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of invocations successfully sent to the dead letter queue |
| aws_events_matched_events | MatchedEvents | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of events that matched event rules |
| aws_events_put_events_approximate_call_count | PutEventsApproximateCallCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the approximate number of PutEvents API call requests |
| aws_events_put_events_approximate_failed_count | PutEventsApproximateFailedCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the approximate number of PutEvents API call failures |
| aws_events_put_events_approximate_success_count | PutEventsApproximateSuccessCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the approximate number of successful PutEvents API call requests |
| aws_events_put_events_approximate_throttled_count | PutEventsApproximateThrottledCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the approximate number of throttled PutEvents API call requests |
| aws_events_put_events_entries_count | PutEventsEntriesCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of event entries in PutEvents requests |
| aws_events_put_events_failed_entries_count | PutEventsFailedEntriesCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of failed event entries in PutEvents requests |
| aws_events_put_events_latency | PutEventsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the latency of PutEvents API requests |
| aws_events_put_events_request_size | PutEventsRequestSize | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the size of PutEvents API requests |
| aws_events_put_partner_events_approximate_call_count | PutPartnerEventsApproximateCallCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the approximate number of PutPartnerEvents API call requests |
| aws_events_put_partner_events_approximate_failed_count | PutPartnerEventsApproximateFailedCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the approximate number of failed PutPartnerEvents API call requests |
| aws_events_put_partner_events_approximate_success_count | PutPartnerEventsApproximateSuccessCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the approximate number of successful PutPartnerEvents API call requests |
| aws_events_put_partner_events_approximate_throttled_count | PutPartnerEventsApproximateThrottledCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the approximate number of throttled PutPartnerEvents API call requests |
| aws_events_put_partner_events_entries_count | PutPartnerEventsEntriesCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of event entries in PutPartnerEvents requests |
| aws_events_put_partner_events_failed_entries_count | PutPartnerEventsFailedEntriesCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of failed event entries in PutPartnerEvents requests |
| aws_events_put_partner_events_latency | PutPartnerEventsLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the latency of PutPartnerEvents API requests |
| aws_events_retry_invocation_attempts | RetryInvocationAttempts | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of retry invocation attempts |
| aws_events_successful_invocation_attempts | SuccessfulInvocationAttempts | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of successful invocation attempts |
| aws_events_throttled_rules | ThrottledRules | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of rules that were throttled |
| aws_events_triggered_rules | TriggeredRules | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of event rules that were triggered |
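
To cross-check a scraped value against CloudWatch directly, a request for the same metric, statistics, and 5-minute period can be assembled roughly as follows. This is a sketch only: `build_queries` is a hypothetical helper and the rule name `my-rule` is a placeholder. It shows an equivalent request shape for the CloudWatch `GetMetricData` API, not how the integration itself scrapes.

```python
from typing import Dict, List, Sequence

# Statistics listed in the Statistics column of the tables.
STATISTICS = ("Average", "Maximum", "Minimum", "Sum", "SampleCount",
              "p50", "p75", "p90", "p95", "p99", "p99.5", "p99.9")

PERIOD_SECONDS = 300  # matches the 5-minute scrape interval


def build_queries(namespace: str, metric: str,
                  dimensions: Dict[str, str],
                  statistics: Sequence[str] = STATISTICS) -> List[dict]:
    """Build one MetricDataQueries entry per statistic (hypothetical helper)."""
    dims = [{"Name": name, "Value": value} for name, value in dimensions.items()]
    queries = []
    for index, stat in enumerate(statistics):
        queries.append({
            # Ids must start with a lowercase letter and cannot contain ".",
            # so use a counter instead of the statistic name (e.g. "p99.5").
            "Id": f"q{index}",
            "MetricStat": {
                "Metric": {
                    "Namespace": namespace,
                    "MetricName": metric,
                    "Dimensions": dims,
                },
                "Period": PERIOD_SECONDS,
                "Stat": stat,
            },
            "Label": f"{metric} {stat}",
            "ReturnData": True,
        })
    return queries


queries = build_queries("AWS/Events", "FailedInvocations", {"RuleName": "my-rule"})
print(len(queries))
print(queries[0]["MetricStat"]["Stat"])
```

The resulting list could be passed as the `MetricDataQueries` argument of boto3's CloudWatch `get_metric_data` call, together with `StartTime` and `EndTime`.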

AWS/FSx

  • Scrape Intervals: 5 minutes
| Metric | Cloudwatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_fsx_info | | | |
| aws_fsx_cpuutilization | CPUUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the percentage of CPU utilization on the FSx file system |
| aws_fsx_client_connections | ClientConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of active client connections to the FSx file system |
| aws_fsx_data_read_bytes | DataReadBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total bytes read from the file system |
| aws_fsx_data_read_operations | DataReadOperations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of data read operations |
| aws_fsx_data_write_bytes | DataWriteBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total bytes written to the file system |
| aws_fsx_data_write_operations | DataWriteOperations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of data write operations |
| aws_fsx_deduplication_saved_storage | DeduplicationSavedStorage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of storage saved through data deduplication |
| aws_fsx_disk_iops_utilization | DiskIopsUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the percentage of disk IOPS (Input/Output Operations Per Second) utilization |
| aws_fsx_disk_read_bytes | DiskReadBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total bytes read from the disk |
| aws_fsx_disk_read_operations | DiskReadOperations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of disk read operations |
| aws_fsx_disk_throughput_balance | DiskThroughputBalance | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the balance of disk throughput usage |
| aws_fsx_disk_throughput_utilization | DiskThroughputUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the percentage of disk throughput utilization |
| aws_fsx_disk_write_bytes | DiskWriteBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total bytes written to the disk |
| aws_fsx_disk_write_operations | DiskWriteOperations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of disk write operations |
| aws_fsx_file_server_disk_iops_balance | FileServerDiskIopsBalance | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the balance of IOPS utilization on the file server |
| aws_fsx_file_server_disk_iops_utilization | FileServerDiskIopsUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the percentage of IOPS utilization on the file server |
| aws_fsx_file_server_disk_throughput_balance | FileServerDiskThroughputBalance | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the balance of disk throughput on the file server |
| aws_fsx_file_server_disk_throughput_utilization | FileServerDiskThroughputUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the percentage of disk throughput utilization on the file server |
| aws_fsx_free_data_storage_capacity | FreeDataStorageCapacity | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the amount of free data storage capacity available |
| aws_fsx_free_storage_capacity | FreeStorageCapacity | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total amount of free storage capacity available |
| aws_fsx_memory_utilization | MemoryUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the percentage of memory utilization on the file system |
| aws_fsx_metadata_operations | MetadataOperations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of metadata operations (like file system metadata lookups) |
| aws_fsx_network_throughput_utilization | NetworkThroughputUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the percentage of network throughput utilization |
| aws_fsx_storage_capacity_utilization | StorageCapacityUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the percentage of storage capacity utilization |

AWS/Firehose

  • Scrape Intervals: 5 minutes
| Metric | Cloudwatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_firehose_info | | | |
| aws_firehose_active_partitions_limit | ActivePartitionsLimit | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the limit of active partitions |
| aws_firehose_backup_to_s3_bytes | BackupToS3.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_backup_to_s3_data_freshness | BackupToS3.DataFreshness | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_backup_to_s3_records | BackupToS3.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_backup_to_s3_success | BackupToS3.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_bytes_per_second_limit | BytesPerSecondLimit | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the bytes per second limit for data delivery |
| aws_firehose_data_read_from_kinesis_stream_bytes | DataReadFromKinesisStream.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_data_read_from_kinesis_stream_records | DataReadFromKinesisStream.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_data_read_from_source_backpressured | DataReadFromSource.Backpressured | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_data_read_from_source_bytes | DataReadFromSource.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_data_read_from_source_records | DataReadFromSource.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_amazon_open_search_serverless_auth_failure | DeliveryToAmazonOpenSearchServerless.AuthFailure | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_amazon_open_search_serverless_bytes | DeliveryToAmazonOpenSearchServerless.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_amazon_open_search_serverless_data_freshness | DeliveryToAmazonOpenSearchServerless.DataFreshness | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_amazon_open_search_serverless_delivery_rejected | DeliveryToAmazonOpenSearchServerless.DeliveryRejected | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_amazon_open_search_serverless_records | DeliveryToAmazonOpenSearchServerless.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_amazon_open_search_serverless_success | DeliveryToAmazonOpenSearchServerless.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_amazon_open_search_service_auth_failure | DeliveryToAmazonOpenSearchService.AuthFailure | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_amazon_open_search_service_bytes | DeliveryToAmazonOpenSearchService.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_amazon_open_search_service_data_freshness | DeliveryToAmazonOpenSearchService.DataFreshness | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_amazon_open_search_service_delivery_rejected | DeliveryToAmazonOpenSearchService.DeliveryRejected | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_amazon_open_search_service_records | DeliveryToAmazonOpenSearchService.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_amazon_open_search_service_success | DeliveryToAmazonOpenSearchService.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_elasticsearch_bytes | DeliveryToElasticsearch.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_elasticsearch_records | DeliveryToElasticsearch.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_elasticsearch_success | DeliveryToElasticsearch.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_http_endpoint_bytes | DeliveryToHttpEndpoint.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_http_endpoint_data_freshness | DeliveryToHttpEndpoint.DataFreshness | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_http_endpoint_processed_bytes | DeliveryToHttpEndpoint.ProcessedBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_http_endpoint_processed_records | DeliveryToHttpEndpoint.ProcessedRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_http_endpoint_records | DeliveryToHttpEndpoint.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_http_endpoint_success | DeliveryToHttpEndpoint.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_redshift_bytes | DeliveryToRedshift.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_redshift_records | DeliveryToRedshift.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_redshift_success | DeliveryToRedshift.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_s3_bytes | DeliveryToS3.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_s3_data_freshness | DeliveryToS3.DataFreshness | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_s3_object_count | DeliveryToS3.ObjectCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_s3_records | DeliveryToS3.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_s3_success | DeliveryToS3.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_snowflake_bytes | DeliveryToSnowflake.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_snowflake_data_commit_latency | DeliveryToSnowflake.DataCommitLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_snowflake_data_freshness | DeliveryToSnowflake.DataFreshness | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_snowflake_records | DeliveryToSnowflake.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_snowflake_success | DeliveryToSnowflake.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_splunk_bytes | DeliveryToSplunk.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_splunk_data_ack_latency | DeliveryToSplunk.DataAckLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_splunk_data_freshness | DeliveryToSplunk.DataFreshness | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_splunk_records | DeliveryToSplunk.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_delivery_to_splunk_success | DeliveryToSplunk.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_describe_delivery_stream_latency | DescribeDeliveryStream.Latency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_describe_delivery_stream_requests | DescribeDeliveryStream.Requests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_execute_processing_duration | ExecuteProcessing.Duration | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_execute_processing_success | ExecuteProcessing.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_failed_conversion_bytes | FailedConversion.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_failed_conversion_records | FailedConversion.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_failed_validation_bytes | FailedValidation.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_failed_validation_records | FailedValidation.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_incoming_bytes | IncomingBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the amount of incoming data in bytes |
| aws_firehose_incoming_put_requests | IncomingPutRequests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of incoming put requests |
| aws_firehose_incoming_records | IncomingRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of incoming records |
| aws_firehose_jqprocessing_duration | JQProcessing.Duration | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_kmskey_access_denied | KMSKeyAccessDenied | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors instances where access to the KMS (Key Management Service) key is denied |
| aws_firehose_kmskey_disabled | KMSKeyDisabled | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the instances where the KMS key is disabled |
| aws_firehose_kmskey_invalid_state | KMSKeyInvalidState | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the instances where the KMS key is in an invalid state |
| aws_firehose_kmskey_not_found | KMSKeyNotFound | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the instances where the KMS key is not found |
| aws_firehose_kafka_offset_lag | KafkaOffsetLag | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the lag in Kafka offset |
| aws_firehose_kinesis_millis_behind_latest | KinesisMillisBehindLatest | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the time lag (in milliseconds) behind the latest record in Kinesis |
| aws_firehose_list_delivery_streams_latency | ListDeliveryStreams.Latency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_list_delivery_streams_requests | ListDeliveryStreams.Requests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_output_decompressed_bytes_failed | OutputDecompressedBytes.Failed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_output_decompressed_bytes_success | OutputDecompressedBytes.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_output_decompressed_records_failed | OutputDecompressedRecords.Failed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_output_decompressed_records_success | OutputDecompressedRecords.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_partition_count | PartitionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the count of partitions during data delivery |
| aws_firehose_partition_count_exceeded | PartitionCountExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors instances where partition count exceeds limits |
| aws_firehose_per_partition_throughput | PerPartitionThroughput | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the throughput per partition during data delivery |
| aws_firehose_put_record_bytes | PutRecord.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_put_record_latency | PutRecord.Latency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_put_record_requests | PutRecord.Requests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_put_record_batch_bytes | PutRecordBatch.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_put_record_batch_latency | PutRecordBatch.Latency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_put_record_batch_records | PutRecordBatch.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_put_record_batch_requests | PutRecordBatch.Requests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_put_requests_per_second_limit | PutRequestsPerSecondLimit | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the limit on PutRecord requests per second |
| aws_firehose_records_per_second_limit | RecordsPerSecondLimit | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the limit on records processed per second |
| aws_firehose_resource_count | ResourceCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the count of resources in the data delivery stream |
| aws_firehose_source_throttled_delay | SourceThrottled.Delay | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_succeed_conversion_bytes | SucceedConversion.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_succeed_conversion_records | SucceedConversion.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_succeed_processing_bytes | SucceedProcessing.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_succeed_processing_records | SucceedProcessing.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_firehose_throttled_describe_stream | ThrottledDescribeStream | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors instances of throttled DescribeStream API calls |
aws_firehose_throttled_get_recordsThrottledGetRecordsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures instances of throttled GetRecords API calls
aws_firehose_throttled_get_shard_iteratorThrottledGetShardIteratorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks instances of throttled GetShardIterator API calls
aws_firehose_throttled_recordsThrottledRecordsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures instances where records are throttled
aws_firehose_update_delivery_stream_latencyUpdateDeliveryStream.LatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_firehose_update_delivery_stream_requestsUpdateDeliveryStream.RequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9

AWS/GameLift

  • Scrape Intervals: 5 minutes
| Metric | CloudWatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_gamelift_info | | | |
| aws_gamelift_activating_game_sessions | ActivatingGameSessions | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of game sessions currently being activated |
| aws_gamelift_active_game_sessions | ActiveGameSessions | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of active game sessions |
| aws_gamelift_active_instances | ActiveInstances | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of active GameLift instances |
| aws_gamelift_active_server_processes | ActiveServerProcesses | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of active server processes |
| aws_gamelift_available_game_servers | AvailableGameServers | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of available game servers |
| aws_gamelift_available_game_sessions | AvailableGameSessions | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of available game sessions |
| aws_gamelift_average_wait_time | AverageWaitTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the average wait time for players |
| aws_gamelift_current_player_sessions | CurrentPlayerSessions | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of current active player sessions |
| aws_gamelift_current_tickets | CurrentTickets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of current active matchmaking tickets |
| aws_gamelift_desired_instances | DesiredInstances | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of desired instances for the fleet |
| aws_gamelift_draining_available_game_servers | DrainingAvailableGameServers | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of available game servers that are draining |
| aws_gamelift_draining_utilized_game_servers | DrainingUtilizedGameServers | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of utilized game servers that are draining |
| aws_gamelift_first_choice_not_viable | FirstChoiceNotViable | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of times the first placement choice was not viable |
| aws_gamelift_first_choice_out_of_capacity | FirstChoiceOutOfCapacity | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of times the first placement choice ran out of capacity |
| aws_gamelift_game_session_interruptions | GameSessionInterruptions | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of game session interruptions |
| aws_gamelift_healthy_server_processes | HealthyServerProcesses | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of healthy server processes |
| aws_gamelift_idle_instances | IdleInstances | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of idle instances in the fleet |
| aws_gamelift_instance_interruptions | InstanceInterruptions | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of GameLift instance interruptions |
| aws_gamelift_lowest_latency_placement | LowestLatencyPlacement | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors placements based on the lowest latency |
| aws_gamelift_lowest_price_placement | LowestPricePlacement | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks placements based on the lowest price |
| aws_gamelift_match_acceptances_timed_out | MatchAcceptancesTimedOut | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of match acceptance timeouts |
| aws_gamelift_matches_accepted | MatchesAccepted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of matches that have been accepted |
| aws_gamelift_matches_created | MatchesCreated | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of matches that have been created |
| aws_gamelift_matches_placed | MatchesPlaced | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of matches successfully placed |
| aws_gamelift_matches_rejected | MatchesRejected | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of rejected matches |
| aws_gamelift_max_instances | MaxInstances | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the maximum number of instances |
| aws_gamelift_min_instances | MinInstances | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the minimum number of instances |
| aws_gamelift_percent_available_game_sessions | PercentAvailableGameSessions | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the percentage of available game sessions |
| aws_gamelift_percent_healthy_server_processes | PercentHealthyServerProcesses | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the percentage of healthy server processes |
| aws_gamelift_percent_idle_instances | PercentIdleInstances | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the percentage of idle instances |
| aws_gamelift_placement | Placement | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the match placement process |
| aws_gamelift_placements_canceled | PlacementsCanceled | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of canceled placements |
| aws_gamelift_placements_failed | PlacementsFailed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of failed placements |
| aws_gamelift_placements_started | PlacementsStarted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of placement processes started |
| aws_gamelift_placements_succeeded | PlacementsSucceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of successful placements |
| aws_gamelift_placements_timed_out | PlacementsTimedOut | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of timed-out placements |
| aws_gamelift_player_session_activations | PlayerSessionActivations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of activated player sessions |
| aws_gamelift_players_started | PlayersStarted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of players who have started their sessions |
| aws_gamelift_queue_depth | QueueDepth | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the depth of the matchmaking queue |
| aws_gamelift_rule_evaluations_failed | RuleEvaluationsFailed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of failed rule evaluations during matchmaking |
| aws_gamelift_rule_evaluations_passed | RuleEvaluationsPassed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of passed rule evaluations during matchmaking |
| aws_gamelift_server_process_abnormal_terminations | ServerProcessAbnormalTerminations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of abnormal terminations of server processes |
| aws_gamelift_server_process_activations | ServerProcessActivations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of server process activations |
| aws_gamelift_server_process_terminations | ServerProcessTerminations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of server process terminations |
| aws_gamelift_tickets_failed | TicketsFailed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of failed matchmaking tickets |
| aws_gamelift_tickets_started | TicketsStarted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of matchmaking tickets that have started |
| aws_gamelift_tickets_timed_out | TicketsTimedOut | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of matchmaking tickets that have timed out |
| aws_gamelift_time_to_match | TimeToMatch | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the average time taken to find a match |
| aws_gamelift_time_to_ticket_success | TimeToTicketSuccess | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the time taken to successfully complete a matchmaking ticket |
| aws_gamelift_utilized_game_servers | UtilizedGameServers | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of utilized game servers |

AWS/GlobalAccelerator

  • Scrape Intervals: 5 minutes
| Metric | CloudWatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_globalaccelerator_info | | | |
| aws_globalaccelerator_healthy_endpoint_count | HealthyEndpointCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of healthy endpoints in the accelerator |
| aws_globalaccelerator_new_flow_count | NewFlowCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of new network flows being processed |
| aws_globalaccelerator_processed_bytes_in | ProcessedBytesIn | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the volume of incoming traffic processed by the accelerator |
| aws_globalaccelerator_processed_bytes_out | ProcessedBytesOut | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the volume of outgoing traffic processed by the accelerator |
| aws_globalaccelerator_unhealthy_endpoint_count | UnhealthyEndpointCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |

AWS/Glue

  • Scrape Intervals: 5 minutes
| Metric | CloudWatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_glue_info | | | |
| aws_glue_all_disk_available_gb | glue.ALL.disk.available_GB | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_disk_used_percentage | glue.ALL.disk.used.percentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_disk_used_gb | glue.ALL.disk.used_GB | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_jvm_heap_usage | glue.ALL.jvm.heap.usage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_jvm_heap_used | glue.ALL.jvm.heap.used | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_memory_heap_available | glue.ALL.memory.heap.available | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_memory_heap_used | glue.ALL.memory.heap.used | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_memory_heap_used_percentage | glue.ALL.memory.heap.used.percentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_memory_non_heap_available | glue.ALL.memory.non-heap.available | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_memory_non_heap_percentage | glue.ALL.memory.non-heap.percentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_memory_non_heap_used | glue.ALL.memory.non-heap.used | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_memory_total_available | glue.ALL.memory.total.available | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_memory_total_used | glue.ALL.memory.total.used | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_memory_total_used_percentage | glue.ALL.memory.total.used.percentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_s3_filesystem_read_bytes | glue.ALL.s3.filesystem.read_bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_s3_filesystem_write_bytes | glue.ALL.s3.filesystem.write_bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_all_system_cpu_system_load | glue.ALL.system.cpuSystemLoad | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_block_manager_disk_disk_space_used_mb | glue.driver.BlockManager.disk.diskSpaceUsed_MB | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_executor_allocation_manager_executors_number_all_executors | glue.driver.ExecutorAllocationManager.executors.numberAllExecutors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_executor_allocation_manager_executors_number_max_needed_executors | glue.driver.ExecutorAllocationManager.executors.numberMaxNeededExecutors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_aggregate_bytes_read | glue.driver.aggregate.bytesRead | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_aggregate_elapsed_time | glue.driver.aggregate.elapsedTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_aggregate_num_completed_stages | glue.driver.aggregate.numCompletedStages | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_aggregate_num_completed_tasks | glue.driver.aggregate.numCompletedTasks | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_aggregate_num_failed_tasks | glue.driver.aggregate.numFailedTasks | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_aggregate_num_killed_tasks | glue.driver.aggregate.numKilledTasks | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_aggregate_records_read | glue.driver.aggregate.recordsRead | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_aggregate_shuffle_bytes_written | glue.driver.aggregate.shuffleBytesWritten | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_aggregate_shuffle_local_bytes_read | glue.driver.aggregate.shuffleLocalBytesRead | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_bytes_read | glue.driver.bytesRead | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_bytes_written | glue.driver.bytesWritten | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_disk_available_gb | glue.driver.disk.available_GB | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_disk_used_percentage | glue.driver.disk.used.percentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_disk_used_gb | glue.driver.disk.used_GB | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_files_read | glue.driver.filesRead | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_files_written | glue.driver.filesWritten | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_jvm_heap_usage | glue.driver.jvm.heap.usage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_jvm_heap_used | glue.driver.jvm.heap.used | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_memory_heap_available | glue.driver.memory.heap.available | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_memory_heap_used | glue.driver.memory.heap.used | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_memory_heap_used_percentage | glue.driver.memory.heap.used.percentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_memory_non_heap_available | glue.driver.memory.non-heap.available | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_memory_non_heap_percentage | glue.driver.memory.non-heap.percentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_memory_non_heap_used | glue.driver.memory.non-heap.used | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_memory_total_available | glue.driver.memory.total.available | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_memory_total_used | glue.driver.memory.total.used | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_memory_total_used_percentage | glue.driver.memory.total.used.percentage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_partitions_read | glue.driver.partitionsRead | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_records_read | glue.driver.recordsRead | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_records_written | glue.driver.recordsWritten | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_s3_filesystem_read_bytes | glue.driver.s3.filesystem.read_bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_s3_filesystem_write_bytes | glue.driver.s3.filesystem.write_bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_skewness_job | glue.driver.skewness.job | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_skewness_stage | glue.driver.skewness.stage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_streaming_batch_processing_time_in_ms | glue.driver.streaming.batchProcessingTimeInMs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_streaming_num_records | glue.driver.streaming.numRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_system_cpu_system_load | glue.driver.system.cpuSystemLoad | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_driver_worker_utilization | glue.driver.workerUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_error_all | glue.error.ALL | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_glue_succeed_all | glue.succeed.ALL | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
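
The exported metric names in these tables appear to follow a regular pattern derived from the namespace and the CloudWatch metric name. The sketch below reproduces that pattern for the rows listed on this page. It is an inference from the table, not the exporter's actual implementation; the function names `snake` and `exported_name` are hypothetical, and the rule that drops a leading service prefix (as seen in the `glue.` metrics) is an assumption.

```python
import re

# Assumption: anything that is not a letter or digit becomes an underscore.
_NON_ALNUM = re.compile(r"[^A-Za-z0-9]+")
# Assumption: an underscore is inserted where a lowercase letter or digit
# is followed by an uppercase letter (runs of capitals stay together).
_CAMEL_BOUNDARY = re.compile(r"([a-z0-9])([A-Z])")


def snake(text: str) -> str:
    """Convert a CloudWatch metric name to lower snake case."""
    text = _NON_ALNUM.sub("_", text)
    text = _CAMEL_BOUNDARY.sub(r"\1_\2", text)
    return text.lower()


def exported_name(namespace: str, cloudwatch_metric: str) -> str:
    """Guess the exported metric name for a namespace and CloudWatch metric."""
    # The namespace is lowercased as a whole: "AWS/GameLift" -> "aws_gamelift".
    prefix = _NON_ALNUM.sub("_", namespace).lower()
    service = prefix.split("_", 1)[1] if "_" in prefix else prefix
    metric = snake(cloudwatch_metric)
    # Assumption: a metric that already starts with the service name
    # (for example "glue.driver...") does not repeat it.
    if metric.startswith(service + "_"):
        metric = metric[len(service) + 1:]
    return f"{prefix}_{metric}"


if __name__ == "__main__":
    print(exported_name("AWS/Glue", "glue.driver.BlockManager.disk.diskSpaceUsed_MB"))
    print(exported_name("AWS/Kafka", "CPUCreditBalance"))
    print(exported_name("AWS/Firehose", "PutRecordBatch.Bytes"))
```

A helper like this can be handy when searching for the exported series that corresponds to a metric name copied from the CloudWatch console, but the tables remain the authoritative mapping.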

AWS/IoT

  • Scrape Intervals: 5 minutes
| Metric | CloudWatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_iot_info | | | |
| aws_iot_canceled_job_execution_count | CanceledJobExecutionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the count of canceled job executions |
| aws_iot_canceled_job_execution_total_count | CanceledJobExecutionTotalCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total count of canceled job executions |
| aws_iot_client_error | ClientError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the client error count |
| aws_iot_connect_auth_error | Connect.AuthError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_connect_client_error | Connect.ClientError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_connect_server_error | Connect.ServerError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_connect_success | Connect.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_connect_throttle | Connect.Throttle | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_delete_thing_shadow_accepted | DeleteThingShadow.Accepted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_failed_job_execution_count | FailedJobExecutionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the count of failed job executions |
| aws_iot_failed_job_execution_total_count | FailedJobExecutionTotalCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total count of failed job executions |
| aws_iot_failure | Failure | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks overall failure events |
| aws_iot_get_thing_shadow_accepted | GetThingShadow.Accepted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_in_progress_job_execution_count | InProgressJobExecutionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the count of in-progress job executions |
| aws_iot_in_progress_job_execution_total_count | InProgressJobExecutionTotalCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total count of in-progress job executions |
| aws_iot_non_compliant_resources | NonCompliantResources | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the count of non-compliant resources |
| aws_iot_num_log_batches_failed_to_publish_throttled | NumLogBatchesFailedToPublishThrottled | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors log batches that failed to publish due to throttling |
| aws_iot_num_log_events_failed_to_publish_throttled | NumLogEventsFailedToPublishThrottled | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures log events that failed to publish due to throttling |
| aws_iot_parse_error | ParseError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of message parse errors |
| aws_iot_ping_success | Ping.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_publish_in_auth_error | PublishIn.AuthError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_publish_in_client_error | PublishIn.ClientError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_publish_in_server_error | PublishIn.ServerError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_publish_in_success | PublishIn.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_publish_in_throttle | PublishIn.Throttle | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_publish_out_auth_error | PublishOut.AuthError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_publish_out_client_error | PublishOut.ClientError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_publish_out_success | PublishOut.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_queued_job_execution_count | QueuedJobExecutionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the count of job executions in the queue |
| aws_iot_queued_job_execution_total_count | QueuedJobExecutionTotalCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total count of queued job executions |
| aws_iot_rejected_job_execution_count | RejectedJobExecutionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the count of rejected job executions |
| aws_iot_rejected_job_execution_total_count | RejectedJobExecutionTotalCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total count of rejected job executions |
| aws_iot_removed_job_execution_count | RemovedJobExecutionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the count of removed job executions |
| aws_iot_removed_job_execution_total_count | RemovedJobExecutionTotalCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total count of removed job executions |
| aws_iot_resources_evaluated | ResourcesEvaluated | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of resources evaluated |
| aws_iot_rule_message_throttled | RuleMessageThrottled | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of rule messages throttled |
| aws_iot_rule_not_found | RuleNotFound | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures instances where rules were not found |
| aws_iot_rules_executed | RulesExecuted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of executed rules |
| aws_iot_server_error | ServerError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors server-side errors |
| aws_iot_subscribe_auth_error | Subscribe.AuthError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_subscribe_client_error | Subscribe.ClientError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_subscribe_server_error | Subscribe.ServerError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_subscribe_success | Subscribe.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_subscribe_throttle | Subscribe.Throttle | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_succeded_job_execution_count | SuccededJobExecutionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_succeded_job_execution_total_count | SuccededJobExecutionTotalCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_success | Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks overall successful operations |
| aws_iot_topic_match | TopicMatch | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of successful topic matches |
| aws_iot_unsubscribe_client_error | Unsubscribe.ClientError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_unsubscribe_server_error | Unsubscribe.ServerError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_unsubscribe_success | Unsubscribe.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_unsubscribe_throttle | Unsubscribe.Throttle | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_update_thing_shadow_accepted | UpdateThingShadow.Accepted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_iot_violations | Violations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks policy violations |
| aws_iot_violations_cleared | ViolationsCleared | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures cleared violations |
| aws_iot_violations_invalidated | ViolationsInvalidated | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks invalidated violations |

AWS/Kafka

  • Scrape Intervals: 5 minutes

| Metric | CloudWatch Metric | Statistics | Description |
|---|---|---|---|
| aws_kafka_info | | | |
| aws_kafka_active_controller_count | ActiveControllerCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Indicates how many active controllers are in the Kafka cluster. |
| aws_kafka_burst_balance | BurstBalance | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the remaining balance of I/O burst credits for the EBS volumes in the cluster. |
| aws_kafka_bw_in_allowance_exceeded | BwInAllowanceExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks instances where the incoming bandwidth allowance has been exceeded. |
| aws_kafka_bw_out_allowance_exceeded | BwOutAllowanceExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks instances where the outgoing bandwidth allowance has been exceeded. |
| aws_kafka_bytes_in_per_sec | BytesInPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the rate of incoming bytes per second into the Kafka cluster. |
| aws_kafka_bytes_out_per_sec | BytesOutPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the rate of outgoing bytes per second from the Kafka cluster. |
| aws_kafka_cpucredit_balance | CPUCreditBalance | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Shows the remaining CPU credits for instances running in burstable performance mode. |
| aws_kafka_client_connection_count | ClientConnectionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Indicates the total number of client connections to the Kafka brokers. |
| aws_kafka_conn_track_allowance_exceeded | ConnTrackAllowanceExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks instances where the connection tracking allowance is exceeded. |
| aws_kafka_connection_close_rate | ConnectionCloseRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the rate at which connections are being closed. |
| aws_kafka_connection_count | ConnectionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Displays the number of open connections to the Kafka brokers. |
| aws_kafka_connection_creation_rate | ConnectionCreationRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the rate of new connections being created to the Kafka brokers. |
| aws_kafka_cpu_credit_usage | CpuCreditUsage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Shows the CPU credits consumed by the Kafka instances running in burstable mode. |
| aws_kafka_cpu_idle | CpuIdle | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Indicates the percentage of idle CPU resources on Kafka instances. |
| aws_kafka_cpu_io_wait | CpuIoWait | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the percentage of CPU time that instances spend idle while waiting for I/O operations to complete. |
| aws_kafka_cpu_system | CpuSystem | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks CPU usage by system processes on Kafka instances. |
| aws_kafka_cpu_user | CpuUser | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Shows CPU usage by user processes on Kafka instances. |
| aws_kafka_estimated_max_time_lag | EstimatedMaxTimeLag | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Estimates the time needed for consumers to drain the maximum offset lag (MaxOffsetLag). |
| aws_kafka_estimated_time_lag | EstimatedTimeLag | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Estimates the time needed for consumers to drain the partition offset lag. |
| aws_kafka_fetch_consumer_local_time_ms_mean | FetchConsumerLocalTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the mean time, in milliseconds, that a consumer fetch request is processed at the leader. |
| aws_kafka_fetch_consumer_request_queue_time_ms_mean | FetchConsumerRequestQueueTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Indicates the mean time, in milliseconds, that consumer fetch requests wait in the request queue. |
| aws_kafka_fetch_consumer_response_queue_time_ms_mean | FetchConsumerResponseQueueTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the mean time, in milliseconds, that consumer fetch requests wait in the response queue. |
| aws_kafka_fetch_consumer_response_send_time_ms_mean | FetchConsumerResponseSendTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the mean time, in milliseconds, taken to send a consumer response. |
| aws_kafka_fetch_consumer_total_time_ms_mean | FetchConsumerTotalTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the mean total time, in milliseconds, spent processing a consumer fetch request. |
| aws_kafka_fetch_follower_local_time_ms_mean | FetchFollowerLocalTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the mean time, in milliseconds, that a follower fetch request is processed at the leader. |
| aws_kafka_fetch_follower_request_queue_time_ms_mean | FetchFollowerRequestQueueTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the mean time, in milliseconds, that follower fetch requests spend in the request queue. |
| aws_kafka_fetch_follower_response_queue_time_ms_mean | FetchFollowerResponseQueueTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the mean time, in milliseconds, that follower fetch requests spend in the response queue. |
| aws_kafka_fetch_follower_response_send_time_ms_mean | FetchFollowerResponseSendTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the mean time, in milliseconds, taken to send a fetch response to a follower. |
| aws_kafka_fetch_follower_total_time_ms_mean | FetchFollowerTotalTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the mean total time, in milliseconds, for a Kafka broker follower to fetch messages. |
| aws_kafka_fetch_message_conversions_per_sec | FetchMessageConversionsPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the rate of message format conversions during fetching. |
| aws_kafka_fetch_throttle_byte_rate | FetchThrottleByteRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the rate at which fetching is throttled due to byte rate limits. |
| aws_kafka_fetch_throttle_queue_size | FetchThrottleQueueSize | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Indicates the number of messages in the fetch throttle queue. |
| aws_kafka_fetch_throttle_time | FetchThrottleTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the time Kafka throttles fetch requests. |
| aws_kafka_global_partition_count | GlobalPartitionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Displays the total number of partitions across all topics in the Kafka cluster. |
| aws_kafka_global_topic_count | GlobalTopicCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Shows the total number of topics in the Kafka cluster. |
| aws_kafka_heap_memory_after_gc | HeapMemoryAfterGC | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the percentage of total heap memory in use after garbage collection. |
| aws_kafka_app_logs_disk_used | KafkaAppLogsDiskUsed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of disk space used by Kafka application logs. |
| aws_kafka_data_logs_disk_used | KafkaDataLogsDiskUsed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the disk space used by Kafka data logs. |
| aws_kafka_leader_count | LeaderCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Shows the number of partition leaders per broker. |
| aws_kafka_max_offset_lag | MaxOffsetLag | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the maximum consumer offset lag across all partitions in a topic. |
| aws_kafka_memory_buffered | MemoryBuffered | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Indicates the amount of buffered memory on Kafka brokers. |
| aws_kafka_memory_cached | MemoryCached | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Shows the amount of cached memory on Kafka brokers. |
| aws_kafka_memory_free | MemoryFree | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Displays the amount of free memory on Kafka brokers. |
| aws_kafka_memory_used | MemoryUsed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total amount of memory being used by Kafka brokers. |
| aws_kafka_messages_in_per_sec | MessagesInPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of messages produced per second in the Kafka cluster. |
| aws_kafka_network_processor_avg_idle_percent | NetworkProcessorAvgIdlePercent | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the average idle percentage of the network processors. |
| aws_kafka_network_rx_dropped | NetworkRxDropped | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Shows the number of dropped incoming network packets. |
| aws_kafka_network_rx_errors | NetworkRxErrors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of errors on received network packets. |
| aws_kafka_network_rx_packets | NetworkRxPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of network packets received. |
| aws_kafka_network_tx_dropped | NetworkTxDropped | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of dropped outgoing network packets. |
| aws_kafka_network_tx_errors | NetworkTxErrors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Shows the number of errors on transmitted network packets. |
| aws_kafka_network_tx_packets | NetworkTxPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of network packets transmitted. |
| aws_kafka_offline_partitions_count | OfflinePartitionsCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of Kafka partitions that are offline. |
| aws_kafka_offset_lag | OffsetLag | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the partition-level consumer lag, in number of offsets. |
| aws_kafka_partition_count | PartitionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Displays the number of topic partitions per broker, including replicas. |
| aws_kafka_pps_allowance_exceeded | PpsAllowanceExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks instances where the packets-per-second allowance has been exceeded. |
| aws_kafka_produce_local_time_ms_mean | ProduceLocalTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the mean time, in milliseconds, that a produce request is processed at the leader. |
| aws_kafka_produce_message_conversions_per_sec | ProduceMessageConversionsPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the rate of message format conversions during production. |
| aws_kafka_produce_message_conversions_time_ms_mean | ProduceMessageConversionsTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the mean time, in milliseconds, taken to convert messages during production. |
| aws_kafka_produce_request_queue_time_ms_mean | ProduceRequestQueueTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the mean time, in milliseconds, that produce requests spend in the request queue. |
| aws_kafka_produce_response_queue_time_ms_mean | ProduceResponseQueueTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the mean time, in milliseconds, that produce responses spend in the response queue. |
| aws_kafka_produce_response_send_time_ms_mean | ProduceResponseSendTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the mean time, in milliseconds, it takes to send produce responses. |
| aws_kafka_produce_throttle_byte_rate | ProduceThrottleByteRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the rate at which production is throttled due to byte rate limits. |
| aws_kafka_produce_throttle_queue_size | ProduceThrottleQueueSize | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the size of the produce throttle queue. |
| aws_kafka_produce_throttle_time | ProduceThrottleTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the time Kafka throttles produce requests. |
| aws_kafka_produce_total_time_ms_mean | ProduceTotalTimeMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the mean total time, in milliseconds, spent on produce requests. |
| aws_kafka_remote_copy_bytes_per_sec | RemoteCopyBytesPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the rate of bytes copied remotely. |
| aws_kafka_remote_copy_errors_per_sec | RemoteCopyErrorsPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the rate of errors during remote copying. |
| aws_kafka_remote_copy_lag_bytes | RemoteCopyLagBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the lag in bytes during remote copying. |
| aws_kafka_remote_fetch_bytes_per_sec | RemoteFetchBytesPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the rate of bytes fetched remotely. |
| aws_kafka_remote_fetch_errors_per_sec | RemoteFetchErrorsPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the rate of errors during remote fetching. |
| aws_kafka_remote_fetch_requests_per_sec | RemoteFetchRequestsPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of remote fetch requests per second. |
| aws_kafka_remote_log_manager_tasks_avg_idle_percent | RemoteLogManagerTasksAvgIdlePercent | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the average idle percentage of remote log manager tasks. |
| aws_kafka_remote_log_reader_avg_idle_percent | RemoteLogReaderAvgIdlePercent | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the average idle percentage of remote log reader tasks. |
| aws_kafka_remote_log_reader_task_queue_size | RemoteLogReaderTaskQueueSize | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the size of the remote log reader task queue. |
| aws_kafka_replication_bytes_in_per_sec | ReplicationBytesInPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the rate of incoming replication bytes. |
| aws_kafka_replication_bytes_out_per_sec | ReplicationBytesOutPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the rate of outgoing replication bytes. |
| aws_kafka_request_bytes_mean | RequestBytesMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the average size of Kafka requests. |
| aws_kafka_request_exempt_from_throttle_time | RequestExemptFromThrottleTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the time spent processing requests that are exempt from throttling. |
| aws_kafka_request_handler_avg_idle_percent | RequestHandlerAvgIdlePercent | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the average idle percentage of request handlers. |
| aws_kafka_request_throttle_queue_size | RequestThrottleQueueSize | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the size of the request throttle queue. |
| aws_kafka_request_throttle_time | RequestThrottleTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the time requests are throttled in Kafka. |
| aws_kafka_request_time | RequestTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the overall time spent handling requests in Kafka. |
| aws_kafka_root_disk_used | RootDiskUsed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the amount of disk space used by the root partition. |
| aws_kafka_sum_offset_lag | SumOffsetLag | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the aggregated consumer offset lag across all partitions in a topic. |
| aws_kafka_swap_free | SwapFree | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the amount of free swap memory available on Kafka brokers. |
| aws_kafka_swap_used | SwapUsed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of swap memory used by Kafka brokers. |
| aws_kafka_tcpconnections | TCPConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of TCP connections on the Kafka cluster. |
| aws_kafka_tcp_connections | TcpConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the active TCP connections in the Kafka cluster. |
| aws_kafka_traffic_bytes | TrafficBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total traffic in bytes on Kafka brokers. |
| aws_kafka_traffic_shaping | TrafficShaping | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks instances where traffic shaping is applied to Kafka brokers. |
| aws_kafka_under_min_isr_partition_count | UnderMinIsrPartitionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of partitions below the minimum in-sync replica count. |
| aws_kafka_under_replicated_partitions | UnderReplicatedPartitions | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of under-replicated partitions in the Kafka cluster. |
| aws_kafka_volume_queue_length | VolumeQueueLength | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the queue length for disk I/O operations. |
| aws_kafka_volume_read_bytes | VolumeReadBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of bytes read from disk. |
| aws_kafka_volume_read_ops | VolumeReadOps | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of read operations on the disk. |
| aws_kafka_volume_total_read_time | VolumeTotalReadTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total time spent on disk read operations. |
| aws_kafka_volume_total_write_time | VolumeTotalWriteTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total time spent on disk write operations. |
| aws_kafka_volume_write_bytes | VolumeWriteBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of bytes written to disk. |
| aws_kafka_volume_write_ops | VolumeWriteOps | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of write operations on the disk. |
| aws_kafka_zoo_keeper_request_latency_ms_mean | ZooKeeperRequestLatencyMsMean | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the mean latency, in milliseconds, of requests to ZooKeeper. |
| aws_kafka_zoo_keeper_session_state | ZooKeeperSessionState | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the current session state of ZooKeeper. |
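
The Metric column appears to be derived mechanically from the namespace and the CloudWatch metric name. The following is a minimal sketch of that pattern, inferred only from the rows in these tables. The function name `to_prom_name` is hypothetical, and the actual exporter logic may differ for names not listed here.

```python
import re


def to_prom_name(namespace: str, cloudwatch_metric: str) -> str:
    """Approximate the Metric column from a namespace and a CloudWatch metric name.

    Assumption: these rules are inferred from the table rows, not taken from
    the exporter's source code.
    """
    # "AWS/Kafka" -> "aws_kafka"
    ns = re.sub(r"[^a-z0-9]+", "_", namespace.lower()).strip("_")
    # Insert "_" only where a lowercase letter or digit precedes an uppercase
    # letter, so acronym runs such as "CPU" or "GC" stay joined to what follows.
    name = re.sub(r"([a-z0-9])([A-Z])", r"\1_\2", cloudwatch_metric)
    # Dots and other separators become underscores, then lowercase everything.
    name = re.sub(r"[^A-Za-z0-9]+", "_", name).lower().strip("_")
    # Drop a leading word that repeats part of the namespace,
    # for example "kafka_app_logs_disk_used" under "aws_kafka".
    for part in ns.split("_"):
        if name.startswith(part + "_"):
            name = name[len(part) + 1:]
    return f"{ns}_{name}"


print(to_prom_name("AWS/Kafka", "KafkaAppLogsDiskUsed"))
print(to_prom_name("AWS/Kinesis", "GetRecords.IteratorAgeMilliseconds"))
```

The lowercase-to-uppercase rule is why `CPUCreditBalance` maps to `aws_kafka_cpucredit_balance` while `CpuCreditUsage` maps to `aws_kafka_cpu_credit_usage`.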

AWS/Kinesis

  • Scrape Intervals: 5 minutes

| Metric | CloudWatch Metric | Statistics | Description |
|---|---|---|---|
| aws_kinesis_info | | | |
| aws_kinesis_get_records_bytes | GetRecords.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_get_records_iterator_age | GetRecords.IteratorAge | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_get_records_iterator_age_milliseconds | GetRecords.IteratorAgeMilliseconds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_get_records_latency | GetRecords.Latency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_get_records_records | GetRecords.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_get_records_success | GetRecords.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_incoming_bytes | IncomingBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of incoming bytes written to the stream. |
| aws_kinesis_incoming_records | IncomingRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total number of records being written to the stream. |
| aws_kinesis_iterator_age_milliseconds | IteratorAgeMilliseconds | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the age of the iterator used in GetRecords, measured in milliseconds. |
| aws_kinesis_outgoing_bytes | OutgoingBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of outgoing bytes from the stream. |
| aws_kinesis_outgoing_records | OutgoingRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total number of outgoing records from the stream. |
| aws_kinesis_put_record_bytes | PutRecord.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_put_record_latency | PutRecord.Latency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_put_record_success | PutRecord.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_put_records_bytes | PutRecords.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_put_records_failed_records | PutRecords.FailedRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_put_records_latency | PutRecords.Latency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_put_records_records | PutRecords.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_put_records_success | PutRecords.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_put_records_successful_records | PutRecords.SuccessfulRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_put_records_throttled_records | PutRecords.ThrottledRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_put_records_total_records | PutRecords.TotalRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_read_provisioned_throughput_exceeded | ReadProvisionedThroughputExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of times read requests exceeded the provisioned throughput. |
| aws_kinesis_subscribe_to_shard_rate_exceeded | SubscribeToShard.RateExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_subscribe_to_shard_success | SubscribeToShard.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_subscribe_to_shard_event_bytes | SubscribeToShardEvent.Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_subscribe_to_shard_event_millis_behind_latest | SubscribeToShardEvent.MillisBehindLatest | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_subscribe_to_shard_event_records | SubscribeToShardEvent.Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_subscribe_to_shard_event_success | SubscribeToShardEvent.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesis_write_provisioned_throughput_exceeded | WriteProvisionedThroughputExceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of times write operations exceeded the provisioned throughput limits. |
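
Count-style metrics such as IncomingRecords are often easier to read as per-second rates. The sketch below shows the arithmetic, assuming the Sum statistic covers exactly one 5-minute scrape interval. That alignment is an assumption, and the helper names are hypothetical.

```python
# "Scrape Intervals: 5 minutes", as listed for this service.
SCRAPE_INTERVAL_SECONDS = 5 * 60


def per_second_rate(period_sum: float,
                    period_seconds: int = SCRAPE_INTERVAL_SECONDS) -> float:
    """Convert a Sum collected over one period into an average per-second rate."""
    if period_seconds <= 0:
        raise ValueError("period_seconds must be positive")
    return period_sum / period_seconds


def milliseconds_to_seconds(value_ms: float) -> float:
    """Convert a millisecond metric, such as IteratorAgeMilliseconds, to seconds."""
    return value_ms / 1000.0


# 1,500,000 records summed over one 5-minute period average 5,000 records per second.
print(per_second_rate(1_500_000))
# An iterator age of 90,000 ms means the consumer is 90 seconds behind.
print(milliseconds_to_seconds(90_000))
```

If the CloudWatch period differs from the scrape interval, pass the actual period as `period_seconds`.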

AWS/KinesisAnalytics

  • Scrape Intervals: 5 minutes

| Metric | CloudWatch Metric | Statistics | Description |
|---|---|---|---|
| aws_kinesisanalytics_info | | | |
| aws_kinesisanalytics_bytes | Bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total amount of data processed by Kinesis Analytics. |
| aws_kinesisanalytics_input_processing_dropped_records | InputProcessing.DroppedRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesisanalytics_input_processing_duration | InputProcessing.Duration | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesisanalytics_input_processing_ok_bytes | InputProcessing.OkBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesisanalytics_input_processing_ok_records | InputProcessing.OkRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesisanalytics_input_processing_processing_failed_records | InputProcessing.ProcessingFailedRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesisanalytics_input_processing_success | InputProcessing.Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesisanalytics_kpus | KPUs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesisanalytics_lambda_delivery_delivery_failed_records | LambdaDelivery.DeliveryFailedRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesisanalytics_lambda_delivery_duration | LambdaDelivery.Duration | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesisanalytics_lambda_delivery_ok_records | LambdaDelivery.OkRecords | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesisanalytics_millis_behind_latest | MillisBehindLatest | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks how far, in milliseconds, Kinesis Analytics is behind the latest record. |
| aws_kinesisanalytics_records | Records | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total number of records processed by Kinesis Analytics. |
| aws_kinesisanalytics_success | Success | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the success rate of all Kinesis Analytics operations. |
| aws_kinesisanalytics_back_pressured_time_ms_per_second | backPressuredTimeMsPerSecond | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the time, in milliseconds per second, that Kinesis Analytics was back-pressured. |
| aws_kinesisanalytics_busy_time_ms_per_second | busyTimeMsPerSecond | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the time, in milliseconds per second, that Kinesis Analytics spent in a busy state, processing data. |
| aws_kinesisanalytics_bytes_requested_per_fetch | bytesRequestedPerFetch | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of bytes requested in each fetch operation. |
| aws_kinesisanalytics_bytes_consumed_rate | bytes_consumed_rate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the rate at which bytes are consumed from the stream. |
| aws_kinesisanalytics_commits_failed | commitsFailed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of failed commit operations. |
| aws_kinesisanalytics_commits_succeeded | commitsSucceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of successful commit operations. |
| aws_kinesisanalytics_committedoffsets | committedoffsets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the committed offsets of records processed. |
| aws_kinesisanalytics_container_cpuutilization | containerCPUUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the CPU utilization of the Kinesis Analytics container. |
| aws_kinesisanalytics_container_disk_utilization | containerDiskUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the disk utilization of the Kinesis Analytics container. |
| aws_kinesisanalytics_container_memory_utilization | containerMemoryUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the memory utilization of the Kinesis Analytics container. |
| aws_kinesisanalytics_cpu_utilization | cpuUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the overall CPU utilization of Kinesis Analytics. |
| aws_kinesisanalytics_current_input_watermark | currentInputWatermark | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the current watermark for input data. |
| aws_kinesisanalytics_current_output_watermark | currentOutputWatermark | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the current watermark for output data. |
| aws_kinesisanalytics_currentoffsets | currentoffsets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the current offsets for processed records. |
| aws_kinesisanalytics_downtime | downtime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total downtime of the Kinesis Analytics application. |
| aws_kinesisanalytics_full_restarts | fullRestarts | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of full restarts of the Kinesis Analytics application. |
| aws_kinesisanalytics_heap_memory_utilization | heapMemoryUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the heap memory utilization. |
| aws_kinesisanalytics_idle_time_ms_per_second | idleTimeMsPerSecond | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the idle time of Kinesis Analytics in milliseconds per second. |
| aws_kinesisanalytics_last_checkpoint_duration | lastCheckpointDuration | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the duration of the last checkpoint process. |
| aws_kinesisanalytics_last_checkpoint_size | lastCheckpointSize | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the size of the last checkpoint. |
| aws_kinesisanalytics_managed_memory_total | managedMemoryTotal | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total managed memory available. |
| aws_kinesisanalytics_managed_memory_used | managedMemoryUsed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of managed memory currently in use. |
| aws_kinesisanalytics_managed_memory_utilization | managedMemoryUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the utilization of managed memory. |
| aws_kinesisanalytics_num_late_records_dropped | numLateRecordsDropped | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of late records dropped by Kinesis Analytics. |
| aws_kinesisanalytics_num_records_in | numRecordsIn | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of records ingested by Kinesis Analytics. |
| aws_kinesisanalytics_num_records_in_per_second | numRecordsInPerSecond | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the rate of incoming records per second. |
| aws_kinesisanalytics_num_records_out | numRecordsOut | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of records output by Kinesis Analytics. |
| aws_kinesisanalytics_num_records_out_per_second | numRecordsOutPerSecond | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the rate of outgoing records per second. |
| aws_kinesisanalytics_number_of_failed_checkpoints | numberOfFailedCheckpoints | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of failed checkpoints in Kinesis Analytics. |
| aws_kinesisanalytics_old_generation_gccount | oldGenerationGCCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the count of garbage collection events in the old generation heap space. |
| aws_kinesisanalytics_old_generation_gctime | oldGenerationGCTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the time spent in garbage collection for the old generation heap. |
| aws_kinesisanalytics_records_lag_max | records_lag_max | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the maximum lag of records being processed by Kinesis Analytics. |
| aws_kinesisanalytics_thread_count | threadCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of active threads in the Kinesis Analytics application. |
| aws_kinesisanalytics_uptime | uptime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_kinesisanalytics_zeppelin_cpu_utilization | zeppelinCpuUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the CPU utilization of the Zeppelin server used by Kinesis Analytics. |
| aws_kinesisanalytics_zeppelin_heap_memory_utilization | zeppelinHeapMemoryUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the heap memory utilization of the Zeppelin server. |
| aws_kinesisanalytics_zeppelin_server_uptime | zeppelinServerUptime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the uptime of the Zeppelin server. |
aws_kinesisanalytics_zeppelin_thread_countzeppelinThreadCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of active threads in the Zeppelin server.
aws_kinesisanalytics_zeppelin_waiting_jobszeppelinWaitingJobsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of jobs waiting to be processed by the Zeppelin server.
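
The Metric column in these tables appears to be derived mechanically from the service namespace and the CloudWatch metric name. The following is a minimal sketch of that mapping, inferred only from the rows listed on this page; the function names `to_snake` and `prometheus_name` are hypothetical and are not part of any Grafana or AWS API, and the actual exporter may handle edge cases differently.

```python
import re

# Assumption inferred from the tables: an underscore is inserted only where a
# lowercase letter or digit is followed by an uppercase letter, so runs of
# capitals such as "GC" or "IOPS" stay glued to the word that follows them.
_BOUNDARY = re.compile(r"([a-z0-9])([A-Z])")


def to_snake(cloudwatch_name: str) -> str:
    """Convert a CloudWatch metric name to the lowercase form used in the Metric column."""
    return _BOUNDARY.sub(r"\1_\2", cloudwatch_name).lower()


def prometheus_name(namespace: str, cloudwatch_name: str) -> str:
    """Join the lowercased namespace (with "/" replaced by "_") and the converted metric name."""
    prefix = namespace.lower().replace("/", "_")
    return f"{prefix}_{to_snake(cloudwatch_name)}"


if __name__ == "__main__":
    print(prometheus_name("AWS/KinesisAnalytics", "oldGenerationGCCount"))
    print(prometheus_name("AWS/MWAA", "VolumeWriteIOPS"))
    print(prometheus_name("AWS/Lambda", "ConcurrentExecutions"))
```

Note that the namespace is lowercased as a whole (`AWS/KinesisAnalytics` becomes `aws_kinesisanalytics`), while the metric name is split at lowercase-to-uppercase boundaries, which is why `oldGenerationGCCount` ends in `gccount` rather than `gc_count`.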

AWS/Lambda

  • Scrape Intervals: 5 minutes
  • Out-of-the-box dashboard included
| Metric | CloudWatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_lambda_info | | | |
| aws_lambda_invocations | Invocations | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of times your AWS Lambda function is invoked. |
| aws_lambda_errors | Errors | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of invocations that result in an error. |
| aws_lambda_throttles | Throttles | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of times your Lambda function is throttled due to exceeding the concurrency limit. |
| aws_lambda_duration | Duration | Maximum, Minimum, p90, Sum, SampleCount, Average, p50, p75, p95, p99, p99.5, p99.9 | Tracks the amount of time a Lambda function takes to execute. |
| aws_lambda_async_event_age | AsyncEventAge | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_async_events_dropped | AsyncEventsDropped | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_async_events_received | AsyncEventsReceived | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_claimed_account_concurrency | ClaimedAccountConcurrency | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_concurrent_executions | ConcurrentExecutions | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_dead_letter_errors | DeadLetterErrors | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_deleted_event_count | DeletedEventCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_destination_delivery_failures | DestinationDeliveryFailures | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_dropped_event_count | DroppedEventCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_failed_invoke_event_count | FailedInvokeEventCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_filtered_out_event_count | FilteredOutEventCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_invoked_event_count | InvokedEventCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_iterator_age | IteratorAge | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_offset_lag | OffsetLag | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_on_failure_destination_delivered_event_count | OnFailureDestinationDeliveredEventCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_oversized_record_count | OversizedRecordCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_polled_event_count | PolledEventCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_post_runtime_extensions_duration | PostRuntimeExtensionsDuration | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_provisioned_concurrency_invocations | ProvisionedConcurrencyInvocations | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_provisioned_concurrency_spillover_invocations | ProvisionedConcurrencySpilloverInvocations | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_provisioned_concurrency_utilization | ProvisionedConcurrencyUtilization | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_provisioned_concurrent_executions | ProvisionedConcurrentExecutions | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_provisioned_pollers | ProvisionedPollers | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_recursive_invocations_dropped | RecursiveInvocationsDropped | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_lambda_unreserved_concurrent_executions | UnreservedConcurrentExecutions | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | |
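
The Invocations, Errors, and Throttles rows above are commonly combined into ratios instead of being read on their own. The following is a minimal sketch of that arithmetic, assuming the Sum statistic of each metric over the same interval. The helper `safe_ratio` and the sample values are hypothetical, and the sketch assumes that throttled requests are not counted in Invocations, so the throttle share is taken against invocations plus throttles.

```python
def safe_ratio(numerator: float, denominator: float) -> float:
    """Return numerator / denominator, or 0.0 when the denominator is not positive."""
    if denominator <= 0:
        return 0.0
    return numerator / denominator


# Hypothetical Sum values for one 5-minute interval (not real data).
invocations_sum = 1200.0
errors_sum = 18.0
throttles_sum = 6.0

# Share of invocations that ended in a function error.
error_rate = safe_ratio(errors_sum, invocations_sum)

# Share of all attempted requests that were throttled (assumption: throttled
# requests are not included in the Invocations count).
throttle_share = safe_ratio(throttles_sum, invocations_sum + throttles_sum)

if __name__ == "__main__":
    print(f"error rate: {error_rate:.2%}")
    print(f"throttle share: {throttle_share:.2%}")
```

Guarding the division matters for idle functions, where all three sums can be zero for an interval.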

AWS/Logs

  • Scrape Intervals: 5 minutes
| Metric | CloudWatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_logs_info | | | |
| aws_logs_delivery_errors | DeliveryErrors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of errors that occurred while attempting to deliver log data to the CloudWatch Logs destination. |
| aws_logs_delivery_throttling | DeliveryThrottling | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of times log delivery was throttled due to exceeding the delivery limits. |
| aws_logs_forwarded_bytes | ForwardedBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total volume of log data in bytes that was successfully forwarded to the CloudWatch Logs destination. |
| aws_logs_forwarded_log_events | ForwardedLogEvents | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of log events successfully forwarded to the CloudWatch Logs destination. |
| aws_logs_incoming_bytes | IncomingBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total volume of incoming log data in bytes received by CloudWatch Logs. |
| aws_logs_incoming_log_events | IncomingLogEvents | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of log events received by CloudWatch Logs. |

AWS/MWAA

  • Scrape Intervals: 5 minutes
| Metric | CloudWatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_mwaa_active_connection_count | ActiveConnectionCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of active connections to the Managed Workflows for Apache Airflow (MWAA) environment. |
| aws_mwaa_approximate_age_of_oldest_task | ApproximateAgeOfOldestTask | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the approximate age of the oldest task in the MWAA environment's queue. |
| aws_mwaa_cpuutilization | CPUUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the percentage of CPU utilization in the MWAA environment. |
| aws_mwaa_database_connections | DatabaseConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of connections to the database used by MWAA. |
| aws_mwaa_disk_queue_depth | DiskQueueDepth | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the depth of the disk queue, indicating the number of I/O operations waiting to be processed. |
| aws_mwaa_freeable_memory | FreeableMemory | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the amount of free memory available in the MWAA environment. |
| aws_mwaa_memory_utilization | MemoryUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the percentage of memory utilized in the MWAA environment. |
| aws_mwaa_queued_tasks | QueuedTasks | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of tasks waiting to be executed in the MWAA environment. |
| aws_mwaa_running_tasks | RunningTasks | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of tasks currently running in the MWAA environment. |
| aws_mwaa_volume_write_iops | VolumeWriteIOPS | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the input/output operations per second (IOPS) for write operations on the volume. |
| aws_mwaa_write_iops | WriteIOPS | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of write operations per second in the MWAA environment. |
| aws_mwaa_write_latency | WriteLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the latency of write operations in the MWAA environment. |
| aws_mwaa_write_throughput | WriteThroughput | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the amount of data written per second in the MWAA environment. |

AWS/MediaConnect

  • Scrape Intervals: 5 minutes
| Metric | CloudWatch Metric | Statistics | Description |
| --- | --- | --- | --- |
| aws_mediaconnect_info | | | |
| aws_mediaconnect_arqrecovered | ARQRecovered | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of Automatic Repeat reQuest (ARQ) packets successfully recovered in the MediaConnect flow. |
| aws_mediaconnect_arqrequests | ARQRequests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of ARQ requests made by MediaConnect flows. |
| aws_mediaconnect_bit_rate | BitRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the bitrate of the MediaConnect stream. |
| aws_mediaconnect_caterror | CATError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Detects Conditional Access Table (CAT) errors in the MediaConnect stream. |
| aws_mediaconnect_crcerror | CRCError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of cyclic redundancy check (CRC) errors in the stream. |
| aws_mediaconnect_connected | Connected | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the connection status of the MediaConnect flow. |
| aws_mediaconnect_connected_outputs | ConnectedOutputs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of outputs connected to the MediaConnect flow. |
| aws_mediaconnect_connection_attempts | ConnectionAttempts | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of attempts made to establish a connection for the flow. |
| aws_mediaconnect_consecutive_drops | ConsecutiveDrops | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of consecutive dropped packets in the MediaConnect flow. |
| aws_mediaconnect_consecutive_not_recovered | ConsecutiveNotRecovered | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of consecutive packets that were not successfully recovered. |
| aws_mediaconnect_continuity_counter | ContinuityCounter | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the continuity counter of the stream to detect missing packets. |
| aws_mediaconnect_disconnections | Disconnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of times the MediaConnect flow was disconnected. |
| aws_mediaconnect_dropped_packets | DroppedPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of packets dropped in the MediaConnect flow. |
| aws_mediaconnect_egress_bridge_bit_rate | EgressBridgeBitRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the bitrate for egress bridge flows. |
| aws_mediaconnect_egress_bridge_caterror | EgressBridgeCATError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Detects CAT errors in egress bridge flows. |
| aws_mediaconnect_egress_bridge_crcerror | EgressBridgeCRCError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the CRC errors in egress bridge flows. |
| aws_mediaconnect_egress_bridge_continuity_counter | EgressBridgeContinuityCounter | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the continuity of the egress bridge stream to detect missing packets. |
| aws_mediaconnect_egress_bridge_dropped_packets | EgressBridgeDroppedPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of packets dropped in the egress bridge flows. |
| aws_mediaconnect_egress_bridge_failover_switches | EgressBridgeFailoverSwitches | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors failover switches in the egress bridge flows. |
| aws_mediaconnect_egress_bridge_merge_active | EgressBridgeMergeActive | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Indicates if an egress bridge merge is active. |
| aws_mediaconnect_egress_bridge_not_recovered_packets | EgressBridgeNotRecoveredPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of packets that were not recovered in the egress bridge. |
| aws_mediaconnect_egress_bridge_paterror | EgressBridgePATError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Detects Program Association Table (PAT) errors in the egress bridge. |
| aws_mediaconnect_egress_bridge_pcraccuracy_error | EgressBridgePCRAccuracyError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors errors related to the accuracy of Program Clock Reference (PCR) in the egress bridge. |
| aws_mediaconnect_egress_bridge_pcrerror | EgressBridgePCRError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks PCR errors in the egress bridge. |
| aws_mediaconnect_egress_bridge_piderror | EgressBridgePIDError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors Packet Identifier (PID) errors in the egress bridge stream. |
| aws_mediaconnect_egress_bridge_pmterror | EgressBridgePMTError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Detects errors in the Program Map Table (PMT) in the egress bridge. |
| aws_mediaconnect_egress_bridge_ptserror | EgressBridgePTSError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks Presentation Time Stamp (PTS) errors in the egress bridge stream. |
| aws_mediaconnect_egress_bridge_packet_loss_percent | EgressBridgePacketLossPercent | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the percentage of packet loss in the egress bridge. |
| aws_mediaconnect_egress_bridge_recovered_packets | EgressBridgeRecoveredPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of recovered packets in the egress bridge stream. |
| aws_mediaconnect_egress_bridge_source_bit_rate | EgressBridgeSourceBitRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the bitrate of the source in the egress bridge. |
| aws_mediaconnect_egress_bridge_source_caterror | EgressBridgeSourceCATError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Detects CAT errors in the source of the egress bridge. |
| aws_mediaconnect_egress_bridge_source_crcerror | EgressBridgeSourceCRCError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks CRC errors in the source of the egress bridge. |
| aws_mediaconnect_egress_bridge_source_continuity_counter | EgressBridgeSourceContinuityCounter | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the continuity of the source stream in the egress bridge to detect missing packets. |
| aws_mediaconnect_egress_bridge_source_dropped_packets | EgressBridgeSourceDroppedPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of dropped packets in the source stream of the egress bridge. |
| aws_mediaconnect_egress_bridge_source_merge_active | EgressBridgeSourceMergeActive | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Indicates if the source merge is active in the egress bridge. |
| aws_mediaconnect_egress_bridge_source_merge_latency | EgressBridgeSourceMergeLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures latency during source merge in the egress bridge. |
| aws_mediaconnect_egress_bridge_source_not_recovered_packets | EgressBridgeSourceNotRecoveredPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of packets not recovered in the source of the egress bridge. |
| aws_mediaconnect_egress_bridge_source_paterror | EgressBridgeSourcePATError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Detects PAT errors in the source of the egress bridge. |
| aws_mediaconnect_egress_bridge_source_pcraccuracy_error | EgressBridgeSourcePCRAccuracyError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors errors in the accuracy of the PCR in the source of the egress bridge. |
| aws_mediaconnect_egress_bridge_source_pcrerror | EgressBridgeSourcePCRError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks PCR errors in the source stream of the egress bridge. |
| aws_mediaconnect_egress_bridge_source_piderror | EgressBridgeSourcePIDError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_egress_bridge_source_pmterror | EgressBridgeSourcePMTError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_egress_bridge_source_ptserror | EgressBridgeSourcePTSError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_egress_bridge_source_packet_loss_percent | EgressBridgeSourcePacketLossPercent | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_egress_bridge_source_recovered_packets | EgressBridgeSourceRecoveredPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_egress_bridge_source_tsbyte_error | EgressBridgeSourceTSByteError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_egress_bridge_source_tssync_loss | EgressBridgeSourceTSSyncLoss | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_egress_bridge_source_total_packets | EgressBridgeSourceTotalPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_egress_bridge_source_transport_error | EgressBridgeSourceTransportError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_egress_bridge_tsbyte_error | EgressBridgeTSByteError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_egress_bridge_tssync_loss | EgressBridgeTSSyncLoss | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_egress_bridge_total_packets | EgressBridgeTotalPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_egress_bridge_transport_error | EgressBridgeTransportError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_failover_switches | FailoverSwitches | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_bit_rate | IngressBridgeBitRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_caterror | IngressBridgeCATError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_crcerror | IngressBridgeCRCError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_continuity_counter | IngressBridgeContinuityCounter | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_dropped_packets | IngressBridgeDroppedPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_failover_switches | IngressBridgeFailoverSwitches | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_merge_active | IngressBridgeMergeActive | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_not_recovered_packets | IngressBridgeNotRecoveredPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_paterror | IngressBridgePATError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_pcraccuracy_error | IngressBridgePCRAccuracyError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_pcrerror | IngressBridgePCRError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_piderror | IngressBridgePIDError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_pmterror | IngressBridgePMTError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_ptserror | IngressBridgePTSError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_packet_loss_percent | IngressBridgePacketLossPercent | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_recovered_packets | IngressBridgeRecoveredPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_arqrecovered | IngressBridgeSourceARQRecovered | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_arqrequests | IngressBridgeSourceARQRequests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_bit_rate | IngressBridgeSourceBitRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_caterror | IngressBridgeSourceCATError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_crcerror | IngressBridgeSourceCRCError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_continuity_counter | IngressBridgeSourceContinuityCounter | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_dropped_packets | IngressBridgeSourceDroppedPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_fecpackets | IngressBridgeSourceFECPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_fecrecovered | IngressBridgeSourceFECRecovered | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_merge_active | IngressBridgeSourceMergeActive | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_merge_latency | IngressBridgeSourceMergeLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_not_recovered_packets | IngressBridgeSourceNotRecoveredPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_overflow_packets | IngressBridgeSourceOverflowPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_paterror | IngressBridgeSourcePATError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_pcraccuracy_error | IngressBridgeSourcePCRAccuracyError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_pcrerror | IngressBridgeSourcePCRError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_piderror | IngressBridgeSourcePIDError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_pmterror | IngressBridgeSourcePMTError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_ptserror | IngressBridgeSourcePTSError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_packet_loss_percent | IngressBridgeSourcePacketLossPercent | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_recovered_packets | IngressBridgeSourceRecoveredPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_round_trip_time | IngressBridgeSourceRoundTripTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_tsbyte_error | IngressBridgeSourceTSByteError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_tssync_loss | IngressBridgeSourceTSSyncLoss | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_total_packets | IngressBridgeSourceTotalPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_source_transport_error | IngressBridgeSourceTransportError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_tsbyte_error | IngressBridgeTSByteError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_tssync_loss | IngressBridgeTSSyncLoss | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_total_packets | IngressBridgeTotalPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ingress_bridge_transport_error | IngressBridgeTransportError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_jitter | Jitter | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_latency | Latency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_maintenance_canceled | MaintenanceCanceled | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_maintenance_failed | MaintenanceFailed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_maintenance_rescheduled | MaintenanceRescheduled | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_maintenance_scheduled | MaintenanceScheduled | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_maintenance_started | MaintenanceStarted | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_maintenance_succeeded | MaintenanceSucceeded | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_merge_active | MergeActive | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_merge_latency | MergeLatency | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_not_recovered_packets | NotRecoveredPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_output_connected | OutputConnected | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_output_disconnections | OutputDisconnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_output_dropped_payloads | OutputDroppedPayloads | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_output_late_payloads | OutputLatePayloads | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_output_total_bytes | OutputTotalBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_output_total_payloads | OutputTotalPayloads | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_overflow_packets | OverflowPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_paterror | PATError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_pcraccuracy_error | PCRAccuracyError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_pcrerror | PCRError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_piderror | PIDError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_pmterror | PMTError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_ptserror | PTSError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_packet_loss_percent | PacketLossPercent | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_recovered_packets | RecoveredPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_round_trip_time | RoundTripTime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_source_arqrecovered | SourceARQRecovered | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_source_arqrequests | SourceARQRequests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_source_bit_rate | SourceBitRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_source_caterror | SourceCATError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_source_crcerror | SourceCRCError | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_source_connected | SourceConnected | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_source_continuity_counter | SourceContinuityCounter | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_source_disconnections | SourceDisconnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_source_dropped_packets | SourceDroppedPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_source_dropped_payloads | SourceDroppedPayloads | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
| aws_mediaconnect_source_fecpackets | SourceFECPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | |
aws_mediaconnect_source_fecrecoveredSourceFECRecoveredAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_late_payloadsSourceLatePayloadsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_merge_activeSourceMergeActiveAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_merge_latencySourceMergeLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_merge_status_warn_mismatchSourceMergeStatusWarnMismatchAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_merge_status_warn_soloSourceMergeStatusWarnSoloAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_missing_packetsSourceMissingPacketsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_not_recovered_packetsSourceNotRecoveredPacketsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_overflow_packetsSourceOverflowPacketsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_paterrorSourcePATErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_pcraccuracy_errorSourcePCRAccuracyErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_pcrerrorSourcePCRErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_piderrorSourcePIDErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_pmterrorSourcePMTErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_ptserrorSourcePTSErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_packet_loss_percentSourcePacketLossPercentAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_recovered_packetsSourceRecoveredPacketsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_round_trip_timeSourceRoundTripTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_selectedSourceSelectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_tsbyte_errorSourceTSByteErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_tssync_lossSourceTSSyncLossAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_total_bytesSourceTotalBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_total_packetsSourceTotalPacketsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_total_payloadsSourceTotalPayloadsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_source_transport_errorSourceTransportErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_tsbyte_errorTSByteErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_tssync_lossTSSyncLossAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_total_packetsTotalPacketsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_transport_errorTransportErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_mediaconnect_uptimeUptimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9

AWS/MediaTailor

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_mediatailor_info | | |
aws_mediatailor_ad_decision_server_ads | AdDecisionServer.Ads | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_mediatailor_ad_decision_server_duration | AdDecisionServer.Duration | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_mediatailor_ad_decision_server_errors | AdDecisionServer.Errors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_mediatailor_ad_decision_server_fill_rate | AdDecisionServer.FillRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_mediatailor_ad_decision_server_timeouts | AdDecisionServer.Timeouts | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_mediatailor_ad_not_ready | AdNotReady | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Indicates the number of instances where ads were not ready to be served.
aws_mediatailor_avails_duration | Avails.Duration | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_mediatailor_avails_fill_rate | Avails.FillRate | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_mediatailor_avails_filled_duration | Avails.FilledDuration | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_mediatailor_get_manifest_errors | GetManifest.Errors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_mediatailor_origin_errors | Origin.Errors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_mediatailor_origin_timeouts | Origin.Timeouts | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 |

AWS/NATGateway

  • Scrape Intervals: 5 minutes
  • Out-of-the-box dashboard included
Metric | CloudWatch Metric | Statistics | Description
aws_natgateway_info | | |
aws_natgateway_active_connection_count | ActiveConnectionCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of active connections to the NAT Gateway.
aws_natgateway_bytes_in_from_destination | BytesInFromDestination | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of data received by the NAT Gateway from the destination (in bytes).
aws_natgateway_bytes_in_from_source | BytesInFromSource | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of data received by the NAT Gateway from the source (in bytes).
aws_natgateway_bytes_out_to_destination | BytesOutToDestination | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the data sent from the NAT Gateway to the destination (in bytes).
aws_natgateway_bytes_out_to_source | BytesOutToSource | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the data sent from the NAT Gateway to the source (in bytes).
aws_natgateway_connection_attempt_count | ConnectionAttemptCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Counts the number of attempts to establish a connection via the NAT Gateway.
aws_natgateway_connection_established_count | ConnectionEstablishedCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the successful establishment of connections through the NAT Gateway.
aws_natgateway_error_port_allocation | ErrorPortAllocation | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks errors related to port allocation failures in the NAT Gateway.
aws_natgateway_idle_timeout_count | IdleTimeoutCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Counts the number of times connections are closed due to idle timeouts on the NAT Gateway.
aws_natgateway_packets_drop_count | PacketsDropCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of packets dropped by the NAT Gateway.
aws_natgateway_packets_in_from_destination | PacketsInFromDestination | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of packets received by the NAT Gateway from the destination.
aws_natgateway_packets_in_from_source | PacketsInFromSource | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of packets received by the NAT Gateway from the source.
aws_natgateway_packets_out_to_destination | PacketsOutToDestination | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of packets sent from the NAT Gateway to the destination.
aws_natgateway_packets_out_to_source | PacketsOutToSource | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of packets sent from the NAT Gateway to the source.

AWS/Neptune

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_neptune_info | | |
aws_neptune_cpuutilization | CPUUtilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the percentage of CPU resources used by the Neptune database instance.
aws_neptune_cluster_replica_lag | ClusterReplicaLag | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the replication lag between the Neptune writer and reader nodes in milliseconds.
aws_neptune_cluster_replica_lag_maximum | ClusterReplicaLagMaximum | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the maximum replica lag during the monitored period.
aws_neptune_cluster_replica_lag_minimum | ClusterReplicaLagMinimum | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the minimum replica lag during the monitored period.
aws_neptune_engine_uptime | EngineUptime | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total uptime of the Neptune engine instance.
aws_neptune_free_local_storage | FreeLocalStorage | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the amount of local storage available on the Neptune instance.
aws_neptune_freeable_memory | FreeableMemory | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the amount of available memory on the Neptune instance.
aws_neptune_gremlin_errors | GremlinErrors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Counts the errors encountered in Gremlin queries.
aws_neptune_gremlin_http1xx | GremlinHttp1xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks HTTP 1xx responses for Gremlin queries.
aws_neptune_gremlin_http2xx | GremlinHttp2xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks HTTP 2xx (successful) responses for Gremlin queries.
aws_neptune_gremlin_http4xx | GremlinHttp4xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors HTTP 4xx (client error) responses for Gremlin queries.
aws_neptune_gremlin_http5xx | GremlinHttp5xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks HTTP 5xx (server error) responses for Gremlin queries.
aws_neptune_gremlin_requests | GremlinRequests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total number of Gremlin requests made.
aws_neptune_gremlin_requests_per_sec | GremlinRequestsPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the rate of Gremlin requests per second.
aws_neptune_gremlin_web_socket_available_connections | GremlinWebSocketAvailableConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks available WebSocket connections for Gremlin.
aws_neptune_gremlin_web_socket_client_errors | GremlinWebSocketClientErrors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors WebSocket client errors for Gremlin.
aws_neptune_gremlin_web_socket_server_errors | GremlinWebSocketServerErrors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors WebSocket server errors for Gremlin.
aws_neptune_gremlin_web_socket_success | GremlinWebSocketSuccess | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Counts successful WebSocket connections for Gremlin.
aws_neptune_http100 | Http100 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors HTTP 100 responses from the Neptune instance.
aws_neptune_http101 | Http101 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks HTTP 101 responses (Switching Protocols).
aws_neptune_http1xx | Http1xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks all HTTP 1xx responses for requests made to the Neptune instance.
aws_neptune_http200 | Http200 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks HTTP 200 (OK) responses.
aws_neptune_http2xx | Http2xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors all HTTP 2xx responses (successful requests).
aws_neptune_http400 | Http400 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks HTTP 400 (bad request) responses.
aws_neptune_http403 | Http403 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors HTTP 403 (forbidden) responses.
aws_neptune_http405 | Http405 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks HTTP 405 (method not allowed) responses.
aws_neptune_http413 | Http413 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks HTTP 413 (request entity too large) responses.
aws_neptune_http429 | Http429 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors HTTP 429 (too many requests) responses.
aws_neptune_http4xx | Http4xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks all HTTP 4xx (client error) responses.
aws_neptune_http500 | Http500 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors HTTP 500 (internal server error) responses.
aws_neptune_http501 | Http501 | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks HTTP 501 (not implemented) responses.
aws_neptune_http5xx | Http5xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors all HTTP 5xx (server error) responses.
aws_neptune_loader_errors | LoaderErrors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Counts errors encountered during bulk loader operations.
aws_neptune_loader_requests | LoaderRequests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks requests made to the bulk loader.
aws_neptune_network_receive_throughput | NetworkReceiveThroughput | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the network throughput for data received by the Neptune instance.
aws_neptune_network_throughput | NetworkThroughput | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total network throughput (incoming and outgoing) of the Neptune instance.
aws_neptune_network_transmit_throughput | NetworkTransmitThroughput | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the network throughput for data transmitted by the Neptune instance.
aws_neptune_sparql_errors | SparqlErrors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors errors encountered in SPARQL queries.
aws_neptune_sparql_http1xx | SparqlHttp1xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks HTTP 1xx responses for SPARQL queries.
aws_neptune_sparql_http2xx | SparqlHttp2xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks HTTP 2xx responses for SPARQL queries.
aws_neptune_sparql_http4xx | SparqlHttp4xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors HTTP 4xx responses for SPARQL queries.
aws_neptune_sparql_http5xx | SparqlHttp5xx | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks HTTP 5xx responses for SPARQL queries.
aws_neptune_sparql_requests | SparqlRequests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_neptune_sparql_requests_per_sec | SparqlRequestsPerSec | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the rate of SPARQL requests per second.
aws_neptune_status_errors | StatusErrors | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of status errors reported by the Neptune instance.
aws_neptune_status_requests | StatusRequests | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of status requests made to the Neptune instance.
aws_neptune_volume_bytes_used | VolumeBytesUsed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of storage used by the Neptune instance.
aws_neptune_volume_read_iops | VolumeReadIOPs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the read input/output operations per second on the Neptune instance’s volume.
aws_neptune_volume_write_iops | VolumeWriteIOPs | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the write input/output operations per second on the Neptune instance’s volume.

AWS/NetworkELB

  • Scrape Intervals: 5 minutes
  • Out-of-the-box dashboard included
Metric | CloudWatch Metric | Statistics | Description
aws_networkelb_info | | |
aws_networkelb_active_flow_count | ActiveFlowCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total number of active flow connections through the Network Load Balancer.
aws_networkelb_active_flow_count_tls | ActiveFlowCount_TLS | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of active flow connections through the Network Load Balancer that are using TLS.
aws_networkelb_client_tlsnegotiation_error_count | ClientTLSNegotiationErrorCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of client TLS negotiation errors, indicating issues with SSL/TLS handshakes.
aws_networkelb_consumed_lcus | ConsumedLCUs | Average, Sum, Maximum, Minimum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures Load Balancer Capacity Units (LCUs) consumed by the Network Load Balancer.
aws_networkelb_healthy_host_count | HealthyHostCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of healthy targets available to receive traffic.
aws_networkelb_new_flow_count | NewFlowCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of new flow connections established with the Network Load Balancer.
aws_networkelb_new_flow_count_tls | NewFlowCount_TLS | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of new flow connections using TLS.
aws_networkelb_processed_bytes | ProcessedBytes | Average, Sum, Maximum, Minimum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total amount of data processed by the Network Load Balancer.
aws_networkelb_target_tlsnegotiation_error_count | TargetTLSNegotiationErrorCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors TLS negotiation errors on the target side, indicating failed handshakes.
aws_networkelb_tcp_client_reset_count | TCP_Client_Reset_Count | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of TCP client resets, indicating client-initiated connection terminations.
aws_networkelb_tcp_target_reset_count | TCP_Target_Reset_Count | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors TCP resets initiated by the target, indicating failed connections.
aws_networkelb_un_healthy_host_count | UnHealthyHostCount | Sum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of targets marked as unhealthy by the load balancer.
aws_networkelb_active_flow_count_tcp | ActiveFlowCount_TCP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_active_flow_count_udp | ActiveFlowCount_UDP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_consumed_lcus_tcp | ConsumedLCUs_TCP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_consumed_lcus_tls | ConsumedLCUs_TLS | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_consumed_lcus_udp | ConsumedLCUs_UDP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_new_flow_count_tcp | NewFlowCount_TCP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_new_flow_count_udp | NewFlowCount_UDP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_peak_packets_per_second | PeakPacketsPerSecond | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_port_allocation_error_count | PortAllocationErrorCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_processed_bytes_tcp | ProcessedBytes_TCP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_processed_bytes_tls | ProcessedBytes_TLS | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_processed_bytes_udp | ProcessedBytes_UDP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_processed_packets | ProcessedPackets | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_security_group_blocked_flow_count_inbound_icmp | SecurityGroupBlockedFlowCount_Inbound_ICMP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_security_group_blocked_flow_count_inbound_tcp | SecurityGroupBlockedFlowCount_Inbound_TCP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_security_group_blocked_flow_count_inbound_udp | SecurityGroupBlockedFlowCount_Inbound_UDP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_security_group_blocked_flow_count_outbound_icmp | SecurityGroupBlockedFlowCount_Outbound_ICMP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_security_group_blocked_flow_count_outbound_tcp | SecurityGroupBlockedFlowCount_Outbound_TCP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_security_group_blocked_flow_count_outbound_udp | SecurityGroupBlockedFlowCount_Outbound_UDP | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_tcp_elb_reset_count | TCP_ELB_Reset_Count | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
aws_networkelb_unhealthy_routing_flow_count | UnhealthyRoutingFlowCount | Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9 |
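
The Metric column in these tables appears to follow a regular pattern derived from the namespace and the CloudWatch metric name. The sketch below reproduces that pattern as inferred from the rows on this page only; it is not taken from the exporter's source, and the helper name prometheus_metric_name is hypothetical. It may help when searching for the Prometheus name of a CloudWatch metric that you already know.

```python
import re

# Assumption inferred from the table rows: an underscore is inserted only
# between a lowercase letter or digit and the uppercase letter that follows.
# Runs of capitals therefore stay joined to the next word
# (ClientTLSNegotiation -> client_tlsnegotiation).
_CAMEL_BOUNDARY = re.compile(r"([a-z0-9])([A-Z])")


def prometheus_metric_name(namespace: str, cloudwatch_metric: str) -> str:
    """Hypothetical helper: rebuild the Metric column from the other names."""
    # The namespace is lowercased as a whole; "/" becomes "_" and words are
    # not split (AWS/NetworkELB -> aws_networkelb).
    prefix = re.sub(r"[^a-z0-9]+", "_", namespace.lower())

    name = _CAMEL_BOUNDARY.sub(r"\1_\2", cloudwatch_metric)
    # "%" is spelled out (EBSByteBalance% -> ebsbyte_balance_percent).
    name = name.replace("%", "_percent")
    # Any other separator, such as the "." in AdDecisionServer.Ads, becomes "_".
    name = re.sub(r"[^A-Za-z0-9_]+", "_", name)
    return f"{prefix}_{name.lower()}"


if __name__ == "__main__":
    for ns, metric in [
        ("AWS/NetworkELB", "ClientTLSNegotiationErrorCount"),
        ("AWS/MediaTailor", "AdDecisionServer.Ads"),
        ("AWS/RDS", "EBSByteBalance%"),
    ]:
        print(metric, "->", prometheus_metric_name(ns, metric))
```

Because the rule is inferred rather than documented, treat the table itself as authoritative if a generated name does not match a listed metric.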

AWS/NetworkFirewall

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_networkfirewall_info | | |
aws_networkfirewall_dropped_packets | DroppedPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of packets dropped by the Network Firewall, indicating blocked or failed traffic.
aws_networkfirewall_packets | Packets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total number of packets inspected by the Network Firewall.
aws_networkfirewall_passed_packets | PassedPackets | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of packets allowed through the Network Firewall, indicating successful traffic.
aws_networkfirewall_received_packet_count | ReceivedPacketCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of packets received by the Network Firewall for inspection.

AWS/PrivateLinkEndpoints

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_privatelinkendpoints_info | | |
aws_privatelinkendpoints_active_connections | ActiveConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of active connections through the PrivateLink endpoints.
aws_privatelinkendpoints_bytes_processed | BytesProcessed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the amount of data processed by the PrivateLink endpoints in bytes.
aws_privatelinkendpoints_new_connections | NewConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of new connections established through the PrivateLink endpoints.
aws_privatelinkendpoints_packets_dropped | PacketsDropped | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of packets dropped by the PrivateLink endpoints, which could indicate errors or network issues.
aws_privatelinkendpoints_rst_packets_received | RstPacketsReceived | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of reset (RST) packets received, which can indicate connection terminations.

AWS/PrivateLinkServices

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_privatelinkservices_info | | |
aws_privatelinkservices_active_connections | ActiveConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of active connections managed by the PrivateLink services.
aws_privatelinkservices_bytes_processed | BytesProcessed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total amount of data processed by the PrivateLink services in bytes.
aws_privatelinkservices_endpoints_count | EndpointsCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of PrivateLink service endpoints currently connected.
aws_privatelinkservices_new_connections | NewConnections | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of new connections established via the PrivateLink services.
aws_privatelinkservices_rst_packets_received | RstPacketsReceived | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of reset (RST) packets received, indicating terminated connections.

AWS/Prometheus

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_prometheus_info | | |
aws_prometheus_alert_manager_alerts_received | AlertManagerAlertsReceived | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of alerts received by the Prometheus Alert Manager.
aws_prometheus_alert_manager_notifications_failed | AlertManagerNotificationsFailed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of failed alert notifications sent by the Prometheus Alert Manager.
aws_prometheus_alert_manager_notifications_throttled | AlertManagerNotificationsThrottled | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of alert notifications throttled due to rate limits or other constraints.
aws_prometheus_discarded_samples | DiscardedSamples | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of discarded samples due to errors or incorrect data.
aws_prometheus_rule_evaluation_failures | RuleEvaluationFailures | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of failed rule evaluations in Prometheus.
aws_prometheus_rule_evaluations | RuleEvaluations | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total number of rule evaluations performed by Prometheus.
aws_prometheus_rule_group_iterations_missed | RuleGroupIterationsMissed | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of rule group evaluation iterations that were missed due to processing delays.

AWS/RDS

  • Scrape Intervals: 5 minutes
  • Out-of-the-box dashboard included
MetricCloudwatch MetricStatisticsDescription
aws_rds_info
aws_rds_cpuutilizationCPUUtilizationMaximum, Average, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the utilization of CPU resources by RDS instances.
aws_rds_database_connectionsDatabaseConnectionsSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of active database connections to RDS instances.
aws_rds_replica_lagReplicaLagAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the lag time between the master and replica databases.
aws_rds_freeable_memoryFreeableMemoryAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Indicates the available memory that can be used by the RDS instance.
aws_rds_free_storage_spaceFreeStorageSpaceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Shows the remaining storage space available on the RDS instance.
aws_rds_free_storage_space_log_volumeFreeStorageSpaceLogVolumeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_swap_usageSwapUsageAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the amount of swap space used by the RDS instance.
aws_rds_read_throughputReadThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the throughput for read operations from the database.
aws_rds_read_latencyReadLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Indicates the latency for read operations on the database.
aws_rds_read_iopsReadIOPSAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the input/output operations per second for reads on the RDS instance.
aws_rds_write_throughputWriteThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the throughput for write operations to the database.
aws_rds_write_latencyWriteLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Indicates the latency for write operations on the database.
aws_rds_write_iopsWriteIOPSAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the input/output operations per second for writes on the RDS instance.
aws_rds_burst_balanceBurstBalanceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the percentage of General Purpose SSD (gp2) burst-bucket I/O credits available.
aws_rds_ebsbyte_balance_percentEBSByteBalance%Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_ebsiobalance_percentEBSIOBalance%Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_dbloadDBLoadAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the database load on the instance.
aws_rds_dbload_cpuDBLoadCPUAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the portion of database load related to CPU usage.
aws_rds_dbload_non_cpuDBLoadNonCPUAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the portion of database load unrelated to CPU usage.
aws_rds_cpucredit_usageCPUCreditUsageAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_cpucredit_balanceCPUCreditBalanceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_acuutilizationACUUtilizationMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aborted_clientsAbortedClientsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_active_transactionsActiveTransactionsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_binlog_replica_lagAuroraBinlogReplicaLagMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_dmlrejected_master_fullAuroraDMLRejectedMasterFullMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_dmlrejected_writer_fullAuroraDMLRejectedWriterFullMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_estimated_shared_memory_bytesAuroraEstimatedSharedMemoryBytesMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_global_dbdata_transfer_bytesAuroraGlobalDBDataTransferBytesMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_global_dbprogress_lagAuroraGlobalDBProgressLagMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_global_dbrpolagAuroraGlobalDBRPOLagMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_global_dbreplicated_write_ioAuroraGlobalDBReplicatedWriteIOMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_global_dbreplication_lagAuroraGlobalDBReplicationLagMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_memory_health_stateAuroraMemoryHealthStateMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_memory_num_declined_sql_totalAuroraMemoryNumDeclinedSqlTotalMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_memory_num_kill_conn_totalAuroraMemoryNumKillConnTotalMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_memory_num_kill_query_totalAuroraMemoryNumKillQueryTotalMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_optimized_reads_cache_hit_ratioAuroraOptimizedReadsCacheHitRatioMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_replica_lagAuroraReplicaLagMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_replica_lag_maximumAuroraReplicaLagMaximumMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_replica_lag_minimumAuroraReplicaLagMinimumMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_slow_connection_handle_countAuroraSlowConnectionHandleCountMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_slow_handshake_countAuroraSlowHandshakeCountMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_aurora_volume_bytes_left_totalAuroraVolumeBytesLeftTotalMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_availability_percentageAvailabilityPercentageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_backtrack_change_records_creation_rateBacktrackChangeRecordsCreationRateMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_backtrack_change_records_storedBacktrackChangeRecordsStoredMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_backtrack_window_actualBacktrackWindowActualMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_backtrack_window_alertBacktrackWindowAlertMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_backup_retention_period_storage_usedBackupRetentionPeriodStorageUsedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_bin_log_disk_usageBinLogDiskUsageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_blocked_transactionsBlockedTransactionsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_buffer_cache_hit_ratioBufferCacheHitRatioMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_cpusurplus_credit_balanceCPUSurplusCreditBalanceMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_cpusurplus_credits_chargedCPUSurplusCreditsChargedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_checkpoint_lagCheckpointLagMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_client_connectionsClientConnectionsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_client_connections_closedClientConnectionsClosedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_client_connections_no_tlsClientConnectionsNoTLSMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_client_connections_receivedClientConnectionsReceivedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_client_connections_setup_failed_authClientConnectionsSetupFailedAuthMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_client_connections_setup_succeededClientConnectionsSetupSucceededMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_client_connections_tlsClientConnectionsTLSMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_commit_latencyCommitLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_commit_throughputCommitThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_connection_attemptsConnectionAttemptsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_dbload_relative_to_num_vcpusDBLoadRelativeToNumVCPUsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_ddllatencyDDLLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_ddlthroughputDDLThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_dmllatencyDMLLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_dmlthroughputDMLThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_database_connection_requestsDatabaseConnectionRequestsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_database_connection_requests_with_tlsDatabaseConnectionRequestsWithTLSMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_database_connections_borrow_latencyDatabaseConnectionsBorrowLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_database_connections_currently_borrowedDatabaseConnectionsCurrentlyBorrowedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_database_connections_currently_in_transactionDatabaseConnectionsCurrentlyInTransactionMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_database_connections_currently_session_pinnedDatabaseConnectionsCurrentlySessionPinnedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_database_connections_setup_failedDatabaseConnectionsSetupFailedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_database_connections_setup_succeededDatabaseConnectionsSetupSucceededMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_database_connections_with_tlsDatabaseConnectionsWithTLSMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_deadlocksDeadlocksMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_delete_latencyDeleteLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_delete_throughputDeleteThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_disk_queue_depthDiskQueueDepthMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_disk_queue_depth_log_volumeDiskQueueDepthLogVolumeMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_engine_uptimeEngineUptimeMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_failed_sqlserver_agent_jobs_countFailedSQLServerAgentJobsCountMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_free_ephemeral_storageFreeEphemeralStorageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_free_local_storageFreeLocalStorageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_insert_latencyInsertLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_insert_throughputInsertThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_login_failuresLoginFailuresMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_max_database_connections_allowedMaxDatabaseConnectionsAllowedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_maximum_used_transaction_idsMaximumUsedTransactionIDsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_network_receive_throughputNetworkReceiveThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_network_throughputNetworkThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_network_transmit_throughputNetworkTransmitThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_num_binary_log_filesNumBinaryLogFilesMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_oldest_replication_slot_lagOldestReplicationSlotLagMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_purge_boundaryPurgeBoundaryMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_purge_finished_pointPurgeFinishedPointMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_queriesQueriesMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_query_database_response_latencyQueryDatabaseResponseLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_query_requestsQueryRequestsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_query_requests_no_tlsQueryRequestsNoTLSMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_query_requests_tlsQueryRequestsTLSMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_query_response_latencyQueryResponseLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_to_aurora_postgre_sqlreplica_lagRDSToAuroraPostgreSQLReplicaLagMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_read_iopsephemeral_storageReadIOPSEphemeralStorageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_read_iopslog_volumeReadIOPSLogVolumeMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_read_latency_ephemeral_storageReadLatencyEphemeralStorageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_read_latency_log_volumeReadLatencyLogVolumeMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_read_throughput_ephemeral_storageReadThroughputEphemeralStorageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_read_throughput_log_volumeReadThroughputLogVolumeMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_replication_channel_lagReplicationChannelLagMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_replication_slot_disk_usageReplicationSlotDiskUsageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_result_set_cache_hit_ratioResultSetCacheHitRatioMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_rollback_segment_history_list_lengthRollbackSegmentHistoryListLengthMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_row_lock_timeRowLockTimeMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_select_latencySelectLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_select_throughputSelectThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_serverless_database_capacityServerlessDatabaseCapacityMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_snapshot_storage_usedSnapshotStorageUsedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_storage_network_receive_throughputStorageNetworkReceiveThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_storage_network_throughputStorageNetworkThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_storage_network_transmit_throughputStorageNetworkTransmitThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_sum_binary_log_sizeSumBinaryLogSizeMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_temp_storage_iopsTempStorageIOPSMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_temp_storage_throughputTempStorageThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_total_backup_storage_billedTotalBackupStorageBilledMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_transaction_logs_disk_usageTransactionLogsDiskUsageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_transaction_logs_generationTransactionLogsGenerationMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_truncate_finished_pointTruncateFinishedPointMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_update_latencyUpdateLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_update_throughputUpdateThroughputMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_volume_bytes_usedVolumeBytesUsedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_volume_read_iopsVolumeReadIOPsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_volume_write_iopsVolumeWriteIOPsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_write_iopsephemeral_storageWriteIOPSEphemeralStorageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_write_iopslog_volumeWriteIOPSLogVolumeMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_write_latency_ephemeral_storageWriteLatencyEphemeralStorageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_write_latency_log_volumeWriteLatencyLogVolumeMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_write_throughput_ephemeral_storageWriteThroughputEphemeralStorageMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_rds_write_throughput_log_volumeWriteThroughputLogVolumeMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9

AWS/Redshift

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_redshift_info
aws_redshift_cpuutilizationCPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks CPU utilization across Redshift clusters.
aws_redshift_commit_queue_lengthCommitQueueLengthAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of transactions waiting to commit.
aws_redshift_concurrency_scaling_active_clustersConcurrencyScalingActiveClustersAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of active concurrency scaling clusters.
aws_redshift_concurrency_scaling_secondsConcurrencyScalingSecondsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the seconds used by concurrency scaling clusters that are actively processing queries.
aws_redshift_database_connectionsDatabaseConnectionsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of database connections to the Redshift cluster.
aws_redshift_health_statusHealthStatusAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Provides health status of Redshift clusters.
aws_redshift_maintenance_modeMaintenanceModeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Indicates if the cluster is in maintenance mode.
aws_redshift_max_configured_concurrency_scaling_clustersMaxConfiguredConcurrencyScalingClustersAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the maximum number of concurrency scaling clusters configured.
aws_redshift_network_receive_throughputNetworkReceiveThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the network throughput for receiving data.
aws_redshift_network_transmit_throughputNetworkTransmitThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the network throughput for transmitting data.
aws_redshift_num_exceeded_schema_quotasNumExceededSchemaQuotasAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks how often schema quotas have been exceeded.
aws_redshift_percentage_disk_space_usedPercentageDiskSpaceUsedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Shows the percentage of disk space used by the cluster.
aws_redshift_percentage_quota_usedPercentageQuotaUsedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the percentage of disk or storage space used relative to the configured schema quota.
aws_redshift_queries_completed_per_secondQueriesCompletedPerSecondAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of queries completed per second.
aws_redshift_query_durationQueryDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the duration of queries.
aws_redshift_query_runtime_breakdownQueryRuntimeBreakdownAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Provides a breakdown of the time spent on query execution.
aws_redshift_read_iopsReadIOPSAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures input/output operations per second for reads.
aws_redshift_read_latencyReadLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks latency for read operations.
aws_redshift_read_throughputReadThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures throughput for read operations.
aws_redshift_schema_quotaSchemaQuotaAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Shows the configured quota for a schema.
aws_redshift_storage_usedStorageUsedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Shows the amount of storage used by the Redshift cluster.
aws_redshift_total_table_countTotalTableCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total number of tables in the cluster.
aws_redshift_wlmqueries_completed_per_secondWLMQueriesCompletedPerSecondAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of queries completed per second in the Workload Management (WLM) queue.
aws_redshift_wlmquery_durationWLMQueryDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the duration of queries in the WLM queue.
aws_redshift_wlmqueue_lengthWLMQueueLengthAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the length of the WLM queue.
aws_redshift_wlmqueue_wait_timeWLMQueueWaitTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the wait time for queries in the WLM queue.
aws_redshift_wlmrunning_queriesWLMRunningQueriesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Shows the number of queries currently running in the WLM queue.
aws_redshift_write_iopsWriteIOPSAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures input/output operations per second for writes.
aws_redshift_write_latencyWriteLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks latency for write operations.
aws_redshift_write_throughputWriteThroughputAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures throughput for write operations.

AWS/Route53

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_route53_info
aws_route53_child_health_check_healthy_countChildHealthCheckHealthyCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the count of healthy child health checks.
aws_route53_connection_timeConnectionTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the time it takes to establish a connection.
aws_route53_dnsqueriesDNSQueriesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of DNS queries handled by Route 53.
aws_route53_health_check_percentage_healthyHealthCheckPercentageHealthyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Displays the percentage of healthy Route 53 health checks.
aws_route53_health_check_statusHealthCheckStatusAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Indicates the status of health checks, showing whether they are passing or failing.
aws_route53_sslhandshake_timeSSLHandshakeTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the time it takes to complete the SSL handshake.
aws_route53_time_to_first_byteTimeToFirstByteAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the time taken to receive the first byte of the response after a request is sent.

AWS/Route53Resolver

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_route53resolver_info
aws_route53resolver_inbound_query_volumeInboundQueryVolumeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the volume of DNS queries received by the Route 53 Resolver inbound endpoint.
aws_route53resolver_outbound_query_aggregated_volumeOutboundQueryAggregatedVolumeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total volume of outbound DNS queries across all outbound endpoints.
aws_route53resolver_outbound_query_volumeOutboundQueryVolumeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the volume of DNS queries sent by the Route 53 Resolver outbound endpoint.

AWS/S3

  • Scrape Intervals: 5 minutes
  • Out-of-the-box dashboard included
Metric | CloudWatch Metric | Statistics | Description
aws_s3_info
aws_s3_number_of_objectsNumberOfObjectsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of objects stored in an S3 bucket.
aws_s3_bucket_size_bytesBucketSizeBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total size of an S3 bucket in bytes.
aws_s3_all_requestsAllRequestsSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total number of all requests made to an S3 bucket.
aws_s3_4xx_errors4xxErrorsSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Counts the number of 4xx HTTP status code errors encountered during S3 requests.
aws_s3_total_request_latencyTotalRequestLatencyp95, Maximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p99, p99.5, p99.9Measures the total latency for S3 requests.
aws_s3_5xx_errors5xxErrorsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_bytes_downloadedBytesDownloadedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_bytes_pending_replicationBytesPendingReplicationMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_bytes_uploadedBytesUploadedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_delete_requestsDeleteRequestsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_first_byte_latencyFirstByteLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_get_requestsGetRequestsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_head_requestsHeadRequestsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_list_requestsListRequestsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_operations_failed_replicationOperationsFailedReplicationMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_operations_pending_replicationOperationsPendingReplicationMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_post_requestsPostRequestsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_put_requestsPutRequestsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_replication_latencyReplicationLatencyMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_select_requestsSelectRequestsMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_select_returned_bytesSelectReturnedBytesMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_s3_select_scanned_bytesSelectScannedBytesMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9

AWS/SES

  • Scrape Intervals: 5 minutes
  • Out-of-the-box dashboard included
Metric | CloudWatch Metric | Statistics | Description
aws_ses_bounceBounceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_complaintComplaintAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_deliveryDeliveryAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_rejectRejectAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_sendSendAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_clicksClicksMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_opensOpensMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_publish_expiredPublishExpiredMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_publish_failurePublishFailureMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_publish_successPublishSuccessMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_receivedReceivedMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_rendering_failuresRendering FailuresMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_reputation_bounce_rateReputation.BounceRateMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_ses_reputation_complaint_rateReputation.ComplaintRateMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9

AWS/SNS

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_sns_info
aws_sns_number_of_messages_publishedNumberOfMessagesPublishedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of messages published to SNS topics.
aws_sns_number_of_notifications_deliveredNumberOfNotificationsDeliveredAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of successfully delivered notifications.
aws_sns_number_of_notifications_failedNumberOfNotificationsFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failed notifications.
aws_sns_number_of_notifications_filtered_outNumberOfNotificationsFilteredOutAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the notifications that were filtered out based on the subscription’s filter policies.
aws_sns_number_of_notifications_filtered_out_invalid_attributesNumberOfNotificationsFilteredOut-InvalidAttributesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_sns_number_of_notifications_filtered_out_message_bodyNumberOfNotificationsFilteredOut-MessageBodyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_sns_number_of_notifications_filtered_out_no_message_attributesNumberOfNotificationsFilteredOut-NoMessageAttributesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9
aws_sns_publish_sizePublishSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the size of messages published to SNS topics.
aws_sns_smsmonth_to_date_spent_usdSMSMonthToDateSpentUSDAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the month-to-date costs incurred for sending SMS messages.
aws_sns_smssuccess_rateSMSSuccessRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the success rate of sending SMS messages via SNS.
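
The names in the Metric column appear to follow mechanically from the namespace and the CloudWatch metric name. Below is a minimal Python sketch of that mapping, inferred only from the rows listed on this page. The function name `to_metric_name` and the two rules it applies are assumptions for illustration, not the documented implementation: the namespace is lowercased with separators turned into underscores, and the metric name gets an underscore at each lowercase-or-digit to uppercase boundary before being lowercased.

```python
import re

# Boundary between a lowercase letter or digit and an uppercase letter,
# for example the "hT" in "MonthTo". Runs of capitals ("SMS", "CPU") are
# left unsplit, which matches names such as aws_sns_smssuccess_rate.
_BOUNDARY = re.compile(r"([a-z0-9])([A-Z])")

# Any run of characters that is not a letter or digit ("/", "-", ".", " ", "_").
_NON_ALNUM = re.compile(r"[^a-zA-Z0-9]+")


def to_metric_name(namespace: str, cloudwatch_metric: str) -> str:
    """Guess the Metric column value for a namespace and CloudWatch metric.

    Hypothetical helper: the rules are inferred from the tables on this
    page and may not cover every service.
    """
    # Namespace: no word splitting, only separator replacement and lowercasing.
    prefix = _NON_ALNUM.sub("_", namespace).lower()
    # Metric: split at lower/digit -> upper boundaries, then normalise separators.
    split = _BOUNDARY.sub(r"\1_\2", cloudwatch_metric)
    suffix = _NON_ALNUM.sub("_", split).lower()
    return f"{prefix}_{suffix}"


if __name__ == "__main__":
    print(to_metric_name("AWS/SNS", "SMSMonthToDateSpentUSD"))
    print(to_metric_name("AWS/SNS", "NumberOfNotificationsFilteredOut-InvalidAttributes"))
```

A helper like this can be handy for working out which series to query when you only know the CloudWatch metric name. Check the result against the table before relying on it.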

AWS/SQS

  • Scrape Intervals: 5 minutes
  • Out-of-the-box dashboard included
Metric | CloudWatch Metric | Statistics | Description
aws_sqs_info
aws_sqs_approximate_age_of_oldest_messageApproximateAgeOfOldestMessageAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the approximate age of the oldest message in the queue.
aws_sqs_approximate_number_of_messages_delayedApproximateNumberOfMessagesDelayedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the approximate number of messages currently delayed.
aws_sqs_approximate_number_of_messages_not_visibleApproximateNumberOfMessagesNotVisibleAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the approximate number of messages that are not visible to consumers due to being in flight.
aws_sqs_approximate_number_of_messages_visibleApproximateNumberOfMessagesVisibleAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the approximate number of messages currently visible to consumers.
aws_sqs_number_of_empty_receivesNumberOfEmptyReceivesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of receive requests that did not return any messages.
aws_sqs_number_of_messages_deletedNumberOfMessagesDeletedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of messages successfully deleted from the queue.
aws_sqs_number_of_messages_receivedNumberOfMessagesReceivedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of messages received from the queue.
aws_sqs_number_of_messages_sentNumberOfMessagesSentAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of messages successfully sent to the queue.
aws_sqs_sent_message_sizeSentMessageSizeSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the size of messages sent to the queue.
aws_sqs_approximate_number_of_groups_with_inflight_messagesApproximateNumberOfGroupsWithInflightMessagesMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9
aws_sqs_number_of_deduplicated_sent_messagesNumberOfDeduplicatedSentMessagesMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9

AWS/SageMaker

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_sagemaker_info
aws_sagemaker_invocation4_xxerrorsInvocation4XXErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the count of 4XX errors (client-side errors) during model invocations.
aws_sagemaker_invocation5_xxerrorsInvocation5XXErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the count of 5XX errors (server-side errors) during model invocations.
aws_sagemaker_invocation_model_errorsInvocationModelErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the errors specific to model invocations.
aws_sagemaker_invocationsInvocationsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Counts the number of InvokeEndpoint requests sent to a model endpoint.
aws_sagemaker_invocations_per_copyInvocationsPerCopyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of invocations per copy of the model.
aws_sagemaker_invocations_per_instanceInvocationsPerInstanceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of invocations per instance.
aws_sagemaker_model_cache_hitModelCacheHitAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks invocations for which the model was already loaded (a cache hit), avoiding model load time.
aws_sagemaker_model_downloading_timeModelDownloadingTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the time taken to download the model to the instance.
aws_sagemaker_model_latencyModelLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency of model invocations.
aws_sagemaker_model_loading_timeModelLoadingTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the time taken to load the model on the instance.
aws_sagemaker_model_loading_wait_timeModelLoadingWaitTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the wait time during the model loading process.
aws_sagemaker_model_setup_timeModelSetupTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the time taken to set up the model environment.
aws_sagemaker_model_unloading_timeModelUnloadingTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the time taken to unload the model from the instance.
aws_sagemaker_overhead_latencyOverheadLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks additional latency incurred due to overheads during the invocation process.

AWS/SageMaker/Endpoints

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_sagemaker_endpoints_info
aws_sagemaker_endpoints_cpureservationCPUReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of reserved CPU resources for SageMaker endpoints.
aws_sagemaker_endpoints_cpuutilizationCPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the actual CPU utilization by the SageMaker endpoint.
aws_sagemaker_endpoints_cpuutilization_normalizedCPUUtilizationNormalizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures normalized CPU utilization based on instance type and capacity.
aws_sagemaker_endpoints_disk_utilizationDiskUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the disk space utilization for SageMaker endpoints.
aws_sagemaker_endpoints_gpumemory_utilizationGPUMemoryUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the actual GPU memory utilization for endpoints using GPU instances.
aws_sagemaker_endpoints_gpumemory_utilization_normalizedGPUMemoryUtilizationNormalizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures normalized GPU memory utilization.
aws_sagemaker_endpoints_gpureservationGPUReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of reserved GPU resources for endpoints using GPU instances.
aws_sagemaker_endpoints_gpuutilizationGPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the actual GPU utilization by the SageMaker endpoint.
aws_sagemaker_endpoints_gpuutilization_normalizedGPUUtilizationNormalizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures normalized GPU utilization.
aws_sagemaker_endpoints_loaded_model_countLoadedModelCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of models currently loaded on the SageMaker endpoint.
aws_sagemaker_endpoints_memory_reservationMemoryReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of reserved memory for the SageMaker endpoint.
aws_sagemaker_endpoints_memory_utilizationMemoryUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the actual memory utilization by the SageMaker endpoint.

AWS/SageMaker/InferenceRecommendationsJobs

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_sagemaker_inferencerecommendationsjobs_info
aws_sagemaker_inferencerecommendationsjobs_client_invocation_errorsClientInvocationErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of errors encountered during client invocations for inference recommendations.
aws_sagemaker_inferencerecommendationsjobs_client_invocationsClientInvocationsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of client invocations of the inference recommendations job.
aws_sagemaker_inferencerecommendationsjobs_client_latencyClientLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the latency of client invocations during the inference recommendations job.
aws_sagemaker_inferencerecommendationsjobs_number_of_usersNumberOfUsersAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of users interacting with the inference recommendations job.

AWS/SageMaker/ModelBuildingPipeline

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_sagemaker_modelbuildingpipeline_info
aws_sagemaker_modelbuildingpipeline_execution_durationExecutionDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the duration of pipeline executions.
aws_sagemaker_modelbuildingpipeline_execution_failedExecutionFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of failed pipeline executions.
aws_sagemaker_modelbuildingpipeline_execution_startedExecutionStartedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Counts the number of started pipeline executions.
aws_sagemaker_modelbuildingpipeline_execution_stoppedExecutionStoppedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks pipeline executions that were stopped.
aws_sagemaker_modelbuildingpipeline_execution_succeededExecutionSucceededAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of successfully completed pipeline executions.
aws_sagemaker_modelbuildingpipeline_step_durationStepDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the duration of individual steps within the pipeline.
aws_sagemaker_modelbuildingpipeline_step_failedStepFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of failed steps within the pipeline.
aws_sagemaker_modelbuildingpipeline_step_startedStepStartedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Counts the number of steps started in the pipeline.
aws_sagemaker_modelbuildingpipeline_step_stoppedStepStoppedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the steps that were stopped within the pipeline.
aws_sagemaker_modelbuildingpipeline_step_succeededStepSucceededAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of successfully completed steps within the pipeline.

AWS/SageMaker/ProcessingJobs

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_sagemaker_processingjobs_info
aws_sagemaker_processingjobs_cpureservationCPUReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the amount of CPU resources reserved for processing jobs.
aws_sagemaker_processingjobs_cpuutilizationCPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the utilization of CPU resources during processing jobs.
aws_sagemaker_processingjobs_cpuutilization_normalizedCPUUtilizationNormalizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Provides normalized CPU utilization for easier comparison across different instance types.
aws_sagemaker_processingjobs_disk_utilizationDiskUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the disk utilization during the processing jobs.
aws_sagemaker_processingjobs_gpumemory_utilizationGPUMemoryUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks GPU memory usage during processing jobs.
aws_sagemaker_processingjobs_gpumemory_utilization_normalizedGPUMemoryUtilizationNormalizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Provides normalized GPU memory utilization for comparison across different instances.
aws_sagemaker_processingjobs_gpureservationGPUReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the amount of GPU resources reserved for processing jobs.
aws_sagemaker_processingjobs_gpuutilizationGPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the utilization of GPU resources during processing jobs.
aws_sagemaker_processingjobs_gpuutilization_normalizedGPUUtilizationNormalizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Provides normalized GPU utilization for easier cross-instance comparison.
aws_sagemaker_processingjobs_memory_reservationMemoryReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks memory resources reserved for processing jobs.
aws_sagemaker_processingjobs_memory_utilizationMemoryUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the utilization of memory resources during processing jobs.

AWS/SageMaker/TrainingJobs

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_sagemaker_trainingjobs_info
aws_sagemaker_trainingjobs_cpureservationCPUReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of CPU resources reserved for training jobs.
aws_sagemaker_trainingjobs_cpuutilizationCPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the CPU utilization during training jobs.
aws_sagemaker_trainingjobs_cpuutilization_normalizedCPUUtilizationNormalizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Provides normalized CPU utilization across different instance types.
aws_sagemaker_trainingjobs_disk_utilizationDiskUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the disk utilization during training jobs.
aws_sagemaker_trainingjobs_gpumemory_utilizationGPUMemoryUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks GPU memory utilization during training jobs.
aws_sagemaker_trainingjobs_gpumemory_utilization_normalizedGPUMemoryUtilizationNormalizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Provides normalized GPU memory utilization for comparison across different instances.
aws_sagemaker_trainingjobs_gpureservationGPUReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of GPU resources reserved for training jobs.
aws_sagemaker_trainingjobs_gpuutilizationGPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors GPU utilization during training jobs.
aws_sagemaker_trainingjobs_gpuutilization_normalizedGPUUtilizationNormalizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Provides normalized GPU utilization across different instances.
aws_sagemaker_trainingjobs_memory_reservationMemoryReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the amount of memory reserved for training jobs.
aws_sagemaker_trainingjobs_memory_utilizationMemoryUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the memory usage during training jobs.

AWS/SageMaker/TransformJobs

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_sagemaker_transformjobs_info
aws_sagemaker_transformjobs_cpureservationCPUReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the CPU resources reserved for transform jobs.
aws_sagemaker_transformjobs_cpuutilizationCPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the CPU utilization during transform jobs.
aws_sagemaker_transformjobs_cpuutilization_normalizedCPUUtilizationNormalizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Provides normalized CPU utilization across different instance types during transform jobs.
aws_sagemaker_transformjobs_disk_utilizationDiskUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors disk utilization during transform jobs.
aws_sagemaker_transformjobs_gpumemory_utilizationGPUMemoryUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks GPU memory utilization during transform jobs.
aws_sagemaker_transformjobs_gpumemory_utilization_normalizedGPUMemoryUtilizationNormalizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Provides normalized GPU memory utilization for comparison across different instances during transform jobs.
aws_sagemaker_transformjobs_gpureservationGPUReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the GPU resources reserved for transform jobs.
aws_sagemaker_transformjobs_gpuutilizationGPUUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors GPU utilization during transform jobs.
aws_sagemaker_transformjobs_gpuutilization_normalizedGPUUtilizationNormalizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Provides normalized GPU utilization across different instances during transform jobs.
aws_sagemaker_transformjobs_memory_reservationMemoryReservationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors memory resources reserved for transform jobs.
aws_sagemaker_transformjobs_memory_utilizationMemoryUtilizationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks memory usage during transform jobs.

AWS/Scheduler

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_scheduler_invocation_attempt_countInvocationAttemptCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of attempts made for invocations.
aws_scheduler_invocation_dropped_countInvocationDroppedCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the count of invocations that were dropped.
aws_scheduler_invocation_throttle_countInvocationThrottleCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Counts the number of invocations that were throttled due to exceeding limits.
aws_scheduler_invocations_failed_to_be_sent_to_dead_letter_countInvocationsFailedToBeSentToDeadLetterCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of invocations that failed to be sent to the dead letter queue.
aws_scheduler_invocations_sent_to_dead_letter_countInvocationsSentToDeadLetterCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Counts the number of invocations successfully sent to the dead letter queue.
aws_scheduler_invocations_sent_to_dead_letter_count_truncated_message_size_exceededInvocationsSentToDeadLetterCount_Truncated_MessageSizeExceededAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Counts the number of invocations sent to the dead letter queue with a truncated payload because the message exceeded the maximum allowed size.
aws_scheduler_target_error_countTargetErrorCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the count of errors encountered by the target.
aws_scheduler_target_error_throttled_countTargetErrorThrottledCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Counts the number of target errors caused by throttling.

AWS/States

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_states_info
aws_states_activities_failedActivitiesFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of failed activities.
aws_states_activities_heartbeat_timed_outActivitiesHeartbeatTimedOutAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors activities whose heartbeat timed out.
aws_states_activities_scheduledActivitiesScheduledAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of activities that have been scheduled.
aws_states_activities_startedActivitiesStartedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of activities that have started.
aws_states_activities_succeededActivitiesSucceededAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks successful activities.
aws_states_activities_timed_outActivitiesTimedOutAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of activities that timed out.
aws_states_activity_run_timeActivityRunTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the runtime of activities.
aws_states_activity_schedule_timeActivityScheduleTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the schedule time for activities.
aws_states_activity_timeActivityTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total time taken by an activity.
aws_states_consumed_capacityConsumedCapacityAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the consumed capacity for Step Functions.
aws_states_execution_throttledExecutionThrottledAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors throttled execution attempts.
aws_states_execution_timeExecutionTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total time taken by an execution.
aws_states_executions_abortedExecutionsAbortedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of executions that were aborted.
aws_states_executions_failedExecutionsFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of failed executions.
aws_states_executions_startedExecutionsStartedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of executions that started.
aws_states_executions_succeededExecutionsSucceededAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks successful executions.
aws_states_executions_timed_outExecutionsTimedOutAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors executions that timed out.
aws_states_express_execution_billed_durationExpressExecutionBilledDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the billed duration for Express Workflows.
aws_states_express_execution_billed_memoryExpressExecutionBilledMemoryAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the billed memory for Express Workflows.
aws_states_express_execution_memoryExpressExecutionMemoryAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the memory consumed by Express Workflows.
aws_states_lambda_function_run_timeLambdaFunctionRunTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the runtime of Lambda functions.
aws_states_lambda_function_schedule_timeLambdaFunctionScheduleTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the schedule time for Lambda functions.
aws_states_lambda_function_timeLambdaFunctionTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total time taken by Lambda functions.
aws_states_lambda_functions_failedLambdaFunctionsFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors Lambda functions that failed.
aws_states_lambda_functions_scheduledLambdaFunctionsScheduledAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of Lambda functions that were scheduled.
aws_states_lambda_functions_startedLambdaFunctionsStartedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks Lambda functions that have started.
aws_states_lambda_functions_succeededLambdaFunctionsSucceededAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures successful Lambda function executions.
aws_states_lambda_functions_timed_outLambdaFunctionsTimedOutAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors Lambda functions that timed out.
aws_states_provisioned_bucket_sizeProvisionedBucketSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the provisioned bucket size for Step Functions.
aws_states_provisioned_refill_rateProvisionedRefillRateAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the rate at which provisioned capacity is refilled.
aws_states_service_integration_run_timeServiceIntegrationRunTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the runtime of service integrations.
aws_states_service_integration_schedule_timeServiceIntegrationScheduleTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the schedule time for service integrations.
aws_states_service_integration_timeServiceIntegrationTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the total time taken by service integrations.
aws_states_service_integrations_failedServiceIntegrationsFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks failed service integrations.
aws_states_service_integrations_scheduledServiceIntegrationsScheduledAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of service integrations that were scheduled.
aws_states_service_integrations_startedServiceIntegrationsStartedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks service integrations that have started.
aws_states_service_integrations_succeededServiceIntegrationsSucceededAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors successful service integrations.
aws_states_service_integrations_timed_outServiceIntegrationsTimedOutAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures service integrations that timed out.
aws_states_throttled_eventsThrottledEventsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of events that were throttled.

AWS/StorageGateway

  • Scrape Intervals: 5 minutes
Metric | CloudWatch Metric | Statistics | Description
aws_storagegateway_info
aws_storagegateway_cache_freeCacheFreeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of free cache space in the gateway.
aws_storagegateway_cache_hit_percentCacheHitPercentAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the percentage of read operations served by the cache.
aws_storagegateway_cache_percent_dirtyCachePercentDirtyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the percentage of cache space that contains data that hasn’t been uploaded yet.
aws_storagegateway_cache_percent_usedCachePercentUsedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the percentage of used cache space.
aws_storagegateway_cache_usedCacheUsedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the amount of cache space used.
aws_storagegateway_cloud_bytes_downloadedCloudBytesDownloadedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of data downloaded from AWS to the gateway.
aws_storagegateway_cloud_bytes_uploadedCloudBytesUploadedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the amount of data uploaded from the gateway to AWS.
aws_storagegateway_cloud_download_latencyCloudDownloadLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the latency experienced during downloads from AWS.
aws_storagegateway_queued_writesQueuedWritesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of write operations queued in the gateway.
aws_storagegateway_read_bytesReadBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of data read by the gateway.
aws_storagegateway_read_timeReadTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the time spent on read operations.
aws_storagegateway_time_since_last_recovery_pointTimeSinceLastRecoveryPointAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the time since the last recovery point was created.
aws_storagegateway_total_cache_sizeTotalCacheSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total size of the cache.
aws_storagegateway_upload_buffer_freeUploadBufferFreeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of free space in the upload buffer.
aws_storagegateway_upload_buffer_percent_usedUploadBufferPercentUsedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the percentage of the upload buffer that is used.
aws_storagegateway_upload_buffer_usedUploadBufferUsedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the amount of upload buffer space used.
aws_storagegateway_working_storage_freeWorkingStorageFreeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the amount of free working storage in the gateway.
aws_storagegateway_working_storage_percent_usedWorkingStoragePercentUsedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the percentage of working storage used.
aws_storagegateway_working_storage_usedWorkingStorageUsedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the amount of working storage used.
aws_storagegateway_write_bytesWriteBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of data written by the gateway.
aws_storagegateway_write_timeWriteTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the time spent on write operations.

AWS/Timestream

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_timestream_data_scanned_bytesDataScannedBytesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total amount of data scanned by AWS Timestream during queries.
aws_timestream_successful_request_latencySuccessfulRequestLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the latency of successful requests sent to AWS Timestream.
aws_timestream_system_errorsSystemErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of system errors occurring in AWS Timestream.
aws_timestream_user_errorsUserErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of user-generated errors in AWS Timestream, such as invalid queries.

AWS/Transfer

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
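aws_transfer_bytes_inBytesInAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of bytes transferred into the server.
aws_transfer_bytes_outBytesOutAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total number of bytes transferred out of the server.
aws_transfer_files_inFilesInAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of files transferred into the server.
aws_transfer_files_outFilesOutAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total number of files transferred out of the server.
aws_transfer_inbound_failed_messageInboundFailedMessageAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of inbound messages that failed to be received.
aws_transfer_inbound_messageInboundMessageAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of inbound messages received.
aws_transfer_on_partial_upload_executions_failedOnPartialUploadExecutionsFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of workflow executions triggered by a partial upload that failed.
aws_transfer_on_partial_upload_executions_startedOnPartialUploadExecutionsStartedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of workflow executions started by a partial upload.
aws_transfer_on_partial_upload_executions_successOnPartialUploadExecutionsSuccessAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of workflow executions triggered by a partial upload that succeeded.
aws_transfer_on_upload_executions_failedOnUploadExecutionsFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of workflow executions triggered by an upload that failed.
aws_transfer_on_upload_executions_startedOnUploadExecutionsStartedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of workflow executions started by an upload.
aws_transfer_on_upload_executions_successOnUploadExecutionsSuccessAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of workflow executions triggered by an upload that succeeded.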

AWS/TransitGateway

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_transitgateway_info
aws_transitgateway_bytes_inBytesInAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of bytes received by the Transit Gateway.
aws_transitgateway_bytes_outBytesOutAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total number of bytes sent from the Transit Gateway.
aws_transitgateway_packet_drop_count_blackholePacketDropCountBlackholeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of packets dropped due to blackholing (unreachable routes).
aws_transitgateway_packet_drop_count_no_routePacketDropCountNoRouteAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of packets dropped due to no matching route found.
aws_transitgateway_packets_inPacketsInAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total number of packets received by the Transit Gateway.
aws_transitgateway_packets_outPacketsOutAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of packets sent from the Transit Gateway.

AWS/TrustedAdvisor

Note: This service only publishes metrics in specific AWS regions. Any job configured with this service gathers data only from the us-east-1 region.

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_trustedadvisor_green_checksGreenChecksAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of Trusted Advisor checks in the green (optimal) status.
aws_trustedadvisor_red_checksRedChecksAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of Trusted Advisor checks that indicate critical issues (red status).
aws_trustedadvisor_red_resourcesRedResourcesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of resources flagged as critical or failing (red status).
aws_trustedadvisor_service_limit_usageServiceLimitUsageAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the usage of service limits based on Trusted Advisor service limit checks.
aws_trustedadvisor_yellow_checksYellowChecksAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of checks that show warnings (yellow status).
aws_trustedadvisor_yellow_resourcesYellowResourcesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of resources flagged as warnings or requiring attention (yellow status).

AWS/Usage

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_usage_call_countCallCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of API or service calls made.
aws_usage_resource_countResourceCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of resources in use or allocated in the AWS environment.

AWS/VPN

  • Scrape Intervals: 5 minutes
  • Out of the box dashboard included
MetricCloudwatch MetricStatisticsDescription
aws_vpn_info
aws_vpn_tunnel_data_inTunnelDataInSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the amount of inbound data being transferred through the VPN tunnel. Helps track network traffic.
aws_vpn_tunnel_data_outTunnelDataOutSum, Maximum, Minimum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the amount of outbound data being transferred through the VPN tunnel. Useful for bandwidth monitoring.
aws_vpn_tunnel_stateTunnelStateMaximum, Minimum, Sum, SampleCount, Average, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the current status of the VPN tunnel (e.g., up or down). Helps in identifying tunnel connectivity issues.

AWS/WAFV2

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_wafv2_info
aws_wafv2_allowed_requestsAllowedRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of requests that are allowed by the WAF rules. Useful for monitoring legitimate traffic.
aws_wafv2_blocked_requestsBlockedRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of requests that are blocked by the WAF rules. Helps detect and prevent malicious traffic.
aws_wafv2_captcha_requestsCaptchaRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of requests that triggered a CAPTCHA challenge. Useful for tracking potential bot traffic.
aws_wafv2_captchas_attemptedCaptchasAttemptedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of CAPTCHA challenges that were attempted by users. Indicates user engagement with challenges.
aws_wafv2_captchas_solvedCaptchasSolvedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of CAPTCHA challenges successfully solved. Helps assess CAPTCHA effectiveness.
aws_wafv2_challenge_requestsChallengeRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of requests that triggered additional security challenges. Useful for advanced threat detection.
aws_wafv2_counted_requestsCountedRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of requests counted for rule evaluation but not necessarily blocked or allowed.
aws_wafv2_passed_requestsPassedRequestsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors requests that passed through the challenge phase and were allowed access.
aws_wafv2_requests_with_valid_captcha_tokenRequestsWithValidCaptchaTokenAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of requests with a valid CAPTCHA token. Useful for validating CAPTCHA implementation.
aws_wafv2_requests_with_valid_challenge_tokenRequestsWithValidChallengeTokenAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of requests with valid security challenge tokens. Helps track successful security checks.

AWS/WorkSpaces

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_workspaces_info
aws_workspaces_availableAvailableAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of available WorkSpaces. Useful for tracking the availability of WorkSpaces for users.
aws_workspaces_connection_attemptConnectionAttemptAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of connection attempts to WorkSpaces. Helps monitor user access and demand.
aws_workspaces_connection_failureConnectionFailureAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of failed connection attempts. Useful for identifying connectivity issues or failures.
aws_workspaces_connection_successConnectionSuccessAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful connections to WorkSpaces. Indicates the success rate of user connections.
aws_workspaces_in_session_latencyInSessionLatencyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the latency experienced by users during WorkSpaces sessions. Helps assess user experience quality.
aws_workspaces_maintenanceMaintenanceAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of WorkSpaces under maintenance. Useful for understanding maintenance impact on availability.
aws_workspaces_session_disconnectSessionDisconnectAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of session disconnections. Helps detect connectivity issues or user-initiated disconnects.
aws_workspaces_session_launch_timeSessionLaunchTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the time taken to launch a WorkSpaces session. Useful for assessing the performance of WorkSpaces launches.
aws_workspaces_stoppedStoppedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of WorkSpaces that are in the stopped state. Helps track WorkSpaces that are not running.
aws_workspaces_unhealthyUnhealthyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of unhealthy WorkSpaces. Useful for identifying potential issues with WorkSpaces health.
aws_workspaces_user_connectedUserConnectedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of users currently connected to WorkSpaces. Helps measure active user engagement.
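
The connection metrics above are counts, so a success rate has to be derived from them. The following is a minimal sketch of that arithmetic, assuming you have already fetched the Sum of ConnectionSuccess and ConnectionAttempt for the same time window; the function name and its inputs are hypothetical and not part of any Grafana or AWS API.

```python
from typing import Optional


def connection_success_ratio(successes: float, attempts: float) -> Optional[float]:
    """Return successes / attempts, or None when there were no attempts.

    Both arguments are assumed to be Sum values of ConnectionSuccess and
    ConnectionAttempt taken over the same time window.
    """
    if attempts <= 0:
        # No attempts in the window: the ratio is undefined, not zero.
        return None
    return successes / attempts


if __name__ == "__main__":
    # Hypothetical sample values, for illustration only.
    ratio = connection_success_ratio(45, 50)
    print(ratio)
```

Returning None for an empty window keeps "no traffic" distinct from "every connection failed", which matters if the ratio drives an alert.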

AmazonMWAA

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_amazonmwaa_info
aws_amazonmwaa_collect_dbdagsCollectDBDagsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the time taken to fetch serialized DAGs from the database.
aws_amazonmwaa_critical_section_busyCriticalSectionBusyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Counts the times a scheduler process tried to lock the critical section and found it held by another process.
aws_amazonmwaa_critical_section_durationCriticalSectionDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the duration for which critical sections remain busy.
aws_amazonmwaa_critical_section_query_durationCriticalSectionQueryDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the time spent querying within critical sections.
aws_amazonmwaa_dagdependency_checkDAGDependencyCheckAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors dependency checks between DAGs.
aws_amazonmwaa_dagduration_failedDAGDurationFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the duration of failed DAG runs.
aws_amazonmwaa_dagduration_successDAGDurationSuccessAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the duration of successful DAG runs.
aws_amazonmwaa_dagfile_processing_last_durationDAGFileProcessingLastDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the last processing time for DAG files.
aws_amazonmwaa_dagfile_processing_last_run_seconds_agoDAGFileProcessingLastRunSecondsAgoAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the time since the last DAG file processing run.
aws_amazonmwaa_dagfile_refresh_errorDAGFileRefreshErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors errors in refreshing DAG files.
aws_amazonmwaa_dagschedule_delayDAGScheduleDelayAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors delays in DAG scheduling.
aws_amazonmwaa_dag_bag_sizeDagBagSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the size of the DAG bag.
aws_amazonmwaa_dag_callback_exceptionsDagCallbackExceptionsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors exceptions occurring in DAG callbacks.
aws_amazonmwaa_exception_failuresExceptionFailuresAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of exception failures.
aws_amazonmwaa_executed_tasksExecutedTasksAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the total number of executed tasks.
aws_amazonmwaa_failed_celery_task_executionFailedCeleryTaskExecutionAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors failed task executions in Celery.
aws_amazonmwaa_failed_slacallbackFailedSLACallbackAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks failures in SLA callbacks.
aws_amazonmwaa_failed_slaemail_attemptsFailedSLAEmailAttemptsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors failed attempts to send SLA emails.
aws_amazonmwaa_file_path_queue_update_countFilePathQueueUpdateCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of file path queue updates.
aws_amazonmwaa_first_task_scheduling_delayFirstTaskSchedulingDelayAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the delay in scheduling the first task.
aws_amazonmwaa_import_errorsImportErrorsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors errors encountered during imports.
aws_amazonmwaa_infra_failuresInfraFailuresAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks infrastructure failures in the environment.
aws_amazonmwaa_job_endJobEndAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of jobs completed.
aws_amazonmwaa_job_heartbeat_failureJobHeartbeatFailureAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks heartbeat failures for jobs.
aws_amazonmwaa_job_startJobStartAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of jobs started.
aws_amazonmwaa_loaded_tasksLoadedTasksAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of tasks loaded in the environment.
aws_amazonmwaa_manager_stallsManagerStallsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of times the manager process stalls.
aws_amazonmwaa_open_slotsOpenSlotsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of open task slots.
aws_amazonmwaa_operator_failuresOperatorFailuresAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of operator task failures.
aws_amazonmwaa_operator_successesOperatorSuccessesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of operator task successes.
aws_amazonmwaa_orphanedOrphanedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors orphaned task instances.
aws_amazonmwaa_orphaned_tasks_adoptedOrphanedTasksAdoptedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of orphaned tasks adopted.
aws_amazonmwaa_orphaned_tasks_clearedOrphanedTasksClearedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of orphaned tasks cleared.
aws_amazonmwaa_other_callback_countOtherCallbackCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of other callbacks occurring in the environment.
aws_amazonmwaa_poked_exceptionsPokedExceptionsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of exceptions in poked tasks.
aws_amazonmwaa_poked_successPokedSuccessAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks successful pokes in tasks.
aws_amazonmwaa_poked_tasksPokedTasksAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of poked tasks.
aws_amazonmwaa_pool_deferred_slotsPoolDeferredSlotsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks deferred slots in task pools.
aws_amazonmwaa_pool_failuresPoolFailuresAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of task pool failures.
aws_amazonmwaa_pool_open_slotsPoolOpenSlotsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of open slots in the task pool.
aws_amazonmwaa_pool_queued_slotsPoolQueuedSlotsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of queued slots in the task pool.
aws_amazonmwaa_pool_running_slotsPoolRunningSlotsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of running slots in the task pool.
aws_amazonmwaa_pool_starving_tasksPoolStarvingTasksAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks tasks that are starving for resources in the task pool.
aws_amazonmwaa_processesProcessesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of processes running in the environment.
aws_amazonmwaa_processor_timeoutsProcessorTimeoutsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors timeouts in processors.
aws_amazonmwaa_queued_tasksQueuedTasksAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of tasks in the queue.
aws_amazonmwaa_running_tasksRunningTasksAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of running tasks in the environment.
aws_amazonmwaa_slamissedSLAMissedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of SLA misses in tasks.
aws_amazonmwaa_scheduler_heartbeatSchedulerHeartbeatAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the health of the scheduler through its heartbeat.
aws_amazonmwaa_scheduler_loop_durationSchedulerLoopDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the duration of scheduler loops.
aws_amazonmwaa_sla_callback_countSlaCallbackCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of SLA callbacks made.
aws_amazonmwaa_started_task_instancesStartedTaskInstancesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of started task instances.
aws_amazonmwaa_task_instance_created_using_operatorTaskInstanceCreatedUsingOperatorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of task instances created using an operator.
aws_amazonmwaa_task_instance_durationTaskInstanceDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the duration of task instances.
aws_amazonmwaa_task_instance_failuresTaskInstanceFailuresAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of task instance failures.
aws_amazonmwaa_task_instance_finishedTaskInstanceFinishedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of task instances that have finished.
aws_amazonmwaa_task_instance_previously_succeededTaskInstancePreviouslySucceededAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of task instances that have previously succeeded.
aws_amazonmwaa_task_instance_queued_durationTaskInstanceQueuedDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the time task instances spend in the queue before execution.
aws_amazonmwaa_task_instance_scheduled_durationTaskInstanceScheduledDurationAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks how long task instances spend in the scheduled state.
aws_amazonmwaa_task_instance_successesTaskInstanceSuccessesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of successful task instances.
aws_amazonmwaa_task_removed_from_dagTaskRemovedFromDAGAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors tasks that were removed from the DAG.
aws_amazonmwaa_task_restored_to_dagTaskRestoredToDAGAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks tasks that were restored to the DAG.
aws_amazonmwaa_task_timeout_errorTaskTimeoutErrorAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors timeout errors in tasks.
aws_amazonmwaa_tasks_executableTasksExecutableAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of executable tasks.
aws_amazonmwaa_tasks_killed_externallyTasksKilledExternallyAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks tasks that were killed externally.
aws_amazonmwaa_tasks_pendingTasksPendingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors pending tasks.
aws_amazonmwaa_tasks_runningTasksRunningAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of tasks currently running.
aws_amazonmwaa_tasks_starvingTasksStarvingAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of tasks starving for resources.
aws_amazonmwaa_tasks_without_dag_runTasksWithoutDagRunAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks tasks that are not associated with any DAG run.
aws_amazonmwaa_total_parse_timeTotalParseTimeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the total time spent parsing DAG files.
aws_amazonmwaa_trigger_heartbeatTriggerHeartbeatAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the heartbeat of task triggers.
aws_amazonmwaa_triggered_dag_runsTriggeredDagRunsAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the number of DAG runs triggered.
aws_amazonmwaa_triggers_blocked_main_threadTriggersBlockedMainThreadAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of triggers that blocked the main thread.
aws_amazonmwaa_triggers_failedTriggersFailedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors failed task triggers.
aws_amazonmwaa_triggers_runningTriggersRunningAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of running task triggers.
aws_amazonmwaa_triggers_succeededTriggersSucceededAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors successful task triggers.
aws_amazonmwaa_updatesUpdatesAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of updates made to DAGs and other configurations.
aws_amazonmwaa_zombies_killedZombiesKilledAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of zombie tasks killed.
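
The names in the Metric column appear to follow a consistent pattern: the namespace is lowercased with `/` replaced by `_` (and prefixed with `aws_` when it does not already start that way), and the CloudWatch metric name is split at each lowercase-to-uppercase boundary and lowercased. The sketch below reproduces that pattern as inferred from these tables only. The function name is hypothetical, the rule is an assumption rather than a documented algorithm, and it may not hold for metric names containing spaces or other special characters.

```python
import re

# Zero-width boundary between a lowercase letter or digit and an uppercase
# letter. Runs of capitals such as "SLA" or "DAG" therefore stay together.
_CAMEL_BOUNDARY = re.compile(r"(?<=[a-z0-9])(?=[A-Z])")


def listed_metric_name(namespace: str, cloudwatch_metric: str) -> str:
    """Derive the name shown in the Metric column (inferred rule, not official)."""
    prefix = namespace.lower().replace("/", "_")
    if not prefix.startswith("aws_"):
        # Namespaces such as "AmazonMWAA" have no "AWS/" part to reuse.
        prefix = "aws_" + prefix
    metric = _CAMEL_BOUNDARY.sub("_", cloudwatch_metric).lower()
    return f"{prefix}_{metric}"


if __name__ == "__main__":
    print(listed_metric_name("AWS/StorageGateway", "CacheHitPercent"))
    print(listed_metric_name("AmazonMWAA", "FailedSLACallback"))
    print(listed_metric_name("ECS/ContainerInsights", "instance_cpu_limit"))
```

This explains why acronyms collapse into the following word in names such as `aws_amazonmwaa_failed_slacallback`: the boundary is only detected where a lowercase character precedes an uppercase one.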

ECS/ContainerInsights

  • Scrape Intervals: 5 minutes
MetricCloudwatch MetricStatisticsDescription
aws_ecs_containerinsights_info
aws_ecs_containerinsights_container_instance_countContainerInstanceCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the number of container instances in a cluster.
aws_ecs_containerinsights_cpu_reservedCpuReservedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the amount of CPU reserved for tasks.
aws_ecs_containerinsights_cpu_utilizedCpuUtilizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the CPU utilization of running tasks.
aws_ecs_containerinsights_deployment_countDeploymentCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the number of service deployments.
aws_ecs_containerinsights_desired_task_countDesiredTaskCountAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the desired number of running tasks in a service.
aws_ecs_containerinsights_ebsfilesystem_sizeEBSFilesystemSizeAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the size of the EBS filesystem attached to the ECS instance.
aws_ecs_containerinsights_ebsfilesystem_utilizedEBSFilesystemUtilizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the utilized space in the EBS filesystem.
aws_ecs_containerinsights_ephemeral_storage_reservedEphemeralStorageReservedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Measures the amount of reserved ephemeral storage for tasks.
aws_ecs_containerinsights_ephemeral_storage_utilizedEphemeralStorageUtilizedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Tracks the ephemeral storage utilized by tasks.
aws_ecs_containerinsights_memory_reservedMemoryReservedAverage, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9Monitors the amount of memory reserved for tasks in ECS.
aws_ecs_containerinsights_memory_utilized | MemoryUtilized | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the memory utilized by tasks.
aws_ecs_containerinsights_network_rx_bytes | NetworkRxBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of bytes received by the network interfaces on the instance.
aws_ecs_containerinsights_network_tx_bytes | NetworkTxBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of bytes transmitted from the network interfaces on the instance.
aws_ecs_containerinsights_pending_task_count | PendingTaskCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of tasks that are in the pending state in the service.
aws_ecs_containerinsights_running_task_count | RunningTaskCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of running tasks in the service.
aws_ecs_containerinsights_service_count | ServiceCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the number of services running in the cluster.
aws_ecs_containerinsights_storage_read_bytes | StorageReadBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the number of bytes read from the storage attached to the ECS instance.
aws_ecs_containerinsights_storage_write_bytes | StorageWriteBytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of bytes written to storage.
aws_ecs_containerinsights_task_count | TaskCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total number of tasks running in the ECS cluster.
aws_ecs_containerinsights_task_set_count | TaskSetCount | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the number of task sets in a service.
aws_ecs_containerinsights_instance_cpu_limit | instance_cpu_limit | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total CPU limit configured for the instance.
aws_ecs_containerinsights_instance_cpu_reserved_capacity | instance_cpu_reserved_capacity | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the reserved CPU capacity on the instance.
aws_ecs_containerinsights_instance_cpu_usage_total | instance_cpu_usage_total | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total CPU usage across all tasks on the instance.
aws_ecs_containerinsights_instance_cpu_utilization | instance_cpu_utilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the percentage of CPU utilization on the ECS instance.
aws_ecs_containerinsights_instance_filesystem_utilization | instance_filesystem_utilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the utilization of the filesystem attached to the ECS instance.
aws_ecs_containerinsights_instance_memory_limit | instance_memory_limit | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the total memory limit configured for the instance.
aws_ecs_containerinsights_instance_memory_reserved_capacity | instance_memory_reserved_capacity | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the reserved memory capacity on the instance.
aws_ecs_containerinsights_instance_memory_utilization | instance_memory_utilization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the percentage of memory utilization on the ECS instance.
aws_ecs_containerinsights_instance_memory_working_set | instance_memory_working_set | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the working set memory on the instance, which is the amount of memory actively used.
aws_ecs_containerinsights_instance_network_total_bytes | instance_network_total_bytes | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Tracks the total number of bytes transferred (both received and transmitted) by the network interfaces.
aws_ecs_containerinsights_instance_number_of_running_tasks | instance_number_of_running_tasks | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Monitors the total number of running tasks on the instance.
aws_ecs_containerinsights_instance_memory_utliization | instance_memory_utliization | Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99, p99.5, p99.9 | Measures the memory utilization of the instance.