Menu
Grafana Cloud

Services

CloudWatch metrics supports the following services, and allows you to pick from a wide array of available metrics and statistics. Metrics in bold text are included in the default configuration. The statistics for all metrics are Average, Maximum, Minimum, Sum, SampleCount, p50, p75, p90, p95, p99.

AWS/ACMPrivateCA

Function: Provides a private certificate authority for managing SSL/TLS certificates

Scrape interval: 5 minutes

MetricCloudwatch metric
aws_acmprivateca_info
aws_acmprivateca_crlgeneratedCRLGenerated
aws_acmprivateca_failureFailure
aws_acmprivateca_misconfigured_crlbucketMisconfiguredCRLBucket
aws_acmprivateca_successSuccess
aws_acmprivateca_timeTime

AWS/AmazonMQ

Function: Managed message broker service for Apache ActiveMQ and RabbitMQ

Scrape interval: 5 minutes

MetricCloudwatch metric
aws_amazonmq_info
aws_amazonmq_ack_rateAckRate
aws_amazonmq_burst_balanceBurstBalance
aws_amazonmq_channel_countChannelCount
aws_amazonmq_confirm_rateConfirmRate
aws_amazonmq_connection_countConnectionCount
aws_amazonmq_consumer_countConsumerCount
aws_amazonmq_cpu_credit_balanceCpuCreditBalance
aws_amazonmq_cpu_utilizationCpuUtilization
aws_amazonmq_current_connections_countCurrentConnectionsCount
aws_amazonmq_dequeue_countDequeueCount
aws_amazonmq_dispatch_countDispatchCount
aws_amazonmq_enqueue_countEnqueueCount
aws_amazonmq_enqueue_timeEnqueueTime
aws_amazonmq_established_connections_countEstablishedConnectionsCount
aws_amazonmq_exchange_countExchangeCount
aws_amazonmq_expired_countExpiredCount
aws_amazonmq_heap_usageHeapUsage
aws_amazonmq_in_flight_countInFlightCount
aws_amazonmq_inactive_durable_topic_subscribers_countInactiveDurableTopicSubscribersCount
aws_amazonmq_job_scheduler_store_percent_usageJobSchedulerStorePercentUsage
aws_amazonmq_journal_files_for_fast_recoveryJournalFilesForFastRecovery
aws_amazonmq_journal_files_for_full_recoveryJournalFilesForFullRecovery
aws_amazonmq_memory_usageMemoryUsage
aws_amazonmq_message_countMessageCount
aws_amazonmq_message_ready_countMessageReadyCount
aws_amazonmq_message_unacknowledged_countMessageUnacknowledgedCount
aws_amazonmq_network_inNetworkIn
aws_amazonmq_network_outNetworkOut
aws_amazonmq_open_transaction_countOpenTransactionCount
aws_amazonmq_producer_countProducerCount
aws_amazonmq_publish_ratePublishRate
aws_amazonmq_queue_countQueueCount
aws_amazonmq_queue_sizeQueueSize
aws_amazonmq_rabbit_mqdisk_freeRabbitMQDiskFree
aws_amazonmq_rabbit_mqdisk_free_limitRabbitMQDiskFreeLimit
aws_amazonmq_rabbit_mqfd_usedRabbitMQFdUsed
aws_amazonmq_rabbit_mqmem_limitRabbitMQMemLimit
aws_amazonmq_rabbit_mqmem_usedRabbitMQMemUsed
aws_amazonmq_receive_countReceiveCount
aws_amazonmq_store_percent_usageStorePercentUsage
aws_amazonmq_system_cpu_utilizationSystemCpuUtilization
aws_amazonmq_temp_percent_usageTempPercentUsage
aws_amazonmq_total_consumer_countTotalConsumerCount
aws_amazonmq_total_dequeue_countTotalDequeueCount
aws_amazonmq_total_enqueue_countTotalEnqueueCount
aws_amazonmq_total_message_countTotalMessageCount
aws_amazonmq_total_producer_countTotalProducerCount
aws_amazonmq_volume_read_opsVolumeReadOps
aws_amazonmq_volume_write_opsVolumeWriteOps

AWS/ApiGateway

Function: Enables developers to create and manage APIs for accessing data and services

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_apigateway_info
aws_apigateway_4xx4xx
aws_apigateway_5xx5xx
aws_apigateway_countCount
aws_apigateway_integration_latencyIntegrationLatency
aws_apigateway_latencyLatency
aws_apigateway_4_xxerror4XXError
aws_apigateway_5_xxerror5XXError
aws_apigateway_cache_hit_countCacheHitCount
aws_apigateway_cache_miss_countCacheMissCount
aws_apigateway_client_errorClientError
aws_apigateway_connect_countConnectCount
aws_apigateway_data_processedDataProcessed
aws_apigateway_execution_errorExecutionError
aws_apigateway_integration_errorIntegrationError
aws_apigateway_message_countMessageCount

AWS/AppStream

Function: Delivers cloud-based desktops and applications to end-users on any device

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_appstream_info
aws_appstream_actual_capacityActualCapacity
aws_appstream_available_capacityAvailableCapacity
aws_appstream_capacity_utilizationCapacityUtilization
aws_appstream_desired_capacityDesiredCapacity
aws_appstream_in_use_capacityInUseCapacity
aws_appstream_insufficient_capacity_errorInsufficientCapacityError
aws_appstream_pending_capacityPendingCapacity
aws_appstream_running_capacityRunningCapacity

AWS/AppSync

Function: Managed service for building GraphQL APIs that connects to data sources like DynamoDB

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_appsync_info
aws_appsync_4_xxerror4XXError
aws_appsync_5_xxerror5XXError
aws_appsync_active_connectionsActiveConnections
aws_appsync_active_subscriptionsActiveSubscriptions
aws_appsync_connect_client_errorConnectClientError
aws_appsync_connect_server_errorConnectServerError
aws_appsync_connect_successConnectSuccess
aws_appsync_connection_durationConnectionDuration
aws_appsync_disconnect_client_errorDisconnectClientError
aws_appsync_disconnect_server_errorDisconnectServerError
aws_appsync_disconnect_successDisconnectSuccess
aws_appsync_latencyLatency
aws_appsync_publish_data_message_client_errorPublishDataMessageClientError
aws_appsync_publish_data_message_server_errorPublishDataMessageServerError
aws_appsync_publish_data_message_sizePublishDataMessageSize
aws_appsync_publish_data_message_successPublishDataMessageSuccess
aws_appsync_requestsRequests
aws_appsync_subscribe_client_errorSubscribeClientError
aws_appsync_subscribe_server_errorSubscribeServerError
aws_appsync_subscribe_successSubscribeSuccess
aws_appsync_tokens_consumedTokensConsumed
aws_appsync_unsubscribe_client_errorUnsubscribeClientError
aws_appsync_unsubscribe_server_errorUnsubscribeServerError
aws_appsync_unsubscribe_successUnsubscribeSuccess

AWS/ApplicationELB

Function: Distributes incoming traffic to targets like EC2 instances, containers, and IP addresses

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_applicationelb_info
aws_applicationelb_active_connection_countActiveConnectionCount
aws_applicationelb_client_tlsnegotiation_error_countClientTLSNegotiationErrorCount
aws_applicationelb_consumed_lcusConsumedLCUs
aws_applicationelb_elbauth_errorELBAuthError
aws_applicationelb_elbauth_failureELBAuthFailure
aws_applicationelb_elbauth_latencyELBAuthLatency
aws_applicationelb_elbauth_refresh_token_successELBAuthRefreshTokenSuccess
aws_applicationelb_elbauth_successELBAuthSuccess
aws_applicationelb_elbauth_user_claims_size_exceededELBAuthUserClaimsSizeExceeded
aws_applicationelb_httpcode_elb_3_xx_countHTTPCode_ELB_3XX_Count
aws_applicationelb_httpcode_elb_4_xx_countHTTPCode_ELB_4XX_Count
aws_applicationelb_httpcode_elb_5_xx_countHTTPCode_ELB_5XX_Count
aws_applicationelb_httpcode_target_2_xx_countHTTPCode_Target_2XX_Count
aws_applicationelb_httpcode_target_3_xx_countHTTPCode_Target_3XX_Count
aws_applicationelb_httpcode_target_4_xx_countHTTPCode_Target_4XX_Count
aws_applicationelb_httpcode_target_5_xx_countHTTPCode_Target_5XX_Count
aws_applicationelb_ipv6_processed_bytesIPv6ProcessedBytes
aws_applicationelb_ipv6_request_countIPv6RequestCount
aws_applicationelb_new_connection_countNewConnectionCount
aws_applicationelb_processed_bytesProcessedBytes
aws_applicationelb_rejected_connection_countRejectedConnectionCount
aws_applicationelb_request_countRequestCount
aws_applicationelb_rule_evaluationsRuleEvaluations
aws_applicationelb_target_connection_error_countTargetConnectionErrorCount
aws_applicationelb_target_response_timeTargetResponseTime
aws_applicationelb_target_tlsnegotiation_error_countTargetTLSNegotiationErrorCount
aws_applicationelb_anomalous_host_countAnomalousHostCount
aws_applicationelb_desync_mitigation_mode_non_compliant_request_countDesyncMitigationMode_NonCompliant_Request_Count
aws_applicationelb_dropped_invalid_header_request_countDroppedInvalidHeaderRequestCount
aws_applicationelb_forwarded_invalid_header_request_countForwardedInvalidHeaderRequestCount
aws_applicationelb_grpc_request_countGrpcRequestCount
aws_applicationelb_httpcode_elb_500_countHTTPCode_ELB_500_Count
aws_applicationelb_httpcode_elb_502_countHTTPCode_ELB_502_Count
aws_applicationelb_httpcode_elb_503_countHTTPCode_ELB_503_Count
aws_applicationelb_httpcode_elb_504_countHTTPCode_ELB_504_Count
aws_applicationelb_http_fixed_response_countHTTP_Fixed_Response_Count
aws_applicationelb_http_redirect_countHTTP_Redirect_Count
aws_applicationelb_http_redirect_url_limit_exceeded_countHTTP_Redirect_Url_Limit_Exceeded_Count
aws_applicationelb_healthy_host_countHealthyHostCount
aws_applicationelb_healthy_state_dnsHealthyStateDNS
aws_applicationelb_healthy_state_routingHealthyStateRouting
aws_applicationelb_lambda_internal_errorLambdaInternalError
aws_applicationelb_lambda_target_processed_bytesLambdaTargetProcessedBytes
aws_applicationelb_lambda_user_errorLambdaUserError
aws_applicationelb_mitigated_host_countMitigatedHostCount
aws_applicationelb_non_sticky_request_countNonStickyRequestCount
aws_applicationelb_request_count_per_targetRequestCountPerTarget
aws_applicationelb_standard_processed_bytesStandardProcessedBytes
aws_applicationelb_un_healthy_host_countUnHealthyHostCount
aws_applicationelb_unhealthy_routing_request_countUnhealthyRoutingRequestCount
aws_applicationelb_unhealthy_state_dnsUnhealthyStateDNS
aws_applicationelb_unhealthy_state_routingUnhealthyStateRouting

AWS/Athena

Function: Interactive query service to analyze data in S3 using SQL

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_athena_info
aws_athena_engine_execution_timeEngineExecutionTime
aws_athena_processed_bytesProcessedBytes
aws_athena_query_planning_timeQueryPlanningTime
aws_athena_query_queue_timeQueryQueueTime
aws_athena_service_processing_timeServiceProcessingTime
aws_athena_total_execution_timeTotalExecutionTime

AWS/AutoScaling

Function: Automatically adjusts capacity to maintain performance and cost efficiency

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_autoscaling_info
aws_autoscaling_group_and_warm_pool_desired_capacityGroupAndWarmPoolDesiredCapacity
aws_autoscaling_group_and_warm_pool_total_capacityGroupAndWarmPoolTotalCapacity
aws_autoscaling_group_desired_capacityGroupDesiredCapacity
aws_autoscaling_group_in_service_capacityGroupInServiceCapacity
aws_autoscaling_group_in_service_instancesGroupInServiceInstances
aws_autoscaling_group_max_sizeGroupMaxSize
aws_autoscaling_group_min_sizeGroupMinSize
aws_autoscaling_group_pending_capacityGroupPendingCapacity
aws_autoscaling_group_pending_instancesGroupPendingInstances
aws_autoscaling_group_standby_capacityGroupStandbyCapacity
aws_autoscaling_group_standby_instancesGroupStandbyInstances
aws_autoscaling_group_terminating_capacityGroupTerminatingCapacity
aws_autoscaling_group_terminating_instancesGroupTerminatingInstances
aws_autoscaling_group_total_capacityGroupTotalCapacity
aws_autoscaling_group_total_instancesGroupTotalInstances
aws_autoscaling_predictive_scaling_capacity_forecastPredictiveScalingCapacityForecast
aws_autoscaling_predictive_scaling_load_forecastPredictiveScalingLoadForecast
aws_autoscaling_predictive_scaling_metric_pair_correlationPredictiveScalingMetricPairCorrelation
aws_autoscaling_warm_pool_desired_capacityWarmPoolDesiredCapacity
aws_autoscaling_warm_pool_min_sizeWarmPoolMinSize
aws_autoscaling_warm_pool_pending_capacityWarmPoolPendingCapacity
aws_autoscaling_warm_pool_terminating_capacityWarmPoolTerminatingCapacity
aws_autoscaling_warm_pool_total_capacityWarmPoolTotalCapacity
aws_autoscaling_warm_pool_warmed_capacityWarmPoolWarmedCapacity

AWS/Backup

Function: Centralized backup service to automate and manage backups across AWS services

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_backup_info
aws_backup_number_of_backup_jobs_abortedNumberOfBackupJobsAborted
aws_backup_number_of_backup_jobs_completedNumberOfBackupJobsCompleted
aws_backup_number_of_backup_jobs_createdNumberOfBackupJobsCreated
aws_backup_number_of_backup_jobs_expiredNumberOfBackupJobsExpired
aws_backup_number_of_backup_jobs_failedNumberOfBackupJobsFailed
aws_backup_number_of_backup_jobs_pendingNumberOfBackupJobsPending
aws_backup_number_of_backup_jobs_runningNumberOfBackupJobsRunning
aws_backup_number_of_copy_jobs_completedNumberOfCopyJobsCompleted
aws_backup_number_of_copy_jobs_createdNumberOfCopyJobsCreated
aws_backup_number_of_copy_jobs_failedNumberOfCopyJobsFailed
aws_backup_number_of_copy_jobs_runningNumberOfCopyJobsRunning
aws_backup_number_of_recovery_points_coldNumberOfRecoveryPointsCold
aws_backup_number_of_recovery_points_completedNumberOfRecoveryPointsCompleted
aws_backup_number_of_recovery_points_deletingNumberOfRecoveryPointsDeleting
aws_backup_number_of_recovery_points_expiredNumberOfRecoveryPointsExpired
aws_backup_number_of_recovery_points_partialNumberOfRecoveryPointsPartial
aws_backup_number_of_restore_jobs_completedNumberOfRestoreJobsCompleted
aws_backup_number_of_restore_jobs_failedNumberOfRestoreJobsFailed
aws_backup_number_of_restore_jobs_pendingNumberOfRestoreJobsPending
aws_backup_number_of_restore_jobs_runningNumberOfRestoreJobsRunning

AWS/Billing

Function: Provides detailed usage and cost data for AWS services. This service only produces metrics to specific regions in AWS. Any jobs configured with this service will only gather data from the us-east-1 regions.

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_billing_estimated_chargesEstimatedCharges

AWS/Cassandra

Function: Managed Apache Cassandra-compatible database service

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_cassandra_info
aws_cassandra_account_max_readsAccountMaxReads
aws_cassandra_account_max_table_level_readsAccountMaxTableLevelReads
aws_cassandra_account_max_table_level_writesAccountMaxTableLevelWrites
aws_cassandra_account_max_writesAccountMaxWrites
aws_cassandra_account_provisioned_read_capacity_utilizationAccountProvisionedReadCapacityUtilization
aws_cassandra_account_provisioned_write_capacity_utilizationAccountProvisionedWriteCapacityUtilization
aws_cassandra_conditional_check_failed_requestsConditionalCheckFailedRequests
aws_cassandra_consumed_read_capacity_unitsConsumedReadCapacityUnits
aws_cassandra_consumed_write_capacity_unitsConsumedWriteCapacityUnits
aws_cassandra_max_provisioned_table_read_capacity_utilizationMaxProvisionedTableReadCapacityUtilization
aws_cassandra_max_provisioned_table_write_capacity_utilizationMaxProvisionedTableWriteCapacityUtilization
aws_cassandra_returned_item_countReturnedItemCount
aws_cassandra_returned_item_count_by_selectReturnedItemCountBySelect
aws_cassandra_successful_request_countSuccessfulRequestCount
aws_cassandra_successful_request_latencySuccessfulRequestLatency
aws_cassandra_system_errorsSystemErrors
aws_cassandra_user_errorsUserErrors

AWS/CertificateManager

Function: Manages the provisioning, renewal, and deployment of SSL/TLS certificates

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_certificatemanager_info
aws_certificatemanager_days_to_expiryDaysToExpiry

AWS/CloudFront

Function: Content delivery network to deliver data, videos, applications globally

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_cloudfront_info
aws_cloudfront_4xx_error_rate4xxErrorRate
aws_cloudfront_5xx_error_rate5xxErrorRate
aws_cloudfront_bytes_downloadedBytesDownloaded
aws_cloudfront_bytes_uploadedBytesUploaded
aws_cloudfront_requestsRequests
aws_cloudfront_total_error_rateTotalErrorRate
aws_cloudfront_401_error_rate401ErrorRate
aws_cloudfront_403_error_rate403ErrorRate
aws_cloudfront_404_error_rate404ErrorRate
aws_cloudfront_502_error_rate502ErrorRate
aws_cloudfront_503_error_rate503ErrorRate
aws_cloudfront_504_error_rate504ErrorRate
aws_cloudfront_cache_hit_rateCacheHitRate
aws_cloudfront_function_compute_utilizationFunctionComputeUtilization
aws_cloudfront_function_execution_errorsFunctionExecutionErrors
aws_cloudfront_function_invocationsFunctionInvocations
aws_cloudfront_function_throttlesFunctionThrottles
aws_cloudfront_function_validation_errorsFunctionValidationErrors
aws_cloudfront_lambda_execution_errorLambdaExecutionError
aws_cloudfront_lambda_limit_exceeded_errorsLambdaLimitExceededErrors
aws_cloudfront_lambda_validation_errorLambdaValidationError
aws_cloudfront_origin_latencyOriginLatency

AWS/Cognito

Function: Provides authentication, authorization, and user management for web and mobile apps

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_cognito_info
aws_cognito_account_take_over_riskAccountTakeOverRisk
aws_cognito_compromised_credentials_riskCompromisedCredentialsRisk
aws_cognito_federation_successesFederationSuccesses
aws_cognito_federation_throttlesFederationThrottles
aws_cognito_no_riskNoRisk
aws_cognito_override_blockOverrideBlock
aws_cognito_riskRisk
aws_cognito_sign_in_successesSignInSuccesses
aws_cognito_sign_in_throttlesSignInThrottles
aws_cognito_sign_up_successesSignUpSuccesses
aws_cognito_sign_up_throttlesSignUpThrottles
aws_cognito_token_refresh_successesTokenRefreshSuccesses
aws_cognito_token_refresh_throttlesTokenRefreshThrottles

AWS/DDoSProtection

Function: Protects against distributed denial of service attacks with AWS Shield

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_ddosprotection_info
aws_ddosprotection_ddo_sattack_bits_per_secondDDoSAttackBitsPerSecond
aws_ddosprotection_ddo_sattack_packets_per_secondDDoSAttackPacketsPerSecond
aws_ddosprotection_ddo_sattack_requests_per_secondDDoSAttackRequestsPerSecond
aws_ddosprotection_ddo_sdetectedDDoSDetected
aws_ddosprotection_volume_bits_per_secondVolumeBitsPerSecond
aws_ddosprotection_volume_packets_per_secondVolumePacketsPerSecond

AWS/DMS

Function: Migrates databases to AWS with minimal downtime

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_dms_info
aws_dms_cdcchanges_disk_sourceCDCChangesDiskSource
aws_dms_cdcchanges_disk_targetCDCChangesDiskTarget
aws_dms_cdcchanges_memory_sourceCDCChangesMemorySource
aws_dms_cdcchanges_memory_targetCDCChangesMemoryTarget
aws_dms_cdcincoming_changesCDCIncomingChanges
aws_dms_cdclatency_sourceCDCLatencySource
aws_dms_cdclatency_targetCDCLatencyTarget
aws_dms_cdcthroughput_bandwidth_sourceCDCThroughputBandwidthSource
aws_dms_cdcthroughput_bandwidth_targetCDCThroughputBandwidthTarget
aws_dms_cdcthroughput_rows_sourceCDCThroughputRowsSource
aws_dms_cdcthroughput_rows_targetCDCThroughputRowsTarget
aws_dms_cpuutilizationCPUUtilization
aws_dms_free_storage_spaceFreeStorageSpace
aws_dms_freeable_memoryFreeableMemory
aws_dms_full_load_throughput_bandwidth_sourceFullLoadThroughputBandwidthSource
aws_dms_full_load_throughput_bandwidth_targetFullLoadThroughputBandwidthTarget
aws_dms_full_load_throughput_rows_sourceFullLoadThroughputRowsSource
aws_dms_full_load_throughput_rows_targetFullLoadThroughputRowsTarget
aws_dms_network_receive_throughputNetworkReceiveThroughput
aws_dms_network_transmit_throughputNetworkTransmitThroughput
aws_dms_read_iopsReadIOPS
aws_dms_read_latencyReadLatency
aws_dms_read_throughputReadThroughput
aws_dms_swap_usageSwapUsage
aws_dms_write_iopsWriteIOPS
aws_dms_write_latencyWriteLatency
aws_dms_write_throughputWriteThroughput

AWS/DX

Function: AWS Direct Connect provides a dedicated network connection to AWS.

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_dx_info
aws_dx_connection_bps_egressConnectionBpsEgress
aws_dx_connection_bps_ingressConnectionBpsIngress
aws_dx_connection_crcerror_countConnectionCRCErrorCount
aws_dx_connection_encryption_stateConnectionEncryptionState
aws_dx_connection_error_countConnectionErrorCount
aws_dx_connection_light_level_rxConnectionLightLevelRx
aws_dx_connection_light_level_txConnectionLightLevelTx
aws_dx_connection_pps_egressConnectionPpsEgress
aws_dx_connection_pps_ingressConnectionPpsIngress
aws_dx_connection_stateConnectionState
aws_dx_virtual_interface_bps_egressVirtualInterfaceBpsEgress
aws_dx_virtual_interface_bps_ingressVirtualInterfaceBpsIngress
aws_dx_virtual_interface_pps_egressVirtualInterfacePpsEgress
aws_dx_virtual_interface_pps_ingressVirtualInterfacePpsIngress

AWS/DocDB

Function: Managed document database service that supports MongoDB workloads

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_docdb_info
aws_docdb_backup_retention_period_storage_usedBackupRetentionPeriodStorageUsed
aws_docdb_buffer_cache_hit_ratioBufferCacheHitRatio
aws_docdb_cpuutilizationCPUUtilization
aws_docdb_change_stream_log_sizeChangeStreamLogSize
aws_docdb_dbcluster_replica_lag_maximumDBClusterReplicaLagMaximum
aws_docdb_dbcluster_replica_lag_minimumDBClusterReplicaLagMinimum
aws_docdb_dbinstance_replica_lagDBInstanceReplicaLag
aws_docdb_database_connectionsDatabaseConnections
aws_docdb_database_connections_maxDatabaseConnectionsMax
aws_docdb_database_cursorsDatabaseCursors
aws_docdb_database_cursors_maxDatabaseCursorsMax
aws_docdb_database_cursors_timed_outDatabaseCursorsTimedOut
aws_docdb_disk_queue_depthDiskQueueDepth
aws_docdb_documents_deletedDocumentsDeleted
aws_docdb_documents_insertedDocumentsInserted
aws_docdb_documents_returnedDocumentsReturned
aws_docdb_documents_updatedDocumentsUpdated
aws_docdb_engine_uptimeEngineUptime
aws_docdb_free_local_storageFreeLocalStorage
aws_docdb_freeable_memoryFreeableMemory
aws_docdb_network_receive_throughputNetworkReceiveThroughput
aws_docdb_network_throughputNetworkThroughput
aws_docdb_network_transmit_throughputNetworkTransmitThroughput
aws_docdb_opcounters_commandOpcountersCommand
aws_docdb_opcounters_deleteOpcountersDelete
aws_docdb_opcounters_getmoreOpcountersGetmore
aws_docdb_opcounters_insertOpcountersInsert
aws_docdb_opcounters_queryOpcountersQuery
aws_docdb_opcounters_updateOpcountersUpdate
aws_docdb_read_iopsReadIOPS
aws_docdb_read_latencyReadLatency
aws_docdb_read_throughputReadThroughput
aws_docdb_snapshot_storage_usedSnapshotStorageUsed
aws_docdb_swap_usageSwapUsage
aws_docdb_total_backup_storage_billedTotalBackupStorageBilled
aws_docdb_volume_bytes_usedVolumeBytesUsed
aws_docdb_volume_read_iopsVolumeReadIOPs
aws_docdb_volume_write_iopsVolumeWriteIOPs
aws_docdb_write_iopsWriteIOPS
aws_docdb_write_latencyWriteLatency
aws_docdb_write_throughputWriteThroughput

AWS/DynamoDB

Function: Fully managed NoSQL database service for low-latency applications at scale

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_dynamodb_info
aws_dynamodb_account_max_readsAccountMaxReads
aws_dynamodb_account_max_table_level_readsAccountMaxTableLevelReads
aws_dynamodb_account_max_table_level_writesAccountMaxTableLevelWrites
aws_dynamodb_account_max_writesAccountMaxWrites
aws_dynamodb_account_provisioned_read_capacity_utilizationAccountProvisionedReadCapacityUtilization
aws_dynamodb_account_provisioned_write_capacity_utilizationAccountProvisionedWriteCapacityUtilization
aws_dynamodb_age_of_oldest_unreplicated_recordAgeOfOldestUnreplicatedRecord
aws_dynamodb_conditional_check_failed_requestsConditionalCheckFailedRequests
aws_dynamodb_consumed_change_data_capture_unitsConsumedChangeDataCaptureUnits
aws_dynamodb_consumed_read_capacity_unitsConsumedReadCapacityUnits
aws_dynamodb_consumed_write_capacity_unitsConsumedWriteCapacityUnits
aws_dynamodb_failed_to_replicate_record_countFailedToReplicateRecordCount
aws_dynamodb_max_provisioned_table_read_capacity_utilizationMaxProvisionedTableReadCapacityUtilization
aws_dynamodb_max_provisioned_table_write_capacity_utilizationMaxProvisionedTableWriteCapacityUtilization
aws_dynamodb_on_demand_max_read_request_unitsOnDemandMaxReadRequestUnits
aws_dynamodb_on_demand_max_write_request_unitsOnDemandMaxWriteRequestUnits
aws_dynamodb_online_index_consumed_write_capacityOnlineIndexConsumedWriteCapacity
aws_dynamodb_online_index_percentage_progressOnlineIndexPercentageProgress
aws_dynamodb_online_index_throttle_eventsOnlineIndexThrottleEvents
aws_dynamodb_pending_replication_countPendingReplicationCount
aws_dynamodb_provisioned_read_capacity_unitsProvisionedReadCapacityUnits
aws_dynamodb_provisioned_write_capacity_unitsProvisionedWriteCapacityUnits
aws_dynamodb_read_throttle_eventsReadThrottleEvents
aws_dynamodb_replication_latencyReplicationLatency
aws_dynamodb_returned_bytesReturnedBytes
aws_dynamodb_returned_item_countReturnedItemCount
aws_dynamodb_returned_records_countReturnedRecordsCount
aws_dynamodb_successful_request_latencySuccessfulRequestLatency
aws_dynamodb_system_errorsSystemErrors
aws_dynamodb_throttled_put_record_countThrottledPutRecordCount
aws_dynamodb_throttled_requestsThrottledRequests
aws_dynamodb_time_to_live_deleted_item_countTimeToLiveDeletedItemCount
aws_dynamodb_transaction_conflictTransactionConflict
aws_dynamodb_user_errorsUserErrors
aws_dynamodb_write_throttle_eventsWriteThrottleEvents

AWS/EBS

Function: Block storage for use with EC2 instances

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_ebs_info
aws_ebs_volume_read_bytesVolumeReadBytes
aws_ebs_volume_write_bytesVolumeWriteBytes
aws_ebs_volume_read_opsVolumeReadOps
aws_ebs_volume_write_opsVolumeWriteOps
aws_ebs_volume_total_read_timeVolumeTotalReadTime
aws_ebs_volume_total_write_timeVolumeTotalWriteTime
aws_ebs_volume_idle_timeVolumeIdleTime
aws_ebs_volume_queue_lengthVolumeQueueLength
aws_ebs_volume_throughput_percentageVolumeThroughputPercentage
aws_ebs_volume_consumed_read_write_opsVolumeConsumedReadWriteOps
aws_ebs_burst_balanceBurstBalance
aws_ebs_enable_copied_image_deprecation_completedEnableCopiedImageDeprecationCompleted
aws_ebs_enable_copied_image_deprecation_failedEnableCopiedImageDeprecationFailed
aws_ebs_enable_image_deprecation_completedEnableImageDeprecationCompleted
aws_ebs_enable_image_deprecation_failedEnableImageDeprecationFailed
aws_ebs_images_copied_region_completedImagesCopiedRegionCompleted
aws_ebs_images_copied_region_deregister_completedImagesCopiedRegionDeregisterCompleted
aws_ebs_images_copied_region_deregistered_failedImagesCopiedRegionDeregisteredFailed
aws_ebs_images_copied_region_failedImagesCopiedRegionFailed
aws_ebs_images_copied_region_startedImagesCopiedRegionStarted
aws_ebs_images_create_completedImagesCreateCompleted
aws_ebs_images_create_failedImagesCreateFailed
aws_ebs_images_create_startedImagesCreateStarted
aws_ebs_images_deregister_completedImagesDeregisterCompleted
aws_ebs_images_deregister_failedImagesDeregisterFailed
aws_ebs_resources_targetedResourcesTargeted
aws_ebs_snapshots_copied_account_completedSnapshotsCopiedAccountCompleted
aws_ebs_snapshots_copied_account_delete_completedSnapshotsCopiedAccountDeleteCompleted
aws_ebs_snapshots_copied_account_delete_failedSnapshotsCopiedAccountDeleteFailed
aws_ebs_snapshots_copied_account_failedSnapshotsCopiedAccountFailed
aws_ebs_snapshots_copied_account_startedSnapshotsCopiedAccountStarted
aws_ebs_snapshots_copied_region_completedSnapshotsCopiedRegionCompleted
aws_ebs_snapshots_copied_region_delete_completedSnapshotsCopiedRegionDeleteCompleted
aws_ebs_snapshots_copied_region_delete_failedSnapshotsCopiedRegionDeleteFailed
aws_ebs_snapshots_copied_region_failedSnapshotsCopiedRegionFailed
aws_ebs_snapshots_copied_region_startedSnapshotsCopiedRegionStarted
aws_ebs_snapshots_create_completedSnapshotsCreateCompleted
aws_ebs_snapshots_create_failedSnapshotsCreateFailed
aws_ebs_snapshots_create_startedSnapshotsCreateStarted
aws_ebs_snapshots_delete_completedSnapshotsDeleteCompleted
aws_ebs_snapshots_delete_failedSnapshotsDeleteFailed
aws_ebs_snapshots_shared_completedSnapshotsSharedCompleted

AWS/EC2

Function: Virtual servers in the cloud for running applications

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_ec2_info
aws_ec2_cpuutilizationCPUUtilization
aws_ec2_network_inNetworkIn
aws_ec2_network_outNetworkOut
aws_ec2_network_packets_inNetworkPacketsIn
aws_ec2_network_packets_outNetworkPacketsOut
aws_ec2_disk_read_bytesDiskReadBytes
aws_ec2_disk_write_bytesDiskWriteBytes
aws_ec2_disk_read_opsDiskReadOps
aws_ec2_disk_write_opsDiskWriteOps
aws_ec2_status_check_failedStatusCheckFailed
aws_ec2_status_check_failed_instanceStatusCheckFailed_Instance
aws_ec2_status_check_failed_systemStatusCheckFailed_System
aws_ec2_ebsiobalance_percentEBSIOBalance%
aws_ec2_ebsbyte_balance_percentEBSByteBalance%
aws_ec2_ebsread_opsEBSReadOps
aws_ec2_ebswrite_opsEBSWriteOps
aws_ec2_ebsread_bytesEBSReadBytes
aws_ec2_ebswrite_bytesEBSWriteBytes
aws_ec2_cpucredit_balanceCPUCreditBalance
aws_ec2_cpucredit_usageCPUCreditUsage
aws_ec2_cpusurplus_credit_balanceCPUSurplusCreditBalance
aws_ec2_cpusurplus_credits_chargedCPUSurplusCreditsCharged
aws_ec2_dedicated_host_cpuutilizationDedicatedHostCPUUtilization
aws_ec2_metadata_no_tokenMetadataNoToken
aws_ec2_status_check_failed_attached_ebsStatusCheckFailed_AttachedEBS

AWS/EC2Spot

Function: Uses spare EC2 capacity at reduced prices for workloads with flexible start times

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_ec2spot_info
aws_ec2spot_available_instance_pools_countAvailableInstancePoolsCount
aws_ec2spot_bids_submitted_for_capacityBidsSubmittedForCapacity
aws_ec2spot_eligible_instance_pool_countEligibleInstancePoolCount
aws_ec2spot_fulfilled_capacityFulfilledCapacity
aws_ec2spot_max_percent_capacity_allocationMaxPercentCapacityAllocation
aws_ec2spot_pending_capacityPendingCapacity
aws_ec2spot_percent_capacity_allocationPercentCapacityAllocation
aws_ec2spot_target_capacityTargetCapacity
aws_ec2spot_terminating_capacityTerminatingCapacity

AWS/ECR

Function: Managed container image registry for storing Docker images

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_ecr_repository_pull_countRepositoryPullCount

AWS/ECS

Function: Fully managed container orchestration service for running Docker containers

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_ecs_info
aws_ecs_cpureservationCPUReservation
aws_ecs_cpuutilizationCPUUtilization
aws_ecs_gpureservationGPUReservation
aws_ecs_memory_reservationMemoryReservation
aws_ecs_memory_utilizationMemoryUtilization

AWS/EFS

Function: Scalable and fully managed file storage for use with EC2 instances

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_efs_info
aws_efs_burst_credit_balanceBurstCreditBalance
aws_efs_client_connectionsClientConnections
aws_efs_data_read_iobytesDataReadIOBytes
aws_efs_data_write_iobytesDataWriteIOBytes
aws_efs_metadata_iobytesMetadataIOBytes
aws_efs_metered_iobytesMeteredIOBytes
aws_efs_percent_iolimitPercentIOLimit
aws_efs_permitted_throughputPermittedThroughput
aws_efs_storage_bytesStorageBytes
aws_efs_total_iobytesTotalIOBytes

AWS/ELB

Function: Distributes traffic across multiple targets like EC2 instances and containers

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_elb_info
aws_elb_backend_connection_errorsBackendConnectionErrors
aws_elb_healthy_host_countHealthyHostCount
aws_elb_httpcode_backend_2_xxHTTPCode_Backend_2XX
aws_elb_httpcode_backend_3_xxHTTPCode_Backend_3XX
aws_elb_httpcode_backend_4_xxHTTPCode_Backend_4XX
aws_elb_httpcode_backend_5_xxHTTPCode_Backend_5XX
aws_elb_httpcode_elb_4_xxHTTPCode_ELB_4XX
aws_elb_httpcode_elb_5_xxHTTPCode_ELB_5XX
aws_elb_latencyLatency
aws_elb_request_countRequestCount
aws_elb_spillover_countSpilloverCount
aws_elb_surge_queue_lengthSurgeQueueLength
aws_elb_un_healthy_host_countUnHealthyHostCount
aws_elb_estimated_albactive_connection_countEstimatedALBActiveConnectionCount
aws_elb_estimated_albconsumed_lcusEstimatedALBConsumedLCUs
aws_elb_estimated_albnew_connection_countEstimatedALBNewConnectionCount
aws_elb_estimated_processed_bytesEstimatedProcessedBytes

AWS/ES

Function: Managed Elasticsearch service for real-time search and analytics

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_es_info
aws_es_2xx2xx
aws_es_3xx3xx
aws_es_4xx4xx
aws_es_5xx5xx
aws_es_adanomaly_detectors_index_status_redADAnomalyDetectorsIndexStatus.red
aws_es_adanomaly_detectors_index_status_index_existsADAnomalyDetectorsIndexStatusIndexExists
aws_es_adanomaly_results_index_status_redADAnomalyResultsIndexStatus.red
aws_es_adanomaly_results_index_status_index_existsADAnomalyResultsIndexStatusIndexExists
aws_es_adexecute_failure_countADExecuteFailureCount
aws_es_adexecute_request_countADExecuteRequestCount
aws_es_adhcexecute_failure_countADHCExecuteFailureCount
aws_es_adhcexecute_request_countADHCExecuteRequestCount
aws_es_admodels_checkpoint_index_status_redADModelsCheckpointIndexStatus.red
aws_es_admodels_checkpoint_index_status_index_existsADModelsCheckpointIndexStatusIndexExists
aws_es_adplugin_unhealthyADPluginUnhealthy
aws_es_alerting_degradedAlertingDegraded
aws_es_alerting_index_existsAlertingIndexExists
aws_es_alerting_index_status_greenAlertingIndexStatus.green
aws_es_alerting_index_status_redAlertingIndexStatus.red
aws_es_alerting_index_status_yellowAlertingIndexStatus.yellow
aws_es_alerting_nodes_not_on_scheduleAlertingNodesNotOnSchedule
aws_es_alerting_nodes_on_scheduleAlertingNodesOnSchedule
aws_es_alerting_scheduled_job_enabledAlertingScheduledJobEnabled
aws_es_asynchronous_search_cancelledAsynchronousSearchCancelled
aws_es_asynchronous_search_completion_rateAsynchronousSearchCompletionRate
aws_es_asynchronous_search_failure_rateAsynchronousSearchFailureRate
aws_es_asynchronous_search_initialized_rateAsynchronousSearchInitializedRate
aws_es_asynchronous_search_max_running_timeAsynchronousSearchMaxRunningTime
aws_es_asynchronous_search_persist_failed_rateAsynchronousSearchPersistFailedRate
aws_es_asynchronous_search_persist_rateAsynchronousSearchPersistRate
aws_es_asynchronous_search_rejectedAsynchronousSearchRejected
aws_es_asynchronous_search_running_currentAsynchronousSearchRunningCurrent
aws_es_asynchronous_search_store_healthAsynchronousSearchStoreHealth
aws_es_asynchronous_search_store_sizeAsynchronousSearchStoreSize
aws_es_asynchronous_search_stored_response_countAsynchronousSearchStoredResponseCount
aws_es_asynchronous_search_submission_rateAsynchronousSearchSubmissionRate
aws_es_auto_follow_leader_call_failureAutoFollowLeaderCallFailure
aws_es_auto_follow_num_failed_start_replicationAutoFollowNumFailedStartReplication
aws_es_auto_follow_num_success_start_replicationAutoFollowNumSuccessStartReplication
aws_es_auto_tune_changes_history_heap_sizeAutoTuneChangesHistoryHeapSize
aws_es_auto_tune_changes_history_jvmyoung_gen_argsAutoTuneChangesHistoryJVMYoungGenArgs
aws_es_auto_tune_failedAutoTuneFailed
aws_es_auto_tune_succeededAutoTuneSucceeded
aws_es_auto_tune_valueAutoTuneValue
aws_es_automated_snapshot_failureAutomatedSnapshotFailure
aws_es_avg_point_in_time_alive_timeAvgPointInTimeAliveTime
aws_es_burst_balanceBurstBalance
aws_es_cpucredit_balanceCPUCreditBalance
aws_es_cpuutilizationCPUUtilization
aws_es_cluster_index_writes_blockedClusterIndexWritesBlocked
aws_es_cluster_status_greenClusterStatus.green
aws_es_cluster_status_redClusterStatus.red
aws_es_cluster_status_yellowClusterStatus.yellow
aws_es_cluster_used_spaceClusterUsedSpace
aws_es_cold_storage_space_utilizationColdStorageSpaceUtilization
aws_es_cold_to_warm_migration_failure_countColdToWarmMigrationFailureCount
aws_es_cold_to_warm_migration_latencyColdToWarmMigrationLatency
aws_es_cold_to_warm_migration_queue_sizeColdToWarmMigrationQueueSize
aws_es_cold_to_warm_migration_success_countColdToWarmMigrationSuccessCount
aws_es_coordinating_write_rejectedCoordinatingWriteRejected
aws_es_cross_cluster_inbound_replication_requestsCrossClusterInboundReplicationRequests
aws_es_cross_cluster_inbound_requestsCrossClusterInboundRequests
aws_es_cross_cluster_outbound_connectionsCrossClusterOutboundConnections
aws_es_cross_cluster_outbound_replication_requestsCrossClusterOutboundReplicationRequests
aws_es_cross_cluster_outbound_requestsCrossClusterOutboundRequests
aws_es_current_point_in_timeCurrentPointInTime
aws_es_data_nodesDataNodes
aws_es_data_nodes_shards_activeDataNodesShards.active
aws_es_data_nodes_shards_initializingDataNodesShards.initializing
aws_es_data_nodes_shards_relocatingDataNodesShards.relocating
aws_es_data_nodes_shards_unassignedDataNodesShards.unassigned
aws_es_deleted_documentsDeletedDocuments
aws_es_disk_queue_depthDiskQueueDepth
aws_es_reporting_failed_request_sys_err_countESReportingFailedRequestSysErrCount
aws_es_reporting_failed_request_user_err_countESReportingFailedRequestUserErrCount
aws_es_reporting_request_countESReportingRequestCount
aws_es_reporting_success_countESReportingSuccessCount
aws_es_elasticsearch_requestsElasticsearchRequests
aws_es_follower_check_pointFollowerCheckPoint
aws_es_free_storage_spaceFreeStorageSpace
aws_es_has_active_point_in_timeHasActivePointInTime
aws_es_has_used_point_in_timeHasUsedPointInTime
aws_es_hot_storage_space_utilizationHotStorageSpaceUtilization
aws_es_hot_to_warm_migration_failure_countHotToWarmMigrationFailureCount
aws_es_hot_to_warm_migration_force_merge_latencyHotToWarmMigrationForceMergeLatency
aws_es_hot_to_warm_migration_processing_latencyHotToWarmMigrationProcessingLatency
aws_es_hot_to_warm_migration_queue_sizeHotToWarmMigrationQueueSize
aws_es_hot_to_warm_migration_snapshot_latencyHotToWarmMigrationSnapshotLatency
aws_es_hot_to_warm_migration_success_countHotToWarmMigrationSuccessCount
aws_es_hot_to_warm_migration_success_latencyHotToWarmMigrationSuccessLatency
aws_es_indexing_latencyIndexingLatency
aws_es_indexing_rateIndexingRate
aws_es_invalid_host_header_requestsInvalidHostHeaderRequests
aws_es_iops_throttleIopsThrottle
aws_es_jvmgcold_collection_countJVMGCOldCollectionCount
aws_es_jvmgcold_collection_timeJVMGCOldCollectionTime
aws_es_jvmgcyoung_collection_countJVMGCYoungCollectionCount
aws_es_jvmgcyoung_collection_timeJVMGCYoungCollectionTime
aws_es_jvmmemory_pressureJVMMemoryPressure
aws_es_kmskey_errorKMSKeyError
aws_es_kmskey_inaccessibleKMSKeyInaccessible
aws_es_knncache_capacity_reachedKNNCacheCapacityReached
aws_es_knncircuit_breaker_triggeredKNNCircuitBreakerTriggered
aws_es_knneviction_countKNNEvictionCount
aws_es_knngraph_index_errorsKNNGraphIndexErrors
aws_es_knngraph_index_requestsKNNGraphIndexRequests
aws_es_knngraph_memory_usageKNNGraphMemoryUsage
aws_es_knngraph_query_errorsKNNGraphQueryErrors
aws_es_knngraph_query_requestsKNNGraphQueryRequests
aws_es_knnhit_countKNNHitCount
aws_es_knnload_exception_countKNNLoadExceptionCount
aws_es_knnload_success_countKNNLoadSuccessCount
aws_es_knnmiss_countKNNMissCount
aws_es_knnquery_requestsKNNQueryRequests
aws_es_knnscript_compilation_errorsKNNScriptCompilationErrors
aws_es_knnscript_compilationsKNNScriptCompilations
aws_es_knnscript_query_errorsKNNScriptQueryErrors
aws_es_knnscript_query_requestsKNNScriptQueryRequests
aws_es_knntotal_load_timeKNNTotalLoadTime
aws_es_kibana_concurrent_connectionsKibanaConcurrentConnections
aws_es_kibana_healthy_nodesKibanaHealthyNodes
aws_es_kibana_heap_totalKibanaHeapTotal
aws_es_kibana_heap_usedKibanaHeapUsed
aws_es_kibana_heap_utilizationKibanaHeapUtilization
aws_es_kibana_os1_minute_loadKibanaOS1MinuteLoad
aws_es_kibana_reporting_failed_request_sys_err_countKibanaReportingFailedRequestSysErrCount
aws_es_kibana_reporting_failed_request_user_err_countKibanaReportingFailedRequestUserErrCount
aws_es_kibana_reporting_request_countKibanaReportingRequestCount
aws_es_kibana_reporting_success_countKibanaReportingSuccessCount
aws_es_kibana_request_totalKibanaRequestTotal
aws_es_kibana_response_times_max_in_millisKibanaResponseTimesMaxInMillis
aws_es_ltrfeature_memory_usage_in_bytesLTRFeatureMemoryUsageInBytes
aws_es_ltrfeatureset_memory_usage_in_bytesLTRFeaturesetMemoryUsageInBytes
aws_es_ltrmemory_usageLTRMemoryUsage
aws_es_ltrmodel_memory_usage_in_bytesLTRModelMemoryUsageInBytes
aws_es_ltrrequest_error_countLTRRequestErrorCount
aws_es_ltrrequest_total_countLTRRequestTotalCount
aws_es_ltrstatus_redLTRStatus.red
aws_es_leader_check_pointLeaderCheckPoint
aws_es_master_cpucredit_balanceMasterCPUCreditBalance
aws_es_master_cpuutilizationMasterCPUUtilization
aws_es_master_free_storage_spaceMasterFreeStorageSpace
aws_es_master_jvmmemory_pressureMasterJVMMemoryPressure
aws_es_master_old_gen_jvmmemory_pressureMasterOldGenJVMMemoryPressure
aws_es_master_reachable_from_nodeMasterReachableFromNode
aws_es_master_sys_memory_utilizationMasterSysMemoryUtilization
aws_es_max_provisioned_throughputMaxProvisionedThroughput
aws_es_nodesNodes
aws_es_old_gen_jvmmemory_pressureOldGenJVMMemoryPressure
aws_es_open_search_dashboards_concurrent_connectionsOpenSearchDashboardsConcurrentConnections
aws_es_open_search_dashboards_healthy_nodeOpenSearchDashboardsHealthyNode
aws_es_open_search_dashboards_healthy_nodesOpenSearchDashboardsHealthyNodes
aws_es_open_search_dashboards_heap_totalOpenSearchDashboardsHeapTotal
aws_es_open_search_dashboards_heap_usedOpenSearchDashboardsHeapUsed
aws_es_open_search_dashboards_heap_utilizationOpenSearchDashboardsHeapUtilization
aws_es_open_search_dashboards_os1_minute_loadOpenSearchDashboardsOS1MinuteLoad
aws_es_open_search_dashboards_request_totalOpenSearchDashboardsRequestTotal
aws_es_open_search_dashboards_response_times_max_in_millisOpenSearchDashboardsResponseTimesMaxInMillis
aws_es_open_search_requestsOpenSearchRequests
aws_es_opensearch_dashboards_reporting_failed_request_sys_err_countOpensearchDashboardsReportingFailedRequestSysErrCount
aws_es_opensearch_dashboards_reporting_failed_request_user_err_countOpensearchDashboardsReportingFailedRequestUserErrCount
aws_es_opensearch_dashboards_reporting_request_countOpensearchDashboardsReportingRequestCount
aws_es_opensearch_dashboards_reporting_success_countOpensearchDashboardsReportingSuccessCount
aws_es_pplfailed_request_count_by_cus_errPPLFailedRequestCountByCusErr
aws_es_pplfailed_request_count_by_sys_errPPLFailedRequestCountBySysErr
aws_es_pplrequest_countPPLRequestCount
aws_es_primary_write_rejectedPrimaryWriteRejected
aws_es_read_iopsReadIOPS
aws_es_read_iopsmicro_burstingReadIOPSMicroBursting
aws_es_read_latencyReadLatency
aws_es_read_throughputReadThroughput
aws_es_read_throughput_micro_burstingReadThroughputMicroBursting
aws_es_remote_storage_used_spaceRemoteStorageUsedSpace
aws_es_remote_storage_write_rejectedRemoteStorageWriteRejected
aws_es_replica_write_rejectedReplicaWriteRejected
aws_es_replication_num_bootstrapping_indicesReplicationNumBootstrappingIndices
aws_es_replication_num_failed_indicesReplicationNumFailedIndices
aws_es_replication_num_paused_indicesReplicationNumPausedIndices
aws_es_replication_num_syncing_indicesReplicationNumSyncingIndices
aws_es_replication_rateReplicationRate
aws_es_sqldefault_cursor_request_countSQLDefaultCursorRequestCount
aws_es_sqlfailed_request_count_by_cus_errSQLFailedRequestCountByCusErr
aws_es_sqlfailed_request_count_by_sys_errSQLFailedRequestCountBySysErr
aws_es_sqlrequest_countSQLRequestCount
aws_es_sqlunhealthySQLUnhealthy
aws_es_search_latencySearchLatency
aws_es_search_rateSearchRate
aws_es_search_shard_task_cancelledSearchShardTaskCancelled
aws_es_search_task_cancelledSearchTaskCancelled
aws_es_searchable_documentsSearchableDocuments
aws_es_segment_countSegmentCount
aws_es_shards_activeShards.active
aws_es_shards_active_primaryShards.activePrimary
aws_es_shards_delayed_unassignedShards.delayedUnassigned
aws_es_shards_initializingShards.initializing
aws_es_shards_relocatingShards.relocating
aws_es_shards_unassignedShards.unassigned
aws_es_sys_memory_utilizationSysMemoryUtilization
aws_es_threadpool_bulk_queueThreadpoolBulkQueue
aws_es_threadpool_bulk_rejectedThreadpoolBulkRejected
aws_es_threadpool_bulk_threadsThreadpoolBulkThreads
aws_es_threadpool_force_merge_queueThreadpoolForce_mergeQueue
aws_es_threadpool_force_merge_rejectedThreadpoolForce_mergeRejected
aws_es_threadpool_force_merge_threadsThreadpoolForce_mergeThreads
aws_es_threadpool_index_queueThreadpoolIndexQueue
aws_es_threadpool_index_rejectedThreadpoolIndexRejected
aws_es_threadpool_index_threadsThreadpoolIndexThreads
aws_es_threadpool_search_queueThreadpoolSearchQueue
aws_es_threadpool_search_rejectedThreadpoolSearchRejected
aws_es_threadpool_search_threadsThreadpoolSearchThreads
aws_es_threadpool_write_queueThreadpoolWriteQueue
aws_es_threadpool_write_rejectedThreadpoolWriteRejected
aws_es_threadpool_write_threadsThreadpoolWriteThreads
aws_es_threadpoolsql_worker_queueThreadpoolsql-workerQueue
aws_es_threadpoolsql_worker_rejectedThreadpoolsql-workerRejected
aws_es_threadpoolsql_worker_threadsThreadpoolsql-workerThreads
aws_es_throughput_throttleThroughputThrottle
aws_es_total_point_in_timeTotalPointInTime
aws_es_warm_cpuutilizationWarmCPUUtilization
aws_es_warm_free_storage_spaceWarmFreeStorageSpace
aws_es_warm_jvmgcold_collection_countWarmJVMGCOldCollectionCount
aws_es_warm_jvmgcyoung_collection_countWarmJVMGCYoungCollectionCount
aws_es_warm_jvmgcyoung_collection_timeWarmJVMGCYoungCollectionTime
aws_es_warm_jvmmemory_pressureWarmJVMMemoryPressure
aws_es_warm_old_gen_jvmmemory_pressureWarmOldGenJVMMemoryPressure
aws_es_warm_search_latencyWarmSearchLatency
aws_es_warm_search_rateWarmSearchRate
aws_es_warm_searchable_documentsWarmSearchableDocuments
aws_es_warm_storage_space_utilizationWarmStorageSpaceUtilization
aws_es_warm_sys_memory_utilizationWarmSysMemoryUtilization
aws_es_warm_threadpool_search_queueWarmThreadpoolSearchQueue
aws_es_warm_threadpool_search_rejectedWarmThreadpoolSearchRejected
aws_es_warm_threadpool_search_threadsWarmThreadpoolSearchThreads
aws_es_warm_to_cold_migration_failure_countWarmToColdMigrationFailureCount
aws_es_warm_to_cold_migration_latencyWarmToColdMigrationLatency
aws_es_warm_to_cold_migration_queue_sizeWarmToColdMigrationQueueSize
aws_es_warm_to_cold_migration_success_countWarmToColdMigrationSuccessCount
aws_es_warm_to_hot_migration_queue_sizeWarmToHotMigrationQueueSize
aws_es_write_iopsWriteIOPS
aws_es_write_iopsmicro_burstingWriteIOPSMicroBursting
aws_es_write_latencyWriteLatency
aws_es_write_throughputWriteThroughput
aws_es_write_throughput_micro_burstingWriteThroughputMicroBursting

AWS/ElastiCache

Function: Managed Redis and Memcached for real-time caching

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_elasticache_info
aws_elasticache_active_defrag_hitsActiveDefragHits
aws_elasticache_authentication_failuresAuthenticationFailures
aws_elasticache_bytes_read_from_diskBytesReadFromDisk
aws_elasticache_bytes_read_into_memcachedBytesReadIntoMemcached
aws_elasticache_bytes_used_for_cacheBytesUsedForCache
aws_elasticache_bytes_used_for_cache_itemsBytesUsedForCacheItems
aws_elasticache_bytes_used_for_hashBytesUsedForHash
aws_elasticache_bytes_used_for_memory_dbBytesUsedForMemoryDB
aws_elasticache_bytes_written_out_from_memcachedBytesWrittenOutFromMemcached
aws_elasticache_bytes_written_to_diskBytesWrittenToDisk
aws_elasticache_cpucredit_balanceCPUCreditBalance
aws_elasticache_cpucredit_usageCPUCreditUsage
aws_elasticache_cpuutilizationCPUUtilization
aws_elasticache_cache_hit_rateCacheHitRate
aws_elasticache_cache_hitsCacheHits
aws_elasticache_cache_missesCacheMisses
aws_elasticache_cas_badvalCasBadval
aws_elasticache_cas_hitsCasHits
aws_elasticache_cas_missesCasMisses
aws_elasticache_channel_authorization_failuresChannelAuthorizationFailures
aws_elasticache_cluster_based_cmdsClusterBasedCmds
aws_elasticache_cluster_based_cmds_latencyClusterBasedCmdsLatency
aws_elasticache_cmd_config_getCmdConfigGet
aws_elasticache_cmd_config_setCmdConfigSet
aws_elasticache_cmd_flushCmdFlush
aws_elasticache_cmd_getCmdGet
aws_elasticache_cmd_setCmdSet
aws_elasticache_cmd_touchCmdTouch
aws_elasticache_command_authorization_failuresCommandAuthorizationFailures
aws_elasticache_curr_configCurrConfig
aws_elasticache_curr_connectionsCurrConnections
aws_elasticache_curr_itemsCurrItems
aws_elasticache_curr_volatile_itemsCurrVolatileItems
aws_elasticache_db0_average_ttlDB0AverageTTL
aws_elasticache_database_capacity_usage_counted_for_evict_percentageDatabaseCapacityUsageCountedForEvictPercentage
aws_elasticache_database_capacity_usage_percentageDatabaseCapacityUsagePercentage
aws_elasticache_database_memory_usage_counted_for_evict_percentageDatabaseMemoryUsageCountedForEvictPercentage
aws_elasticache_database_memory_usage_percentageDatabaseMemoryUsagePercentage
aws_elasticache_decr_hitsDecrHits
aws_elasticache_decr_missesDecrMisses
aws_elasticache_delete_hitsDeleteHits
aws_elasticache_delete_missesDeleteMisses
aws_elasticache_engine_cpuutilizationEngineCPUUtilization
aws_elasticache_eval_based_cmdsEvalBasedCmds
aws_elasticache_eval_based_cmds_latencyEvalBasedCmdsLatency
aws_elasticache_evicted_unfetchedEvictedUnfetched
aws_elasticache_evictionsEvictions
aws_elasticache_expired_unfetchedExpiredUnfetched
aws_elasticache_freeable_memoryFreeableMemory
aws_elasticache_geo_spatial_based_cmdsGeoSpatialBasedCmds
aws_elasticache_geo_spatial_based_cmds_latencyGeoSpatialBasedCmdsLatency
aws_elasticache_get_hitsGetHits
aws_elasticache_get_missesGetMisses
aws_elasticache_get_type_cmdsGetTypeCmds
aws_elasticache_get_type_cmds_latencyGetTypeCmdsLatency
aws_elasticache_global_datastore_replication_lagGlobalDatastoreReplicationLag
aws_elasticache_hash_based_cmdsHashBasedCmds
aws_elasticache_hash_based_cmds_latencyHashBasedCmdsLatency
aws_elasticache_hyper_log_log_based_cmdsHyperLogLogBasedCmds
aws_elasticache_hyper_log_log_based_cmds_latencyHyperLogLogBasedCmdsLatency
aws_elasticache_iam_authentication_expirationsIamAuthenticationExpirations
aws_elasticache_iam_authentication_throttlingIamAuthenticationThrottling
aws_elasticache_incr_hitsIncrHits
aws_elasticache_incr_missesIncrMisses
aws_elasticache_is_masterIsMaster
aws_elasticache_is_primaryIsPrimary
aws_elasticache_json_based_cmdsJsonBasedCmds
aws_elasticache_json_based_cmds_latencyJsonBasedCmdsLatency
aws_elasticache_json_based_get_cmdsJsonBasedGetCmds
aws_elasticache_key_authorization_failuresKeyAuthorizationFailures
aws_elasticache_key_based_cmdsKeyBasedCmds
aws_elasticache_key_based_cmds_latencyKeyBasedCmdsLatency
aws_elasticache_keys_trackedKeysTracked
aws_elasticache_keyspace_hitsKeyspaceHits
aws_elasticache_keyspace_missesKeyspaceMisses
aws_elasticache_list_based_cmdsListBasedCmds
aws_elasticache_list_based_cmds_latencyListBasedCmdsLatency
aws_elasticache_master_link_health_statusMasterLinkHealthStatus
aws_elasticache_max_replication_throughputMaxReplicationThroughput
aws_elasticache_memory_fragmentation_ratioMemoryFragmentationRatio
aws_elasticache_network_bandwidth_in_allowance_exceededNetworkBandwidthInAllowanceExceeded
aws_elasticache_network_bandwidth_out_allowance_exceededNetworkBandwidthOutAllowanceExceeded
aws_elasticache_network_bytes_inNetworkBytesIn
aws_elasticache_network_bytes_outNetworkBytesOut
aws_elasticache_network_conntrack_allowance_exceededNetworkConntrackAllowanceExceeded
aws_elasticache_network_link_local_allowance_exceededNetworkLinkLocalAllowanceExceeded
aws_elasticache_network_max_bytes_inNetworkMaxBytesIn
awselasticache_network_max_bytes_outNetworkMaxBytesOut
aws_elasticache_network_max_packets_inNetworkMaxPacketsIn
aws_elasticache_network_max_packets_outNetworkMaxPacketsOut
aws_elasticache_network_packets_inNetworkPacketsIn
aws_elasticache_network_packets_outNetworkPacketsOut
aws_elasticache_network_packets_per_second_allowance_exceededNetworkPacketsPerSecondAllowanceExceeded
aws_elasticache_new_connectionsNewConnections
aws_elasticache_new_itemsNewItems
aws_elasticache_num_items_read_from_diskNumItemsReadFromDisk
aws_elasticache_num_items_written_to_diskNumItemsWrittenToDisk
aws_elasticache_primary_link_health_statusPrimaryLinkHealthStatus
aws_elasticache_pub_sub_based_cmdsPubSubBasedCmds
aws_elasticache_pub_sub_based_cmds_latencyPubSubBasedCmdsLatency
aws_elasticache_reclaimedReclaimed
aws_elasticache_replication_bytesReplicationBytes
aws_elasticache_replication_delayed_write_commandsReplicationDelayedWriteCommands
aws_elasticache_replication_lagReplicationLag
aws_elasticache_save_in_progressSaveInProgress
aws_elasticache_search_based_cmdsSearchBasedCmds
aws_elasticache_search_based_get_cmdsSearchBasedGetCmds
aws_elasticache_search_based_set_cmdsSearchBasedSetCmds
aws_elasticache_search_number_of_indexed_keysSearchNumberOfIndexedKeys
aws_elasticache_search_number_of_indexesSearchNumberOfIndexes
aws_elasticache_search_total_index_sizeSearchTotalIndexSize
aws_elasticache_set_based_cmdsSetBasedCmds
aws_elasticache_set_based_cmds_latencySetBasedCmdsLatency
aws_elasticache_set_type_cmdsSetTypeCmds
aws_elasticache_set_type_cmds_latencySetTypeCmdsLatency
aws_elasticache_slabs_movedSlabsMoved
aws_elasticache_sorted_set_based_cmdsSortedSetBasedCmds
aws_elasticache_sorted_set_based_cmds_latencySortedSetBasedCmdsLatency
aws_elasticache_stream_based_cmdsStreamBasedCmds
aws_elasticache_stream_based_cmds_latencyStreamBasedCmdsLatency
aws_elasticache_string_based_cmdsStringBasedCmds
aws_elasticache_string_based_cmds_latencyStringBasedCmdsLatency
aws_elasticache_swap_usageSwapUsage
aws_elasticache_touch_hitsTouchHits
aws_elasticache_touch_missesTouchMisses
aws_elasticache_traffic_management_activeTrafficManagementActive
aws_elasticache_unused_memoryUnusedMemory

AWS/ElasticBeanstalk

Function: Service to quickly deploy and manage applications in the cloud without provisioning resources

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_elasticbeanstalk_info
aws_elasticbeanstalk_application_latency_p10ApplicationLatencyP10
aws_elasticbeanstalk_application_latency_p50ApplicationLatencyP50
aws_elasticbeanstalk_application_latency_p75ApplicationLatencyP75
aws_elasticbeanstalk_application_latency_p85ApplicationLatencyP85
aws_elasticbeanstalk_application_latency_p90ApplicationLatencyP90
aws_elasticbeanstalk_application_latency_p95ApplicationLatencyP95
aws_elasticbeanstalk_application_latency_p99ApplicationLatencyP99
aws_elasticbeanstalk_application_latency_p99_9ApplicationLatencyP99.9
aws_elasticbeanstalk_application_requests2xxApplicationRequests2xx
aws_elasticbeanstalk_application_requests3xxApplicationRequests3xx
aws_elasticbeanstalk_application_requests4xxApplicationRequests4xx
aws_elasticbeanstalk_application_requests5xxApplicationRequests5xx
aws_elasticbeanstalk_application_requests_totalApplicationRequestsTotal
aws_elasticbeanstalk_cpuidleCPUIdle
aws_elasticbeanstalk_cpuiowaitCPUIowait
aws_elasticbeanstalk_cpuirqCPUIrq
aws_elasticbeanstalk_cpuniceCPUNice
aws_elasticbeanstalk_cpusoftirqCPUSoftirq
aws_elasticbeanstalk_cpusystemCPUSystem
aws_elasticbeanstalk_cpuuserCPUUser
aws_elasticbeanstalk_environment_healthEnvironmentHealth
aws_elasticbeanstalk_instance_healthInstanceHealth
aws_elasticbeanstalk_instances_degradedInstancesDegraded
aws_elasticbeanstalk_instances_infoInstancesInfo
aws_elasticbeanstalk_instances_no_dataInstancesNoData
aws_elasticbeanstalk_instances_okInstancesOk
aws_elasticbeanstalk_instances_pendingInstancesPending
aws_elasticbeanstalk_instances_severeInstancesSevere
aws_elasticbeanstalk_instances_unknownInstancesUnknown
aws_elasticbeanstalk_instances_warningInstancesWarning
aws_elasticbeanstalk_load_average1minLoadAverage1min
aws_elasticbeanstalk_load_average5minLoadAverage5min
aws_elasticbeanstalk_root_filesystem_utilRootFilesystemUtil

AWS/ElasticMapReduce

Function: Managed big data platform for processing large amounts of data using Hadoop

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_elasticmapreduce_info
aws_elasticmapreduce_apps_completedAppsCompleted
aws_elasticmapreduce_apps_failedAppsFailed
aws_elasticmapreduce_apps_killedAppsKilled
aws_elasticmapreduce_apps_pendingAppsPending
aws_elasticmapreduce_apps_runningAppsRunning
aws_elasticmapreduce_apps_submittedAppsSubmitted
aws_elasticmapreduce_backup_failedBackupFailed
aws_elasticmapreduce_capacity_remaining_gbCapacityRemainingGB
aws_elasticmapreduce_cluster_statusCluster Status
aws_elasticmapreduce_container_allocatedContainerAllocated
aws_elasticmapreduce_container_pendingContainerPending
aws_elasticmapreduce_container_pending_ratioContainerPendingRatio
aws_elasticmapreduce_container_reservedContainerReserved
aws_elasticmapreduce_core_nodes_pendingCoreNodesPending
aws_elasticmapreduce_core_nodes_runningCoreNodesRunning
aws_elasticmapreduce_corrupt_blocksCorruptBlocks
aws_elasticmapreduce_dfs_pending_replication_blocksDfsPendingReplicationBlocks
aws_elasticmapreduce_hbaseHBase
aws_elasticmapreduce_hdfsbytes_readHDFSBytesRead
aws_elasticmapreduce_hdfsbytes_writtenHDFSBytesWritten
aws_elasticmapreduce_hdfsutilizationHDFSUtilization
aws_elasticmapreduce_hbase_backup_failedHbaseBackupFailed
aws_elasticmapreduce_ioIO
aws_elasticmapreduce_is_idleIsIdle
aws_elasticmapreduce_jobs_failedJobsFailed
aws_elasticmapreduce_jobs_runningJobsRunning
aws_elasticmapreduce_live_data_nodesLiveDataNodes
aws_elasticmapreduce_live_task_trackersLiveTaskTrackers
aws_elasticmapreduce_mractive_nodesMRActiveNodes
aws_elasticmapreduce_mrdecommissioned_nodesMRDecommissionedNodes
aws_elasticmapreduce_mrlost_nodesMRLostNodes
aws_elasticmapreduce_mrrebooted_nodesMRRebootedNodes
aws_elasticmapreduce_mrtotal_nodesMRTotalNodes
aws_elasticmapreduce_mrunhealthy_nodesMRUnhealthyNodes
aws_elasticmapreduce_map_reduceMap/Reduce
aws_elasticmapreduce_map_slots_openMapSlotsOpen
aws_elasticmapreduce_map_tasks_remainingMapTasksRemaining
aws_elasticmapreduce_map_tasks_runningMapTasksRunning
aws_elasticmapreduce_memory_allocated_mbMemoryAllocatedMB
aws_elasticmapreduce_memory_available_mbMemoryAvailableMB
aws_elasticmapreduce_memory_reserved_mbMemoryReservedMB
aws_elasticmapreduce_memory_total_mbMemoryTotalMB
aws_elasticmapreduce_missing_blocksMissingBlocks
aws_elasticmapreduce_most_recent_backup_durationMostRecentBackupDuration
aws_elasticmapreduce_node_statusNode Status
aws_elasticmapreduce_pending_deletion_blocksPendingDeletionBlocks
aws_elasticmapreduce_reduce_slots_openReduceSlotsOpen
aws_elasticmapreduce_reduce_tasks_remainingReduceTasksRemaining
aws_elasticmapreduce_reduce_tasks_runningReduceTasksRunning
aws_elasticmapreduce_remaining_map_tasks_per_slotRemainingMapTasksPerSlot
aws_elasticmapreduce_s3_bytes_readS3BytesRead
aws_elasticmapreduce_s3_bytes_writtenS3BytesWritten
aws_elasticmapreduce_task_nodes_pendingTaskNodesPending
aws_elasticmapreduce_task_nodes_runningTaskNodesRunning
aws_elasticmapreduce_time_since_last_successful_backupTimeSinceLastSuccessfulBackup
aws_elasticmapreduce_total_loadTotalLoad
aws_elasticmapreduce_under_replicated_blocksUnderReplicatedBlocks
aws_elasticmapreduce_yarnmemory_available_percentageYARNMemoryAvailablePercentage

AWS/Events

Function: Delivers a near real-time stream of system events for building reactive applications

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_events_info
aws_events_dead_letter_invocationsDeadLetterInvocations
awseventsEvents
aws_events_failed_invocationsFailedInvocations
aws_events_ingestionto_invocation_complete_latencyIngestiontoInvocationCompleteLatency
aws_events_ingestionto_invocation_start_latencyIngestiontoInvocationStartLatency
aws_events_invocation_attemptsInvocationAttempts
aws_events_invocationsInvocations
aws_events_invocations_createdInvocationsCreated
aws_events_invocations_failed_to_be_sent_to_dlqInvocationsFailedToBeSentToDlq
aws_events_invocations_sent_to_dlqInvocationsSentToDlq
aws_events_matched_eventsMatchedEvents
aws_events_put_events_approximate_call_countPutEventsApproximateCallCount
aws_events_put_events_approximate_failed_countPutEventsApproximateFailedCount
aws_events_put_events_approximate_success_countPutEventsApproximateSuccessCount
aws_events_put_events_approximate_throttled_countPutEventsApproximateThrottledCount
aws_events_put_events_entries_countPutEventsEntriesCount
aws_events_put_events_failed_entries_countPutEventsFailedEntriesCount
aws_events_put_events_latencyPutEventsLatency
aws_events_put_events_request_sizePutEventsRequestSize
aws_events_put_partner_events_approximate_call_countPutPartnerEventsApproximateCallCount
aws_events_put_partner_events_approximate_failed_countPutPartnerEventsApproximateFailedCount
aws_events_put_partner_events_approximate_success_countPutPartnerEventsApproximateSuccessCount
aws_events_put_partner_events_approximate_throttled_countPutPartnerEventsApproximateThrottledCount
aws_events_put_partner_events_entries_countPutPartnerEventsEntriesCount
aws_events_put_partner_events_failed_entries_countPutPartnerEventsFailedEntriesCount
aws_events_put_partner_events_latencyPutPartnerEventsLatency
aws_events_retry_invocation_attemptsRetryInvocationAttempts
aws_events_successful_invocation_attemptsSuccessfulInvocationAttempts
aws_events_throttled_rulesThrottledRules
aws_events_triggered_rulesTriggeredRules

AWS/FSx

Function: Managed file systems optimized for specific workloads like Windows and Lustre

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_fsx_info
aws_fsx_cpuutilizationCPUUtilization
aws_fsx_client_connectionsClientConnections
aws_fsx_data_read_bytesDataReadBytes
aws_fsx_data_read_operationsDataReadOperations
aws_fsx_data_write_bytesDataWriteBytes
aws_fsx_data_write_operationsDataWriteOperations
aws_fsx_deduplication_saved_storageDeduplicationSavedStorage
aws_fsx_disk_iops_utilizationDiskIopsUtilization
aws_fsx_disk_read_bytesDiskReadBytes
aws_fsx_disk_read_operationsDiskReadOperations
aws_fsx_disk_throughput_balanceDiskThroughputBalance
aws_fsx_disk_throughput_utilizationDiskThroughputUtilization
aws_fsx_disk_write_bytesDiskWriteBytes
aws_fsx_disk_write_operationsDiskWriteOperations
aws_fsx_file_server_disk_iops_balanceFileServerDiskIopsBalance
aws_fsx_file_server_disk_iops_utilizationFileServerDiskIopsUtilization
aws_fsx_file_server_disk_throughput_balanceFileServerDiskThroughputBalance
aws_fsx_file_server_disk_throughput_utilizationFileServerDiskThroughputUtilization
aws_fsx_free_data_storage_capacityFreeDataStorageCapacity
aws_fsx_free_storage_capacityFreeStorageCapacity
aws_fsx_memory_utilizationMemoryUtilization
aws_fsx_metadata_operationsMetadataOperations
aws_fsx_network_throughput_utilizationNetworkThroughputUtilization
aws_fsx_storage_capacity_utilizationStorageCapacityUtilization

AWS/Firehose

Function: Service to reliably load streaming data into AWS data stores like S3 and Redshift

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_firehose_info
aws_firehose_active_partitions_limitActivePartitionsLimit
aws_firehose_backup_to_s3_bytesBackupToS3.Bytes
aws_firehose_backup_to_s3_data_freshnessBackupToS3.DataFreshness
aws_firehose_backup_to_s3_recordsBackupToS3.Records
aws_firehose_backup_to_s3_successBackupToS3.Success
aws_firehose_bytes_per_second_limitBytesPerSecondLimit
aws_firehose_data_read_from_kinesis_stream_bytesDataReadFromKinesisStream.Bytes
aws_firehose_data_read_from_kinesis_stream_recordsDataReadFromKinesisStream.Records
aws_firehose_data_read_from_source_backpressuredDataReadFromSource.Backpressured
aws_firehose_data_read_from_source_bytesDataReadFromSource.Bytes
aws_firehose_data_read_from_source_recordsDataReadFromSource.Records
aws_firehose_delivery_to_amazon_open_search_serverless_auth_failureDeliveryToAmazonOpenSearchServerless.AuthFailure
aws_firehose_delivery_to_amazon_open_search_serverless_bytesDeliveryToAmazonOpenSearchServerless.Bytes
aws_firehose_delivery_to_amazon_open_search_serverless_data_freshnessDeliveryToAmazonOpenSearchServerless.DataFreshness
aws_firehose_delivery_to_amazon_open_search_serverless_delivery_rejectedDeliveryToAmazonOpenSearchServerless.DeliveryRejected
aws_firehose_delivery_to_amazon_open_search_serverless_recordsDeliveryToAmazonOpenSearchServerless.Records
aws_firehose_delivery_to_amazon_open_search_serverless_successDeliveryToAmazonOpenSearchServerless.Success
aws_firehose_delivery_to_amazon_open_search_service_auth_failureDeliveryToAmazonOpenSearchService.AuthFailure
aws_firehose_delivery_to_amazon_open_search_service_bytesDeliveryToAmazonOpenSearchService.Bytes
aws_firehose_delivery_to_amazon_open_search_service_data_freshnessDeliveryToAmazonOpenSearchService.DataFreshness
aws_firehose_delivery_to_amazon_open_search_service_delivery_rejectedDeliveryToAmazonOpenSearchService.DeliveryRejected
aws_firehose_delivery_to_amazon_open_search_service_recordsDeliveryToAmazonOpenSearchService.Records
aws_firehose_delivery_to_amazon_open_search_service_successDeliveryToAmazonOpenSearchService.Success
aws_firehose_delivery_to_elasticsearch_bytesDeliveryToElasticsearch.Bytes
aws_firehose_delivery_to_elasticsearch_recordsDeliveryToElasticsearch.Records
aws_firehose_delivery_to_elasticsearch_successDeliveryToElasticsearch.Success
aws_firehose_delivery_to_http_endpoint_bytesDeliveryToHttpEndpoint.Bytes
aws_firehose_delivery_to_http_endpoint_data_freshnessDeliveryToHttpEndpoint.DataFreshness
aws_firehose_delivery_to_http_endpoint_processed_bytesDeliveryToHttpEndpoint.ProcessedBytes
aws_firehose_delivery_to_http_endpoint_processed_recordsDeliveryToHttpEndpoint.ProcessedRecords
aws_firehose_delivery_to_http_endpoint_recordsDeliveryToHttpEndpoint.Records
aws_firehose_delivery_to_http_endpoint_successDeliveryToHttpEndpoint.Success
aws_firehose_delivery_to_redshift_bytesDeliveryToRedshift.Bytes
aws_firehose_delivery_to_redshift_recordsDeliveryToRedshift.Records
aws_firehose_delivery_to_redshift_successDeliveryToRedshift.Success
aws_firehose_delivery_to_s3_bytesDeliveryToS3.Bytes
aws_firehose_delivery_to_s3_data_freshnessDeliveryToS3.DataFreshness
aws_firehose_delivery_to_s3_object_countDeliveryToS3.ObjectCount
aws_firehose_delivery_to_s3_recordsDeliveryToS3.Records
aws_firehose_delivery_to_s3_successDeliveryToS3.Success
aws_firehose_delivery_to_snowflake_bytesDeliveryToSnowflake.Bytes
aws_firehose_delivery_to_snowflake_data_commit_latencyDeliveryToSnowflake.DataCommitLatency
aws_firehose_delivery_to_snowflake_data_freshnessDeliveryToSnowflake.DataFreshness
aws_firehose_delivery_to_snowflake_recordsDeliveryToSnowflake.Records
aws_firehose_delivery_to_snowflake_successDeliveryToSnowflake.Success
aws_firehose_delivery_to_splunk_bytesDeliveryToSplunk.Bytes
aws_firehose_delivery_to_splunk_data_ack_latencyDeliveryToSplunk.DataAckLatency
aws_firehose_delivery_to_splunk_data_freshnessDeliveryToSplunk.DataFreshness
aws_firehose_delivery_to_splunk_recordsDeliveryToSplunk.Records
aws_firehose_delivery_to_splunk_successDeliveryToSplunk.Success
aws_firehose_describe_delivery_stream_latencyDescribeDeliveryStream.Latency
aws_firehose_describe_delivery_stream_requestsDescribeDeliveryStream.Requests
aws_firehose_execute_processing_durationExecuteProcessing.Duration
aws_firehose_execute_processing_successExecuteProcessing.Success
aws_firehose_failed_conversion_bytesFailedConversion.Bytes
aws_firehose_failed_conversion_recordsFailedConversion.Records
aws_firehose_failed_validation_bytesFailedValidation.Bytes
aws_firehose_failed_validation_recordsFailedValidation.Records
aws_firehose_incoming_bytesIncomingBytes
aws_firehose_incoming_put_requestsIncomingPutRequests
aws_firehose_incoming_recordsIncomingRecords
aws_firehose_jqprocessing_durationJQProcessing.Duration
aws_firehose_kmskey_access_deniedKMSKeyAccessDenied
aws_firehose_kmskey_disabledKMSKeyDisabled
aws_firehose_kmskey_invalid_stateKMSKeyInvalidState
aws_firehose_kmskey_not_foundKMSKeyNotFound
aws_firehose_kafka_offset_lagKafkaOffsetLag
aws_firehose_kinesis_millis_behind_latestKinesisMillisBehindLatest
aws_firehose_list_delivery_streams_latencyListDeliveryStreams.Latency
aws_firehose_list_delivery_streams_requestsListDeliveryStreams.Requests
aws_firehose_output_decompressed_bytes_failedOutputDecompressedBytes.Failed
aws_firehose_output_decompressed_bytes_successOutputDecompressedBytes.Success
aws_firehose_output_decompressed_records_failedOutputDecompressedRecords.Failed
aws_firehose_output_decompressed_records_successOutputDecompressedRecords.Success
aws_firehose_partition_countPartitionCount
aws_firehose_partition_count_exceededPartitionCountExceeded
aws_firehose_per_partition_throughputPerPartitionThroughput
aws_firehose_put_record_bytesPutRecord.Bytes
aws_firehose_put_record_latencyPutRecord.Latency
aws_firehose_put_record_requestsPutRecord.Requests
aws_firehose_put_record_batch_bytesPutRecordBatch.Bytes
aws_firehose_put_record_batch_latencyPutRecordBatch.Latency
aws_firehose_put_record_batch_recordsPutRecordBatch.Records
aws_firehose_put_record_batch_requestsPutRecordBatch.Requests
aws_firehose_put_requests_per_second_limitPutRequestsPerSecondLimit
aws_firehose_records_per_second_limitRecordsPerSecondLimit
aws_firehose_resource_countResourceCount
aws_firehose_source_throttled_delaySourceThrottled.Delay
aws_firehose_succeed_conversion_bytesSucceedConversion.Bytes
aws_firehose_succeed_conversion_recordsSucceedConversion.Records
aws_firehose_succeed_processing_bytesSucceedProcessing.Bytes
aws_firehose_succeed_processing_recordsSucceedProcessing.Records
aws_firehose_throttled_describe_streamThrottledDescribeStream
aws_firehose_throttled_get_recordsThrottledGetRecords
aws_firehose_throttled_get_shard_iteratorThrottledGetShardIterator
aws_firehose_throttled_recordsThrottledRecords
aws_firehose_update_delivery_stream_latencyUpdateDeliveryStream.Latency
aws_firehose_update_delivery_stream_requestsUpdateDeliveryStream.Requests

AWS/GameLift

Function: Managed service for deploying, operating, and scaling dedicated game servers

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_gamelift_info
aws_gamelift_activating_game_sessionsActivatingGameSessions
aws_gamelift_active_game_sessionsActiveGameSessions
aws_gamelift_active_instancesActiveInstances
aws_gamelift_active_server_processesActiveServerProcesses
aws_gamelift_available_game_serversAvailableGameServers
aws_gamelift_available_game_sessionsAvailableGameSessions
aws_gamelift_average_wait_timeAverageWaitTime
aws_gamelift_current_player_sessionsCurrentPlayerSessions
aws_gamelift_current_ticketsCurrentTickets
aws_gamelift_desired_instancesDesiredInstances
aws_gamelift_draining_available_game_serversDrainingAvailableGameServers
aws_gamelift_draining_utilized_game_serversDrainingUtilizedGameServers
aws_gamelift_first_choice_not_viableFirstChoiceNotViable
aws_gamelift_first_choice_out_of_capacityFirstChoiceOutOfCapacity
aws_gamelift_game_session_interruptionsGameSessionInterruptions
aws_gamelift_healthy_server_processesHealthyServerProcesses
aws_gamelift_idle_instancesIdleInstances
aws_gamelift_instance_interruptionsInstanceInterruptions
aws_gamelift_lowest_latency_placementLowestLatencyPlacement
aws_gamelift_lowest_price_placementLowestPricePlacement
aws_gamelift_match_acceptances_timed_outMatchAcceptancesTimedOut
aws_gamelift_matches_acceptedMatchesAccepted
aws_gamelift_matches_createdMatchesCreated
aws_gamelift_matches_placedMatchesPlaced
aws_gamelift_matches_rejectedMatchesRejected
aws_gamelift_max_instancesMaxInstances
aws_gamelift_min_instancesMinInstances
aws_gamelift_percent_available_game_sessionsPercentAvailableGameSessions
aws_gamelift_percent_healthy_server_processesPercentHealthyServerProcesses
aws_gamelift_percent_idle_instancesPercentIdleInstances
aws_gamelift_placementPlacement
aws_gamelift_placements_canceledPlacementsCanceled
aws_gamelift_placements_failedPlacementsFailed
aws_gamelift_placements_startedPlacementsStarted
aws_gamelift_placements_succeededPlacementsSucceeded
aws_gamelift_placements_timed_outPlacementsTimedOut
aws_gamelift_player_session_activationsPlayerSessionActivations
aws_gamelift_players_startedPlayersStarted
aws_gamelift_queue_depthQueueDepth
aws_gamelift_rule_evaluations_failedRuleEvaluationsFailed
aws_gamelift_rule_evaluations_passedRuleEvaluationsPassed
aws_gamelift_server_process_abnormal_terminationsServerProcessAbnormalTerminations
aws_gamelift_server_process_activationsServerProcessActivations
aws_gamelift_server_process_terminationsServerProcessTerminations
aws_gamelift_tickets_failedTicketsFailed
aws_gamelift_tickets_startedTicketsStarted
aws_gamelift_tickets_timed_outTicketsTimedOut
aws_gamelift_time_to_matchTimeToMatch
aws_gamelift_time_to_ticket_successTimeToTicketSuccess
aws_gamelift_utilized_game_serversUtilizedGameServers

AWS/GlobalAccelerator

Function: Provides static IP addresses to improve availability and performance for global applications

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_globalaccelerator_info
aws_globalaccelerator_healthy_endpoint_countHealthyEndpointCount
aws_globalaccelerator_new_flow_countNewFlowCount
aws_globalaccelerator_processed_bytes_inProcessedBytesIn
aws_globalaccelerator_processed_bytes_outProcessedBytesOut
aws_globalaccelerator_unhealthy_endpoint_countUnhealthyEndpointCount

AWS/Glue

Function: Managed ETL service that prepares and loads data for analytics

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_glue_info
aws_glue_all_disk_available_gbglue.ALL.disk.available_GB
aws_glue_all_disk_used_percentageglue.ALL.disk.used.percentage
aws_glue_all_disk_used_gbglue.ALL.disk.used_GB
aws_glue_all_jvm_heap_usageglue.ALL.jvm.heap.usage
aws_glue_all_jvm_heap_usedglue.ALL.jvm.heap.used
aws_glue_all_memory_heap_availableglue.ALL.memory.heap.available
aws_glue_all_memory_heap_usedglue.ALL.memory.heap.used
aws_glue_all_memory_heap_used_percentageglue.ALL.memory.heap.used.percentage
aws_glue_all_memory_non_heap_availableglue.ALL.memory.non-heap.available
aws_glue_all_memory_non_heap_percentageglue.ALL.memory.non-heap.percentage
aws_glue_all_memory_non_heap_usedglue.ALL.memory.non-heap.used
aws_glue_all_memory_total_availableglue.ALL.memory.total.available
aws_glue_all_memory_total_usedglue.ALL.memory.total.used
aws_glue_all_memory_total_used_percentageglue.ALL.memory.total.used.percentage
aws_glue_all_s3_filesystem_read_bytesglue.ALL.s3.filesystem.read_bytes
aws_glue_all_s3_filesystem_write_bytesglue.ALL.s3.filesystem.write_bytes
aws_glue_all_system_cpu_system_loadglue.ALL.system.cpuSystemLoad
aws_glue_driver_block_manager_disk_disk_space_used_mbglue.driver.BlockManager.disk.diskSpaceUsed_MB
aws_glue_driver_executor_allocation_manager_executors_number_all_executorsglue.driver.ExecutorAllocationManager.executors.numberAllExecutors
aws_glue_driver_executor_allocation_manager_executors_number_max_needed_executorsglue.driver.ExecutorAllocationManager.executors.numberMaxNeededExecutors
aws_glue_driver_aggregate_bytes_readglue.driver.aggregate.bytesRead
aws_glue_driver_aggregate_elapsed_timeglue.driver.aggregate.elapsedTime
aws_glue_driver_aggregate_num_completed_stagesglue.driver.aggregate.numCompletedStages
aws_glue_driver_aggregate_num_completed_tasksglue.driver.aggregate.numCompletedTasks
aws_glue_driver_aggregate_num_failed_tasksglue.driver.aggregate.numFailedTasks
aws_glue_driver_aggregate_num_killed_tasksglue.driver.aggregate.numKilledTasks
aws_glue_driver_aggregate_records_readglue.driver.aggregate.recordsRead
aws_glue_driver_aggregate_shuffle_bytes_writtenglue.driver.aggregate.shuffleBytesWritten
aws_glue_driver_aggregate_shuffle_local_bytes_readglue.driver.aggregate.shuffleLocalBytesRead
aws_glue_driver_bytes_readglue.driver.bytesRead
aws_glue_driver_bytes_writtenglue.driver.bytesWritten
aws_glue_driver_disk_available_gbglue.driver.disk.available_GB
aws_glue_driver_disk_used_percentageglue.driver.disk.used.percentage
aws_glue_driver_disk_used_gbglue.driver.disk.used_GB
aws_glue_driver_files_readglue.driver.filesRead
aws_glue_driver_files_writtenglue.driver.filesWritten
aws_glue_driver_jvm_heap_usageglue.driver.jvm.heap.usage
aws_glue_driver_jvm_heap_usedglue.driver.jvm.heap.used
aws_glue_driver_memory_heap_availableglue.driver.memory.heap.available
aws_glue_driver_memory_heap_usedglue.driver.memory.heap.used
aws_glue_driver_memory_heap_used_percentageglue.driver.memory.heap.used.percentage
aws_glue_driver_memory_non_heap_availableglue.driver.memory.non-heap.available
aws_glue_driver_memory_non_heap_percentageglue.driver.memory.non-heap.percentage
aws_glue_driver_memory_non_heap_usedglue.driver.memory.non-heap.used
aws_glue_driver_memory_total_availableglue.driver.memory.total.available
aws_glue_driver_memory_total_usedglue.driver.memory.total.used
aws_glue_driver_memory_total_used_percentageglue.driver.memory.total.used.percentage
aws_glue_driver_partitions_readglue.driver.partitionsRead
aws_glue_driver_records_readglue.driver.recordsRead
aws_glue_driver_records_writtenglue.driver.recordsWritten
aws_glue_driver_s3_filesystem_read_bytesglue.driver.s3.filesystem.read_bytes
aws_glue_driver_s3_filesystem_write_bytesglue.driver.s3.filesystem.write_bytes
aws_glue_driver_skewness_jobglue.driver.skewness.job
aws_glue_driver_skewness_stageglue.driver.skewness.stage
aws_glue_driver_streaming_batch_processing_time_in_msglue.driver.streaming.batchProcessingTimeInMs
aws_glue_driver_streaming_num_recordsglue.driver.streaming.numRecords
aws_glue_driver_system_cpu_system_loadglue.driver.system.cpuSystemLoad
aws_glue_driver_worker_utilizationglue.driver.workerUtilization
aws_glue_error_allglue.error.ALL
aws_glue_succeed_allglue.succeed.ALL

AWS/IoT

Function: Provides cloud services to connect IoT devices to the cloud and manage IoT workloads

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_iot_info
aws_iot_canceled_job_execution_countCanceledJobExecutionCount
aws_iot_canceled_job_execution_total_countCanceledJobExecutionTotalCount
aws_iot_client_errorClientError
aws_iot_connect_auth_errorConnect.AuthError
aws_iot_connect_client_errorConnect.ClientError
aws_iot_connect_server_errorConnect.ServerError
aws_iot_connect_successConnect.Success
aws_iot_connect_throttleConnect.Throttle
aws_iot_delete_thing_shadow_acceptedDeleteThingShadow.Accepted
aws_iot_failed_job_execution_countFailedJobExecutionCount
aws_iot_failed_job_execution_total_countFailedJobExecutionTotalCount
aws_iot_failureFailure
aws_iot_get_thing_shadow_acceptedGetThingShadow.Accepted
aws_iot_in_progress_job_execution_countInProgressJobExecutionCount
aws_iot_in_progress_job_execution_total_countInProgressJobExecutionTotalCount
aws_iot_non_compliant_resourcesNonCompliantResources
aws_iot_num_log_batches_failed_to_publish_throttledNumLogBatchesFailedToPublishThrottled
aws_iot_num_log_events_failed_to_publish_throttledNumLogEventsFailedToPublishThrottled
aws_iot_parse_errorParseError
aws_iot_ping_successPing.Success
aws_iot_publish_in_auth_errorPublishIn.AuthError
aws_iot_publish_in_client_errorPublishIn.ClientError
aws_iot_publish_in_server_errorPublishIn.ServerError
aws_iot_publish_in_successPublishIn.Success
aws_iot_publish_in_throttlePublishIn.Throttle
aws_iot_publish_out_auth_errorPublishOut.AuthError
aws_iot_publish_out_client_errorPublishOut.ClientError
aws_iot_publish_out_successPublishOut.Success
aws_iot_queued_job_execution_countQueuedJobExecutionCount
aws_iot_queued_job_execution_total_countQueuedJobExecutionTotalCount
aws_iot_rejected_job_execution_countRejectedJobExecutionCount
aws_iot_rejected_job_execution_total_countRejectedJobExecutionTotalCount
aws_iot_removed_job_execution_countRemovedJobExecutionCount
aws_iot_removed_job_execution_total_countRemovedJobExecutionTotalCount
aws_iot_resources_evaluatedResourcesEvaluated
aws_iot_rule_message_throttledRuleMessageThrottled
aws_iot_rule_not_foundRuleNotFound
aws_iot_rules_executedRulesExecuted
aws_iot_server_errorServerError
aws_iot_subscribe_auth_errorSubscribe.AuthError
aws_iot_subscribe_client_errorSubscribe.ClientError
aws_iot_subscribe_server_errorSubscribe.ServerError
aws_iot_subscribe_successSubscribe.Success
aws_iot_subscribe_throttleSubscribe.Throttle
aws_iot_succeded_job_execution_countSuccededJobExecutionCount
aws_iot_succeded_job_execution_total_countSuccededJobExecutionTotalCount
aws_iot_successSuccess
aws_iot_topic_matchTopicMatch
aws_iot_unsubscribe_client_errorUnsubscribe.ClientError
aws_iot_unsubscribe_server_errorUnsubscribe.ServerError
aws_iot_unsubscribe_successUnsubscribe.Success
aws_iot_unsubscribe_throttleUnsubscribe.Throttle
aws_iot_update_thing_shadow_acceptedUpdateThingShadow.Accepted
aws_iot_violationsViolations
aws_iot_violations_clearedViolationsCleared
aws_iot_violations_invalidatedViolationsInvalidated

AWS/Kafka

Function: Managed Apache Kafka service for building real-time streaming applications

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_kafka_info
aws_kafka_active_controller_countActiveControllerCount
aws_kafka_burst_balanceBurstBalance
aws_kafka_bw_in_allowance_exceededBwInAllowanceExceeded
aws_kafka_bw_out_allowance_exceededBwOutAllowanceExceeded
aws_kafka_bytes_in_per_secBytesInPerSec
aws_kafka_bytes_out_per_secBytesOutPerSec
aws_kafka_cpucredit_balanceCPUCreditBalance
aws_kafka_client_connection_countClientConnectionCount
aws_kafka_conn_track_allowance_exceededConnTrackAllowanceExceeded
aws_kafka_connection_close_rateConnectionCloseRate
aws_kafka_connection_countConnectionCount
aws_kafka_connection_creation_rateConnectionCreationRate
aws_kafka_cpu_credit_usageCpuCreditUsage
aws_kafka_cpu_idleCpuIdle
aws_kafka_cpu_io_waitCpuIoWait
aws_kafka_cpu_systemCpuSystem
aws_kafka_cpu_userCpuUser
aws_kafka_estimated_max_time_lagEstimatedMaxTimeLag
aws_kafka_estimated_time_lagEstimatedTimeLag
aws_kafka_fetch_consumer_local_time_ms_meanFetchConsumerLocalTimeMsMean
aws_kafka_fetch_consumer_request_queue_time_ms_meanFetchConsumerRequestQueueTimeMsMean
aws_kafka_fetch_consumer_response_queue_time_ms_meanFetchConsumerResponseQueueTimeMsMean
aws_kafka_fetch_consumer_response_send_time_ms_meanFetchConsumerResponseSendTimeMsMean
aws_kafka_fetch_consumer_total_time_ms_meanFetchConsumerTotalTimeMsMean
aws_kafka_fetch_follower_local_time_ms_meanFetchFollowerLocalTimeMsMean
aws_kafka_fetch_follower_request_queue_time_ms_meanFetchFollowerRequestQueueTimeMsMean
aws_kafka_fetch_follower_response_queue_time_ms_meanFetchFollowerResponseQueueTimeMsMean
aws_kafka_fetch_follower_response_send_time_ms_meanFetchFollowerResponseSendTimeMsMean
aws_kafka_fetch_follower_total_time_ms_meanFetchFollowerTotalTimeMsMean
aws_kafka_fetch_message_conversions_per_secFetchMessageConversionsPerSec
aws_kafka_fetch_throttle_byte_rateFetchThrottleByteRate
aws_kafka_fetch_throttle_queue_sizeFetchThrottleQueueSize
aws_kafka_fetch_throttle_timeFetchThrottleTime
aws_kafka_global_partition_countGlobalPartitionCount
aws_kafka_global_topic_countGlobalTopicCount
aws_kafka_heap_memory_after_gcHeapMemoryAfterGC
aws_kafka_app_logs_disk_usedKafkaAppLogsDiskUsed
aws_kafka_data_logs_disk_usedKafkaDataLogsDiskUsed
aws_kafka_leader_countLeaderCount
aws_kafka_max_offset_lagMaxOffsetLag
aws_kafka_memory_bufferedMemoryBuffered
aws_kafka_memory_cachedMemoryCached
aws_kafka_memory_freeMemoryFree
aws_kafka_memory_usedMemoryUsed
aws_kafka_messages_in_per_secMessagesInPerSec
aws_kafka_network_processor_avg_idle_percentNetworkProcessorAvgIdlePercent
aws_kafka_network_rx_droppedNetworkRxDropped
aws_kafka_network_rx_errorsNetworkRxErrors
aws_kafka_network_rx_packetsNetworkRxPackets
aws_kafka_network_tx_droppedNetworkTxDropped
aws_kafka_network_tx_errorsNetworkTxErrors
aws_kafka_network_tx_packetsNetworkTxPackets
aws_kafka_offline_partitions_countOfflinePartitionsCount
aws_kafka_offset_lagOffsetLag
aws_kafka_partition_countPartitionCount
aws_kafka_pps_allowance_exceededPpsAllowanceExceeded
aws_kafka_produce_local_time_ms_meanProduceLocalTimeMsMean
aws_kafka_produce_message_conversions_per_secProduceMessageConversionsPerSec
aws_kafka_produce_message_conversions_time_ms_meanProduceMessageConversionsTimeMsMean
aws_kafka_produce_request_queue_time_ms_meanProduceRequestQueueTimeMsMean
aws_kafka_produce_response_queue_time_ms_meanProduceResponseQueueTimeMsMean
aws_kafka_produce_response_send_time_ms_meanProduceResponseSendTimeMsMean
aws_kafka_produce_throttle_byte_rateProduceThrottleByteRate
aws_kafka_produce_throttle_queue_sizeProduceThrottleQueueSize
aws_kafka_produce_throttle_timeProduceThrottleTime
aws_kafka_produce_total_time_ms_meanProduceTotalTimeMsMean
aws_kafka_remote_copy_bytes_per_secRemoteCopyBytesPerSec
aws_kafka_remote_copy_errors_per_secRemoteCopyErrorsPerSec
aws_kafka_remote_copy_lag_bytesRemoteCopyLagBytes
aws_kafka_remote_fetch_bytes_per_secRemoteFetchBytesPerSec
aws_kafka_remote_fetch_errors_per_secRemoteFetchErrorsPerSec
aws_kafka_remote_fetch_requests_per_secRemoteFetchRequestsPerSec
aws_kafka_remote_log_manager_tasks_avg_idle_percentRemoteLogManagerTasksAvgIdlePercent
aws_kafka_remote_log_reader_avg_idle_percentRemoteLogReaderAvgIdlePercent
aws_kafka_remote_log_reader_task_queue_sizeRemoteLogReaderTaskQueueSize
aws_kafka_replication_bytes_in_per_secReplicationBytesInPerSec
aws_kafka_replication_bytes_out_per_secReplicationBytesOutPerSec
aws_kafka_request_bytes_meanRequestBytesMean
aws_kafka_request_exempt_from_throttle_timeRequestExemptFromThrottleTime
aws_kafka_request_handler_avg_idle_percentRequestHandlerAvgIdlePercent
aws_kafka_request_throttle_queue_sizeRequestThrottleQueueSize
aws_kafka_request_throttle_timeRequestThrottleTime
aws_kafka_request_timeRequestTime
aws_kafka_root_disk_usedRootDiskUsed
aws_kafka_sum_offset_lagSumOffsetLag
aws_kafka_swap_freeSwapFree
aws_kafka_swap_usedSwapUsed
aws_kafka_tcpconnectionsTCPConnections
aws_kafka_tcp_connectionsTcpConnections
aws_kafka_traffic_bytesTrafficBytes
aws_kafka_traffic_shapingTrafficShaping
aws_kafka_under_min_isr_partition_countUnderMinIsrPartitionCount
aws_kafka_under_replicated_partitionsUnderReplicatedPartitions
aws_kafka_volume_queue_lengthVolumeQueueLength
aws_kafka_volume_read_bytesVolumeReadBytes
aws_kafka_volume_read_opsVolumeReadOps
aws_kafka_volume_total_read_timeVolumeTotalReadTime
aws_kafka_volume_total_write_timeVolumeTotalWriteTime
aws_kafka_volume_write_bytesVolumeWriteBytes
aws_kafka_volume_write_opsVolumeWriteOps
aws_kafka_zoo_keeper_request_latency_ms_meanZooKeeperRequestLatencyMsMean
aws_kafka_zoo_keeper_session_stateZooKeeperSessionState

AWS/Kinesis

Function: Managed service for real-time data processing and analytics

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_kinesis_info
aws_kinesis_get_records_bytesGetRecords.Bytes
aws_kinesis_get_records_iterator_ageGetRecords.IteratorAge
aws_kinesis_get_records_iterator_age_millisecondsGetRecords.IteratorAgeMilliseconds
aws_kinesis_get_records_latencyGetRecords.Latency
aws_kinesis_get_records_recordsGetRecords.Records
aws_kinesis_get_records_successGetRecords.Success
aws_kinesis_incoming_bytesIncomingBytes
aws_kinesis_incoming_recordsIncomingRecords
aws_kinesis_iterator_age_millisecondsIteratorAgeMilliseconds
aws_kinesis_outgoing_bytesOutgoingBytes
aws_kinesis_outgoing_recordsOutgoingRecords
aws_kinesis_put_record_bytesPutRecord.Bytes
aws_kinesis_put_record_latencyPutRecord.Latency
aws_kinesis_put_record_successPutRecord.Success
aws_kinesis_put_records_bytesPutRecords.Bytes
aws_kinesis_put_records_failed_recordsPutRecords.FailedRecords
aws_kinesis_put_records_latencyPutRecords.Latency
aws_kinesis_put_records_recordsPutRecords.Records
aws_kinesis_put_records_successPutRecords.Success
aws_kinesis_put_records_successful_recordsPutRecords.SuccessfulRecords
aws_kinesis_put_records_throttled_recordsPutRecords.ThrottledRecords
aws_kinesis_put_records_total_recordsPutRecords.TotalRecords
aws_kinesis_read_provisioned_throughput_exceededReadProvisionedThroughputExceeded
aws_kinesis_subscribe_to_shard_rate_exceededSubscribeToShard.RateExceeded
aws_kinesis_subscribe_to_shard_successSubscribeToShard.Success
aws_kinesis_subscribe_to_shard_event_bytesSubscribeToShardEvent.Bytes
aws_kinesis_subscribe_to_shard_event_millis_behind_latestSubscribeToShardEvent.MillisBehindLatest
aws_kinesis_subscribe_to_shard_event_recordsSubscribeToShardEvent.Records
aws_kinesis_subscribe_to_shard_event_successSubscribeToShardEvent.Success
aws_kinesis_write_provisioned_throughput_exceededWriteProvisionedThroughputExceeded

AWS/KinesisAnalytics

Function: Processes streaming data in real time using SQL

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_kinesisanalytics_info
aws_kinesisanalytics_bytesBytes
aws_kinesisanalytics_input_processing_dropped_recordsInputProcessing.DroppedRecords
aws_kinesisanalytics_input_processing_durationInputProcessing.Duration
aws_kinesisanalytics_input_processing_ok_bytesInputProcessing.OkBytes
aws_kinesisanalytics_input_processing_ok_recordsInputProcessing.OkRecords
aws_kinesisanalytics_input_processing_processing_failed_recordsInputProcessing.ProcessingFailedRecords
aws_kinesisanalytics_input_processing_successInputProcessing.Success
aws_kinesisanalytics_kpusKPUs
aws_kinesisanalytics_lambda_delivery_delivery_failed_recordsLambdaDelivery.DeliveryFailedRecords
aws_kinesisanalytics_lambda_delivery_durationLambdaDelivery.Duration
aws_kinesisanalytics_lambda_delivery_ok_recordsLambdaDelivery.OkRecords
aws_kinesisanalytics_millis_behind_latestMillisBehindLatest
aws_kinesisanalytics_recordsRecords
aws_kinesisanalytics_successSuccess
aws_kinesisanalytics_back_pressured_time_ms_per_secondbackPressuredTimeMsPerSecond
aws_kinesisanalytics_busy_time_ms_per_secondbusyTimeMsPerSecond
aws_kinesisanalytics_bytes_requested_per_fetchbytesRequestedPerFetch
aws_kinesisanalytics_bytes_consumed_ratebytes_consumed_rate
aws_kinesisanalytics_commits_failedcommitsFailed
aws_kinesisanalytics_commits_succeededcommitsSucceeded
aws_kinesisanalytics_committedoffsetscommittedoffsets
aws_kinesisanalytics_container_cpuutilizationcontainerCPUUtilization
aws_kinesisanalytics_container_disk_utilizationcontainerDiskUtilization
aws_kinesisanalytics_container_memory_utilizationcontainerMemoryUtilization
aws_kinesisanalytics_cpu_utilizationcpuUtilization
aws_kinesisanalytics_current_input_watermarkcurrentInputWatermark
aws_kinesisanalytics_current_output_watermarkcurrentOutputWatermark
aws_kinesisanalytics_currentoffsetscurrentoffsets
aws_kinesisanalytics_downtimedowntime
aws_kinesisanalytics_full_restartsfullRestarts
aws_kinesisanalytics_heap_memory_utilizationheapMemoryUtilization
aws_kinesisanalytics_idle_time_ms_per_secondidleTimeMsPerSecond
aws_kinesisanalytics_last_checkpoint_durationlastCheckpointDuration
aws_kinesisanalytics_last_checkpoint_sizelastCheckpointSize
aws_kinesisanalytics_managed_memory_totalmanagedMemoryTotal
aws_kinesisanalytics_managed_memory_usedmanagedMemoryUsed
aws_kinesisanalytics_managed_memory_utilizationmanagedMemoryUtilization
aws_kinesisanalytics_num_late_records_droppednumLateRecordsDropped
aws_kinesisanalytics_num_records_innumRecordsIn
aws_kinesisanalytics_num_records_in_per_secondnumRecordsInPerSecond
aws_kinesisanalytics_num_records_outnumRecordsOut
aws_kinesisanalytics_num_records_out_per_secondnumRecordsOutPerSecond
aws_kinesisanalytics_number_of_failed_checkpointsnumberOfFailedCheckpoints
aws_kinesisanalytics_old_generation_gccountoldGenerationGCCount
aws_kinesisanalytics_old_generation_gctimeoldGenerationGCTime
aws_kinesisanalytics_records_lag_maxrecords_lag_max
aws_kinesisanalytics_thread_countthreadCount
aws_kinesisanalytics_uptimeuptime
aws_kinesisanalytics_zeppelin_cpu_utilizationzeppelinCpuUtilization
aws_kinesisanalytics_zeppelin_heap_memory_utilizationzeppelinHeapMemoryUtilization
aws_kinesisanalytics_zeppelin_server_uptimezeppelinServerUptime
aws_kinesisanalytics_zeppelin_thread_countzeppelinThreadCount
aws_kinesisanalytics_zeppelin_waiting_jobszeppelinWaitingJobs

AWS/Lambda

Function: Serverless compute service that runs code in response to events

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_lambda_info
aws_lambda_invocationsInvocations
aws_lambda_errorsErrors
aws_lambda_throttlesThrottles
aws_lambda_durationDuration
aws_lambda_async_event_ageAsyncEventAge
aws_lambda_async_events_droppedAsyncEventsDropped
aws_lambda_async_events_receivedAsyncEventsReceived
aws_lambda_claimed_account_concurrencyClaimedAccountConcurrency
aws_lambda_concurrent_executionsConcurrentExecutions
aws_lambda_dead_letter_errorsDeadLetterErrors
aws_lambda_destination_delivery_failuresDestinationDeliveryFailures
aws_lambda_iterator_ageIteratorAge
aws_lambda_offset_lagOffsetLag
aws_lambda_oversized_record_countOversizedRecordCount
aws_lambda_post_runtime_extensions_durationPostRuntimeExtensionsDuration
aws_lambda_provisioned_concurrency_invocationsProvisionedConcurrencyInvocations
aws_lambda_provisioned_concurrency_spillover_invocationsProvisionedConcurrencySpilloverInvocations
aws_lambda_provisioned_concurrency_utilizationProvisionedConcurrencyUtilization
aws_lambda_provisioned_concurrent_executionsProvisionedConcurrentExecutions
aws_lambda_recursive_invocations_droppedRecursiveInvocationsDropped
aws_lambda_unreserved_concurrent_executionsUnreservedConcurrentExecutions

AWS/Logs

Function: Centralized logging service for monitoring and troubleshooting applications

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_logs_info
aws_logs_delivery_errorsDeliveryErrors
aws_logs_delivery_throttlingDeliveryThrottling
aws_logs_forwarded_bytesForwardedBytes
aws_logs_forwarded_log_eventsForwardedLogEvents
aws_logs_incoming_bytesIncomingBytes
aws_logs_incoming_log_eventsIncomingLogEvents

AWS/MWAA

Function: Managed service for Apache Airflow to manage workflows and orchestration

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_mwaa_active_connection_countActiveConnectionCount
aws_mwaa_approximate_age_of_oldest_taskApproximateAgeOfOldestTask
aws_mwaa_cpuutilizationCPUUtilization
aws_mwaa_database_connectionsDatabaseConnections
aws_mwaa_disk_queue_depthDiskQueueDepth
aws_mwaa_freeable_memoryFreeableMemory
aws_mwaa_memory_utilizationMemoryUtilization
aws_mwaa_queued_tasksQueuedTasks
aws_mwaa_running_tasksRunningTasks
aws_mwaa_volume_write_iopsVolumeWriteIOPS
aws_mwaa_write_iopsWriteIOPS
aws_mwaa_write_latencyWriteLatency
aws_mwaa_write_throughputWriteThroughput

AWS/MediaConnect

Function: Secure and reliable transport of live video streams

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_mediaconnect_info
aws_mediaconnect_arqrecoveredARQRecovered
aws_mediaconnect_arqrequestsARQRequests
aws_mediaconnect_bit_rateBitRate
aws_mediaconnect_caterrorCATError
aws_mediaconnect_crcerrorCRCError
aws_mediaconnect_connectedConnected
aws_mediaconnect_connected_outputsConnectedOutputs
aws_mediaconnect_connection_attemptsConnectionAttempts
aws_mediaconnect_consecutive_dropsConsecutiveDrops
aws_mediaconnect_consecutive_not_recoveredConsecutiveNotRecovered
aws_mediaconnect_continuity_counterContinuityCounter
aws_mediaconnect_disconnectionsDisconnections
aws_mediaconnect_dropped_packetsDroppedPackets
aws_mediaconnect_egress_bridge_bit_rateEgressBridgeBitRate
aws_mediaconnect_egress_bridge_caterrorEgressBridgeCATError
aws_mediaconnect_egress_bridge_crcerrorEgressBridgeCRCError
aws_mediaconnect_egress_bridge_continuity_counterEgressBridgeContinuityCounter
aws_mediaconnect_egress_bridge_dropped_packetsEgressBridgeDroppedPackets
aws_mediaconnect_egress_bridge_failover_switchesEgressBridgeFailoverSwitches
aws_mediaconnect_egress_bridge_merge_activeEgressBridgeMergeActive
aws_mediaconnect_egress_bridge_not_recovered_packetsEgressBridgeNotRecoveredPackets
aws_mediaconnect_egress_bridge_paterrorEgressBridgePATError
aws_mediaconnect_egress_bridge_pcraccuracy_errorEgressBridgePCRAccuracyError
aws_mediaconnect_egress_bridge_pcrerrorEgressBridgePCRError
aws_mediaconnect_egress_bridge_piderrorEgressBridgePIDError
aws_mediaconnect_egress_bridge_pmterrorEgressBridgePMTError
aws_mediaconnect_egress_bridge_ptserrorEgressBridgePTSError
aws_mediaconnect_egress_bridge_packet_loss_percentEgressBridgePacketLossPercent
aws_mediaconnect_egress_bridge_recovered_packetsEgressBridgeRecoveredPackets
aws_mediaconnect_egress_bridge_source_bit_rateEgressBridgeSourceBitRate
aws_mediaconnect_egress_bridge_source_caterrorEgressBridgeSourceCATError
aws_mediaconnect_egress_bridge_source_crcerrorEgressBridgeSourceCRCError
aws_mediaconnect_egress_bridge_source_continuity_counterEgressBridgeSourceContinuityCounter
aws_mediaconnect_egress_bridge_source_dropped_packetsEgressBridgeSourceDroppedPackets
aws_mediaconnect_egress_bridge_source_merge_activeEgressBridgeSourceMergeActive
aws_mediaconnect_egress_bridge_source_merge_latencyEgressBridgeSourceMergeLatency
aws_mediaconnect_egress_bridge_source_not_recovered_packetsEgressBridgeSourceNotRecoveredPackets
aws_mediaconnect_egress_bridge_source_paterrorEgressBridgeSourcePATError
aws_mediaconnect_egress_bridge_source_pcraccuracy_errorEgressBridgeSourcePCRAccuracyError
aws_mediaconnect_egress_bridge_source_pcrerrorEgressBridgeSourcePCRError
aws_mediaconnect_egress_bridge_source_piderrorEgressBridgeSourcePIDError
aws_mediaconnect_egress_bridge_source_pmterrorEgressBridgeSourcePMTError
aws_mediaconnect_egress_bridge_source_ptserrorEgressBridgeSourcePTSError
aws_mediaconnect_egress_bridge_source_packet_loss_percentEgressBridgeSourcePacketLossPercent
aws_mediaconnect_egress_bridge_source_recovered_packetsEgressBridgeSourceRecoveredPackets
aws_mediaconnect_egress_bridge_source_tsbyte_errorEgressBridgeSourceTSByteError
aws_mediaconnect_egress_bridge_source_tssync_lossEgressBridgeSourceTSSyncLoss
aws_mediaconnect_egress_bridge_source_total_packetsEgressBridgeSourceTotalPackets
aws_mediaconnect_egress_bridge_source_transport_errorEgressBridgeSourceTransportError
aws_mediaconnect_egress_bridge_tsbyte_errorEgressBridgeTSByteError
aws_mediaconnect_egress_bridge_tssync_lossEgressBridgeTSSyncLoss
aws_mediaconnect_egress_bridge_total_packetsEgressBridgeTotalPackets
aws_mediaconnect_egress_bridge_transport_errorEgressBridgeTransportError
aws_mediaconnect_failover_switchesFailoverSwitches
aws_mediaconnect_ingress_bridge_bit_rateIngressBridgeBitRate
aws_mediaconnect_ingress_bridge_caterrorIngressBridgeCATError
aws_mediaconnect_ingress_bridge_crcerrorIngressBridgeCRCError
aws_mediaconnect_ingress_bridge_continuity_counterIngressBridgeContinuityCounter
aws_mediaconnect_ingress_bridge_dropped_packetsIngressBridgeDroppedPackets
aws_mediaconnect_ingress_bridge_failover_switchesIngressBridgeFailoverSwitches
aws_mediaconnect_ingress_bridge_merge_activeIngressBridgeMergeActive
aws_mediaconnect_ingress_bridge_not_recovered_packetsIngressBridgeNotRecoveredPackets
aws_mediaconnect_ingress_bridge_paterrorIngressBridgePATError
aws_mediaconnect_ingress_bridge_pcraccuracy_errorIngressBridgePCRAccuracyError
aws_mediaconnect_ingress_bridge_pcrerrorIngressBridgePCRError
aws_mediaconnect_ingress_bridge_piderrorIngressBridgePIDError
aws_mediaconnect_ingress_bridge_pmterrorIngressBridgePMTError
aws_mediaconnect_ingress_bridge_ptserrorIngressBridgePTSError
aws_mediaconnect_ingress_bridge_packet_loss_percentIngressBridgePacketLossPercent
aws_mediaconnect_ingress_bridge_recovered_packetsIngressBridgeRecoveredPackets
aws_mediaconnect_ingress_bridge_source_arqrecoveredIngressBridgeSourceARQRecovered
aws_mediaconnect_ingress_bridge_source_arqrequestsIngressBridgeSourceARQRequests
aws_mediaconnect_ingress_bridge_source_bit_rateIngressBridgeSourceBitRate
aws_mediaconnect_ingress_bridge_source_caterrorIngressBridgeSourceCATError
aws_mediaconnect_ingress_bridge_source_crcerrorIngressBridgeSourceCRCError
aws_mediaconnect_ingress_bridge_source_continuity_counterIngressBridgeSourceContinuityCounter
aws_mediaconnect_ingress_bridge_source_dropped_packetsIngressBridgeSourceDroppedPackets
aws_mediaconnect_ingress_bridge_source_fecpacketsIngressBridgeSourceFECPackets
aws_mediaconnect_ingress_bridge_source_fecrecoveredIngressBridgeSourceFECRecovered
aws_mediaconnect_ingress_bridge_source_merge_activeIngressBridgeSourceMergeActive
aws_mediaconnect_ingress_bridge_source_merge_latencyIngressBridgeSourceMergeLatency
aws_mediaconnect_ingress_bridge_source_not_recovered_packetsIngressBridgeSourceNotRecoveredPackets
aws_mediaconnect_ingress_bridge_source_overflow_packetsIngressBridgeSourceOverflowPackets
aws_mediaconnect_ingress_bridge_source_paterrorIngressBridgeSourcePATError
aws_mediaconnect_ingress_bridge_source_pcraccuracy_errorIngressBridgeSourcePCRAccuracyError
aws_mediaconnect_ingress_bridge_source_pcrerrorIngressBridgeSourcePCRError
aws_mediaconnect_ingress_bridge_source_piderrorIngressBridgeSourcePIDError
aws_mediaconnect_ingress_bridge_source_pmterrorIngressBridgeSourcePMTError
aws_mediaconnect_ingress_bridge_source_ptserrorIngressBridgeSourcePTSError
aws_mediaconnect_ingress_bridge_source_packet_loss_percentIngressBridgeSourcePacketLossPercent
aws_mediaconnect_ingress_bridge_source_recovered_packetsIngressBridgeSourceRecoveredPackets
aws_mediaconnect_ingress_bridge_source_round_trip_timeIngressBridgeSourceRoundTripTime
aws_mediaconnect_ingress_bridge_source_tsbyte_errorIngressBridgeSourceTSByteError
aws_mediaconnect_ingress_bridge_source_tssync_lossIngressBridgeSourceTSSyncLoss
aws_mediaconnect_ingress_bridge_source_total_packetsIngressBridgeSourceTotalPackets
aws_mediaconnect_ingress_bridge_source_transport_errorIngressBridgeSourceTransportError
aws_mediaconnect_ingress_bridge_tsbyte_errorIngressBridgeTSByteError
aws_mediaconnect_ingress_bridge_tssync_lossIngressBridgeTSSyncLoss
aws_mediaconnect_ingress_bridge_total_packetsIngressBridgeTotalPackets
aws_mediaconnect_ingress_bridge_transport_errorIngressBridgeTransportError
aws_mediaconnect_jitterJitter
aws_mediaconnect_latencyLatency
aws_mediaconnect_maintenance_canceledMaintenanceCanceled
aws_mediaconnect_maintenance_failedMaintenanceFailed
aws_mediaconnect_maintenance_rescheduledMaintenanceRescheduled
aws_mediaconnect_maintenance_scheduledMaintenanceScheduled
aws_mediaconnect_maintenance_startedMaintenanceStarted
aws_mediaconnect_maintenance_succeededMaintenanceSucceeded
aws_mediaconnect_merge_activeMergeActive
aws_mediaconnect_merge_latencyMergeLatency
aws_mediaconnect_not_recovered_packetsNotRecoveredPackets
aws_mediaconnect_output_connectedOutputConnected
aws_mediaconnect_output_disconnectionsOutputDisconnections
aws_mediaconnect_output_dropped_payloadsOutputDroppedPayloads
aws_mediaconnect_output_late_payloadsOutputLatePayloads
aws_mediaconnect_output_total_bytesOutputTotalBytes
aws_mediaconnect_output_total_payloadsOutputTotalPayloads
aws_mediaconnect_overflow_packetsOverflowPackets
aws_mediaconnect_paterrorPATError
aws_mediaconnect_pcraccuracy_errorPCRAccuracyError
aws_mediaconnect_pcrerrorPCRError
aws_mediaconnect_piderrorPIDError
aws_mediaconnect_pmterrorPMTError
aws_mediaconnect_ptserrorPTSError
aws_mediaconnect_packet_loss_percentPacketLossPercent
aws_mediaconnect_recovered_packetsRecoveredPackets
aws_mediaconnect_round_trip_timeRoundTripTime
aws_mediaconnect_source_arqrecoveredSourceARQRecovered
aws_mediaconnect_source_arqrequestsSourceARQRequests
aws_mediaconnect_source_bit_rateSourceBitRate
aws_mediaconnect_source_caterrorSourceCATError
aws_mediaconnect_source_crcerrorSourceCRCError
aws_mediaconnect_source_connectedSourceConnected
aws_mediaconnect_source_continuity_counterSourceContinuityCounter
aws_mediaconnect_source_disconnectionsSourceDisconnections
aws_mediaconnect_source_dropped_packetsSourceDroppedPackets
aws_mediaconnect_source_dropped_payloadsSourceDroppedPayloads
aws_mediaconnect_source_fecpacketsSourceFECPackets
aws_mediaconnect_source_fecrecoveredSourceFECRecovered
aws_mediaconnect_source_late_payloadsSourceLatePayloads
aws_mediaconnect_source_merge_activeSourceMergeActive
aws_mediaconnect_source_merge_latencySourceMergeLatency
aws_mediaconnect_source_merge_status_warn_mismatchSourceMergeStatusWarnMismatch
aws_mediaconnect_source_merge_status_warn_soloSourceMergeStatusWarnSolo
aws_mediaconnect_source_missing_packetsSourceMissingPackets
aws_mediaconnect_source_not_recovered_packetsSourceNotRecoveredPackets
aws_mediaconnect_source_overflow_packetsSourceOverflowPackets
aws_mediaconnect_source_paterrorSourcePATError
aws_mediaconnect_source_pcraccuracy_errorSourcePCRAccuracyError
aws_mediaconnect_source_pcrerrorSourcePCRError
aws_mediaconnect_source_piderrorSourcePIDError
aws_mediaconnect_source_pmterrorSourcePMTError
aws_mediaconnect_source_ptserrorSourcePTSError
aws_mediaconnect_source_packet_loss_percentSourcePacketLossPercent
aws_mediaconnect_source_recovered_packetsSourceRecoveredPackets
aws_mediaconnect_source_round_trip_timeSourceRoundTripTime
aws_mediaconnect_source_selectedSourceSelected
aws_mediaconnect_source_tsbyte_errorSourceTSByteError
aws_mediaconnect_source_tssync_lossSourceTSSyncLoss
aws_mediaconnect_source_total_bytesSourceTotalBytes
aws_mediaconnect_source_total_packetsSourceTotalPackets
aws_mediaconnect_source_total_payloadsSourceTotalPayloads
aws_mediaconnect_source_transport_errorSourceTransportError
aws_mediaconnect_tsbyte_errorTSByteError
aws_mediaconnect_tssync_lossTSSyncLoss
aws_mediaconnect_total_packetsTotalPackets
aws_mediaconnect_transport_errorTransportError
aws_mediaconnect_uptimeUptime

AWS/MediaTailor

Function: Personalizes advertisement insertion in video streams for a seamless experience

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_mediatailor_info
aws_mediatailor_ad_decision_server_adsAdDecisionServer.Ads
aws_mediatailor_ad_decision_server_durationAdDecisionServer.Duration
aws_mediatailor_ad_decision_server_errorsAdDecisionServer.Errors
aws_mediatailor_ad_decision_server_fill_rateAdDecisionServer.FillRate
aws_mediatailor_ad_decision_server_timeoutsAdDecisionServer.Timeouts
aws_mediatailor_ad_not_readyAdNotReady
aws_mediatailor_avails_durationAvails.Duration
aws_mediatailor_avails_fill_rateAvails.FillRate
aws_mediatailor_avails_filled_durationAvails.FilledDuration
aws_mediatailor_get_manifest_errorsGetManifest.Errors
aws_mediatailor_origin_errorsOrigin.Errors
aws_mediatailor_origin_timeoutsOrigin.Timeouts

AWS/NATGateway

Function: Manages network address translation to securely connect instances to the internet

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_natgateway_info
aws_natgateway_active_connection_countActiveConnectionCount
aws_natgateway_bytes_in_from_destinationBytesInFromDestination
aws_natgateway_bytes_in_from_sourceBytesInFromSource
aws_natgateway_bytes_out_to_destinationBytesOutToDestination
aws_natgateway_bytes_out_to_sourceBytesOutToSource
aws_natgateway_connection_attempt_countConnectionAttemptCount
aws_natgateway_connection_established_countConnectionEstablishedCount
aws_natgateway_error_port_allocationErrorPortAllocation
aws_natgateway_idle_timeout_countIdleTimeoutCount
aws_natgateway_packets_drop_countPacketsDropCount
aws_natgateway_packets_in_from_destinationPacketsInFromDestination
aws_natgateway_packets_in_from_sourcePacketsInFromSource
aws_natgateway_packets_out_to_destinationPacketsOutToDestination
aws_natgateway_packets_out_to_sourcePacketsOutToSource

AWS/Neptune

Function: Managed graph database service for building and running graph applications

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_neptune_info
aws_neptune_cpuutilizationCPUUtilization
aws_neptune_cluster_replica_lagClusterReplicaLag
aws_neptune_cluster_replica_lag_maximumClusterReplicaLagMaximum
aws_neptune_cluster_replica_lag_minimumClusterReplicaLagMinimum
aws_neptune_engine_uptimeEngineUptime
aws_neptune_free_local_storageFreeLocalStorage
aws_neptune_freeable_memoryFreeableMemory
aws_neptune_gremlin_errorsGremlinErrors
aws_neptune_gremlin_http1xxGremlinHttp1xx
aws_neptune_gremlin_http2xxGremlinHttp2xx
aws_neptune_gremlin_http4xxGremlinHttp4xx
aws_neptune_gremlin_http5xxGremlinHttp5xx
aws_neptune_gremlin_requestsGremlinRequests
aws_neptune_gremlin_requests_per_secGremlinRequestsPerSec
aws_neptune_gremlin_web_socket_available_connectionsGremlinWebSocketAvailableConnections
aws_neptune_gremlin_web_socket_client_errorsGremlinWebSocketClientErrors
aws_neptune_gremlin_web_socket_server_errorsGremlinWebSocketServerErrors
aws_neptune_gremlin_web_socket_successGremlinWebSocketSuccess
aws_neptune_http100Http100
aws_neptune_http101Http101
aws_neptune_http1xxHttp1xx
aws_neptune_http200Http200
aws_neptune_http2xxHttp2xx
aws_neptune_http400Http400
aws_neptune_http403Http403
aws_neptune_http405Http405
aws_neptune_http413Http413
aws_neptune_http429Http429
aws_neptune_http4xxHttp4xx
aws_neptune_http500Http500
aws_neptune_http501Http501
aws_neptune_http5xxHttp5xx
aws_neptune_loader_errorsLoaderErrors
aws_neptune_loader_requestsLoaderRequests
aws_neptune_network_receive_throughputNetworkReceiveThroughput
aws_neptune_network_throughputNetworkThroughput
aws_neptune_network_transmit_throughputNetworkTransmitThroughput
aws_neptune_sparql_errorsSparqlErrors
aws_neptune_sparql_http1xxSparqlHttp1xx
aws_neptune_sparql_http2xxSparqlHttp2xx
aws_neptune_sparql_http4xxSparqlHttp4xx
aws_neptune_sparql_http5xxSparqlHttp5xx
aws_neptune_sparql_requestsSparqlRequests
aws_neptune_sparql_requests_per_secSparqlRequestsPerSec
aws_neptune_status_errorsStatusErrors
aws_neptune_status_requestsStatusRequests
aws_neptune_volume_bytes_usedVolumeBytesUsed
aws_neptune_volume_read_iopsVolumeReadIOPs
aws_neptune_volume_write_iopsVolumeWriteIOPs

AWS/NetworkELB

Function: Provides highly scalable and fault-tolerant network load balancing for traffic distribution

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_networkelb_info
aws_networkelb_active_flow_countActiveFlowCount
aws_networkelb_active_flow_count_tlsActiveFlowCount_TLS
aws_networkelb_client_tlsnegotiation_error_countClientTLSNegotiationErrorCount
aws_networkelb_consumed_lcusConsumedLCUs
aws_networkelb_healthy_host_countHealthyHostCount
aws_networkelb_new_flow_countNewFlowCount
aws_networkelb_new_flow_count_tlsNewFlowCount_TLS
aws_networkelb_processed_bytesProcessedBytes
aws_networkelb_target_tlsnegotiation_error_countTargetTLSNegotiationErrorCount
aws_networkelb_tcp_client_reset_countTCP_Client_Reset_Count
aws_networkelb_tcp_target_reset_countTCP_Target_Reset_Count
aws_networkelb_un_healthy_host_countUnHealthyHostCount
aws_networkelb_active_flow_count_tcpActiveFlowCount_TCP
aws_networkelb_active_flow_count_udpActiveFlowCount_UDP
aws_networkelb_consumed_lcus_tcpConsumedLCUs_TCP
aws_networkelb_consumed_lcus_tlsConsumedLCUs_TLS
aws_networkelb_consumed_lcus_udpConsumedLCUs_UDP
aws_networkelb_new_flow_count_tcpNewFlowCount_TCP
aws_networkelb_new_flow_count_udpNewFlowCount_UDP
aws_networkelb_peak_packets_per_secondPeakPacketsPerSecond
aws_networkelb_port_allocation_error_countPortAllocationErrorCount
aws_networkelb_processed_bytes_tcpProcessedBytes_TCP
aws_networkelb_processed_bytes_tlsProcessedBytes_TLS
aws_networkelb_processed_bytes_udpProcessedBytes_UDP
aws_networkelb_processed_packetsProcessedPackets
aws_networkelb_security_group_blocked_flow_count_inbound_icmpSecurityGroupBlockedFlowCount_Inbound_ICMP
aws_networkelb_security_group_blocked_flow_count_inbound_tcpSecurityGroupBlockedFlowCount_Inbound_TCP
aws_networkelb_security_group_blocked_flow_count_inbound_udpSecurityGroupBlockedFlowCount_Inbound_UDP
aws_networkelb_security_group_blocked_flow_count_outbound_icmpSecurityGroupBlockedFlowCount_Outbound_ICMP
aws_networkelb_security_group_blocked_flow_count_outbound_tcpSecurityGroupBlockedFlowCount_Outbound_TCP
aws_networkelb_security_group_blocked_flow_count_outbound_udpSecurityGroupBlockedFlowCount_Outbound_UDP
aws_networkelb_tcp_elb_reset_countTCP_ELB_Reset_Count
aws_networkelb_unhealthy_routing_flow_countUnhealthyRoutingFlowCount

AWS/NetworkFirewall

Function: Managed network firewall service to secure VPCs

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_networkfirewall_info
aws_networkfirewall_dropped_packetsDroppedPackets
aws_networkfirewall_packetsPackets
aws_networkfirewall_passed_packetsPassedPackets
aws_networkfirewall_received_packet_countReceivedPacketCount

AWS/PrivateLinkEndpoints

Function: Provides private connectivity between VPCs and AWS services or third-party services

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_privatelinkendpoints_info
aws_privatelinkendpoints_active_connectionsActiveConnections
aws_privatelinkendpoints_bytes_processedBytesProcessed
aws_privatelinkendpoints_new_connectionsNewConnections
aws_privatelinkendpoints_packets_droppedPacketsDropped
aws_privatelinkendpoints_rst_packets_receivedRstPacketsReceived

AWS/PrivateLinkServices

Function: Service for building services accessible over AWS PrivateLink

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_privatelinkservices_info
aws_privatelinkservices_active_connectionsActiveConnections
aws_privatelinkservices_bytes_processedBytesProcessed
aws_privatelinkservices_endpoints_countEndpointsCount
aws_privatelinkservices_new_connectionsNewConnections
aws_privatelinkservices_rst_packets_receivedRstPacketsReceived

AWS/Prometheus

Function: Managed Prometheus service for monitoring and alerting metrics

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_prometheus_info
aws_prometheus_alert_manager_alerts_receivedAlertManagerAlertsReceived
aws_prometheus_alert_manager_notifications_failedAlertManagerNotificationsFailed
aws_prometheus_alert_manager_notifications_throttledAlertManagerNotificationsThrottled
aws_prometheus_discarded_samplesDiscardedSamples
aws_prometheus_rule_evaluation_failuresRuleEvaluationFailures
aws_prometheus_rule_evaluationsRuleEvaluations
aws_prometheus_rule_group_iterations_missedRuleGroupIterationsMissed

AWS/RDS

Function: Managed relational database service for databases like MySQL, PostgreSQL, and Oracle

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_rds_info
aws_rds_cpuutilizationCPUUtilization
aws_rds_database_connectionsDatabaseConnections
aws_rds_replica_lagReplicaLag
aws_rds_freeable_memoryFreeableMemory
aws_rds_free_storage_spaceFreeStorageSpace
aws_rds_free_storage_space_log_volumeFreeStorageSpaceLogVolume
aws_rds_swap_usageSwapUsage
aws_rds_read_throughputReadThroughput
aws_rds_read_latencyReadLatency
aws_rds_read_iopsReadIOPS
aws_rds_write_throughputWriteThroughput
aws_rds_write_latencyWriteLatency
aws_rds_write_iopsWriteIOPS
aws_rds_burst_balanceBurstBalance
aws_rds_ebsbyte_balance_percentEBSByteBalance%
aws_rds_ebsiobalance_percentEBSIOBalance%
aws_rds_dbloadDBLoad
aws_rds_dbload_cpuDBLoadCPU
aws_rds_dbload_non_cpuDBLoadNonCPU
aws_rds_cpucredit_usageCPUCreditUsage
aws_rds_cpucredit_balanceCPUCreditBalance
aws_rds_acuutilizationACUUtilization
aws_rds_aborted_clientsAbortedClients
aws_rds_active_transactionsActiveTransactions
aws_rds_aurora_binlog_replica_lagAuroraBinlogReplicaLag
aws_rds_aurora_dmlrejected_master_fullAuroraDMLRejectedMasterFull
aws_rds_aurora_dmlrejected_writer_fullAuroraDMLRejectedWriterFull
aws_rds_aurora_estimated_shared_memory_bytesAuroraEstimatedSharedMemoryBytes
aws_rds_aurora_global_dbdata_transfer_bytesAuroraGlobalDBDataTransferBytes
aws_rds_aurora_global_dbprogress_lagAuroraGlobalDBProgressLag
aws_rds_aurora_global_dbrpolagAuroraGlobalDBRPOLag
aws_rds_aurora_global_dbreplicated_write_ioAuroraGlobalDBReplicatedWriteIO
aws_rds_aurora_global_dbreplication_lagAuroraGlobalDBReplicationLag
aws_rds_aurora_memory_health_stateAuroraMemoryHealthState
aws_rds_aurora_memory_num_declined_sql_totalAuroraMemoryNumDeclinedSqlTotal
aws_rds_aurora_memory_num_kill_conn_totalAuroraMemoryNumKillConnTotal
aws_rds_aurora_memory_num_kill_query_totalAuroraMemoryNumKillQueryTotal
aws_rds_aurora_optimized_reads_cache_hit_ratioAuroraOptimizedReadsCacheHitRatio
aws_rds_aurora_replica_lagAuroraReplicaLag
aws_rds_aurora_replica_lag_maximumAuroraReplicaLagMaximum
aws_rds_aurora_replica_lag_minimumAuroraReplicaLagMinimum
aws_rds_aurora_slow_connection_handle_countAuroraSlowConnectionHandleCount
aws_rds_aurora_slow_handshake_countAuroraSlowHandshakeCount
aws_rds_aurora_volume_bytes_left_totalAuroraVolumeBytesLeftTotal
aws_rds_availability_percentageAvailabilityPercentage
aws_rds_backtrack_change_records_creation_rateBacktrackChangeRecordsCreationRate
aws_rds_backtrack_change_records_storedBacktrackChangeRecordsStored
aws_rds_backtrack_window_actualBacktrackWindowActual
aws_rds_backtrack_window_alertBacktrackWindowAlert
aws_rds_backup_retention_period_storage_usedBackupRetentionPeriodStorageUsed
aws_rds_bin_log_disk_usageBinLogDiskUsage
aws_rds_blocked_transactionsBlockedTransactions
aws_rds_buffer_cache_hit_ratioBufferCacheHitRatio
aws_rds_cpusurplus_credit_balanceCPUSurplusCreditBalance
aws_rds_cpusurplus_credits_chargedCPUSurplusCreditsCharged
aws_rds_checkpoint_lagCheckpointLag
aws_rds_client_connectionsClientConnections
aws_rds_client_connections_closedClientConnectionsClosed
aws_rds_client_connections_no_tlsClientConnectionsNoTLS
aws_rds_client_connections_receivedClientConnectionsReceived
aws_rds_client_connections_setup_failed_authClientConnectionsSetupFailedAuth
aws_rds_client_connections_setup_succeededClientConnectionsSetupSucceeded
aws_rds_client_connections_tlsClientConnectionsTLS
aws_rds_commit_latencyCommitLatency
aws_rds_commit_throughputCommitThroughput
aws_rds_connection_attemptsConnectionAttempts
aws_rds_ddllatencyDDLLatency
aws_rds_ddlthroughputDDLThroughput
aws_rds_dmllatencyDMLLatency
aws_rds_dmlthroughputDMLThroughput
aws_rds_database_connection_requestsDatabaseConnectionRequests
aws_rds_database_connection_requests_with_tlsDatabaseConnectionRequestsWithTLS
aws_rds_database_connections_borrow_latencyDatabaseConnectionsBorrowLatency
aws_rds_database_connections_currently_borrowedDatabaseConnectionsCurrentlyBorrowed
aws_rds_database_connections_currently_in_transactionDatabaseConnectionsCurrentlyInTransaction
aws_rds_database_connections_currently_session_pinnedDatabaseConnectionsCurrentlySessionPinned
aws_rds_database_connections_setup_failedDatabaseConnectionsSetupFailed
aws_rds_database_connections_setup_succeededDatabaseConnectionsSetupSucceeded
aws_rds_database_connections_with_tlsDatabaseConnectionsWithTLS
aws_rds_deadlocksDeadlocks
aws_rds_delete_latencyDeleteLatency
aws_rds_delete_throughputDeleteThroughput
aws_rds_disk_queue_depthDiskQueueDepth
aws_rds_disk_queue_depth_log_volumeDiskQueueDepthLogVolume
aws_rds_engine_uptimeEngineUptime
aws_rds_failed_sqlserver_agent_jobs_countFailedSQLServerAgentJobsCount
aws_rds_free_ephemeral_storageFreeEphemeralStorage
aws_rds_free_local_storageFreeLocalStorage
aws_rds_insert_latencyInsertLatency
aws_rds_insert_throughputInsertThroughput
aws_rds_login_failuresLoginFailures
aws_rds_max_database_connections_allowedMaxDatabaseConnectionsAllowed
aws_rds_maximum_used_transaction_idsMaximumUsedTransactionIDs
aws_rds_network_receive_throughputNetworkReceiveThroughput
aws_rds_network_throughputNetworkThroughput
aws_rds_network_transmit_throughputNetworkTransmitThroughput
aws_rds_num_binary_log_filesNumBinaryLogFiles
aws_rds_oldest_replication_slot_lagOldestReplicationSlotLag
aws_rds_purge_boundaryPurgeBoundary
aws_rds_purge_finished_pointPurgeFinishedPoint
aws_rds_queriesQueries
aws_rds_query_database_response_latencyQueryDatabaseResponseLatency
aws_rds_query_requestsQueryRequests
aws_rds_query_requests_no_tlsQueryRequestsNoTLS
aws_rds_query_requests_tlsQueryRequestsTLS
aws_rds_query_response_latencyQueryResponseLatency
aws_rds_to_aurora_postgre_sqlreplica_lagRDSToAuroraPostgreSQLReplicaLag
aws_rds_read_iopsephemeral_storageReadIOPSEphemeralStorage
aws_rds_read_iopslog_volumeReadIOPSLogVolume
aws_rds_read_latency_ephemeral_storageReadLatencyEphemeralStorage
aws_rds_read_latency_log_volumeReadLatencyLogVolume
aws_rds_read_throughput_ephemeral_storageReadThroughputEphemeralStorage
aws_rds_read_throughput_log_volumeReadThroughputLogVolume
aws_rds_replication_channel_lagReplicationChannelLag
aws_rds_replication_slot_disk_usageReplicationSlotDiskUsage
aws_rds_result_set_cache_hit_ratioResultSetCacheHitRatio
aws_rds_rollback_segment_history_list_lengthRollbackSegmentHistoryListLength
aws_rds_row_lock_timeRowLockTime
aws_rds_select_latencySelectLatency
aws_rds_select_throughputSelectThroughput
aws_rds_serverless_database_capacityServerlessDatabaseCapacity
aws_rds_snapshot_storage_usedSnapshotStorageUsed
aws_rds_storage_network_receive_throughputStorageNetworkReceiveThroughput
aws_rds_storage_network_throughputStorageNetworkThroughput
aws_rds_storage_network_transmit_throughputStorageNetworkTransmitThroughput
aws_rds_sum_binary_log_sizeSumBinaryLogSize
aws_rds_temp_storage_iopsTempStorageIOPS
aws_rds_temp_storage_throughputTempStorageThroughput
aws_rds_total_backup_storage_billedTotalBackupStorageBilled
aws_rds_transaction_logs_disk_usageTransactionLogsDiskUsage
aws_rds_transaction_logs_generationTransactionLogsGeneration
aws_rds_truncate_finished_pointTruncateFinishedPoint
aws_rds_update_latencyUpdateLatency
aws_rds_update_throughputUpdateThroughput
aws_rds_volume_bytes_usedVolumeBytesUsed
aws_rds_volume_read_iopsVolumeReadIOPs
aws_rds_volume_write_iopsVolumeWriteIOPs
aws_rds_write_iopsephemeral_storageWriteIOPSEphemeralStorage
aws_rds_write_iopslog_volumeWriteIOPSLogVolume
aws_rds_write_latency_ephemeral_storageWriteLatencyEphemeralStorage
aws_rds_write_latency_log_volumeWriteLatencyLogVolume
aws_rds_write_throughput_ephemeral_storageWriteThroughputEphemeralStorage
aws_rds_write_throughput_log_volumeWriteThroughputLogVolume

AWS/Redshift

Function: Fully managed data warehouse for large-scale data analytics

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_redshift_info
aws_redshift_cpuutilizationCPUUtilization
aws_redshift_commit_queue_lengthCommitQueueLength
aws_redshift_concurrency_scaling_active_clustersConcurrencyScalingActiveClusters
aws_redshift_concurrency_scaling_secondsConcurrencyScalingSeconds
aws_redshift_database_connectionsDatabaseConnections
aws_redshift_health_statusHealthStatus
aws_redshift_maintenance_modeMaintenanceMode
aws_redshift_max_configured_concurrency_scaling_clustersMaxConfiguredConcurrencyScalingClusters
aws_redshift_network_receive_throughputNetworkReceiveThroughput
aws_redshift_network_transmit_throughputNetworkTransmitThroughput
aws_redshift_num_exceeded_schema_quotasNumExceededSchemaQuotas
aws_redshift_percentage_disk_space_usedPercentageDiskSpaceUsed
aws_redshift_percentage_quota_usedPercentageQuotaUsed
aws_redshift_queries_completed_per_secondQueriesCompletedPerSecond
aws_redshift_query_durationQueryDuration
aws_redshift_query_runtime_breakdownQueryRuntimeBreakdown
aws_redshift_read_iopsReadIOPS
aws_redshift_read_latencyReadLatency
aws_redshift_read_throughputReadThroughput
aws_redshift_schema_quotaSchemaQuota
aws_redshift_storage_usedStorageUsed
aws_redshift_total_table_countTotalTableCount
aws_redshift_wlmqueries_completed_per_secondWLMQueriesCompletedPerSecond
aws_redshift_wlmquery_durationWLMQueryDuration
aws_redshift_wlmqueue_lengthWLMQueueLength
aws_redshift_wlmqueue_wait_timeWLMQueueWaitTime
aws_redshift_wlmrunning_queriesWLMRunningQueries
aws_redshift_write_iopsWriteIOPS
aws_redshift_write_latencyWriteLatency
aws_redshift_write_throughputWriteThroughput

AWS/Route53

Function: Scalable DNS and domain registration service

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_route53_info
aws_route53_child_health_check_healthy_countChildHealthCheckHealthyCount
aws_route53_connection_timeConnectionTime
aws_route53_dnsqueriesDNSQueries
aws_route53_health_check_percentage_healthyHealthCheckPercentageHealthy
aws_route53_health_check_statusHealthCheckStatus
aws_route53_sslhandshake_timeSSLHandshakeTime
aws_route53_time_to_first_byteTimeToFirstByte

AWS/Route53Resolver

Function: DNS firewall to filter and monitor DNS queries

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_route53resolver_info
aws_route53resolver_inbound_query_volumeInboundQueryVolume
aws_route53resolver_outbound_query_aggregated_volumeOutboundQueryAggregatedVolume
aws_route53resolver_outbound_query_volumeOutboundQueryVolume

AWS/S3

Function: Scalable object storage service for a wide range of data types

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_s3_info
aws_s3_number_of_objectsNumberOfObjects
aws_s3_bucket_size_bytesBucketSizeBytes
aws_s3_all_requestsAllRequests
aws_s3_4xx_errors4xxErrors
aws_s3_total_request_latencyTotalRequestLatency
aws_s3_5xx_errors5xxErrors
aws_s3_bytes_downloadedBytesDownloaded
aws_s3_bytes_pending_replicationBytesPendingReplication
aws_s3_bytes_uploadedBytesUploaded
aws_s3_delete_requestsDeleteRequests
aws_s3_first_byte_latencyFirstByteLatency
aws_s3_get_requestsGetRequests
aws_s3_head_requestsHeadRequests
aws_s3_list_requestsListRequests
aws_s3_operations_failed_replicationOperationsFailedReplication
aws_s3_operations_pending_replicationOperationsPendingReplication
aws_s3_post_requestsPostRequests
aws_s3_put_requestsPutRequests
aws_s3_replication_latencyReplicationLatency
aws_s3_select_requestsSelectRequests
aws_s3_select_returned_bytesSelectReturnedBytes
aws_s3_select_scanned_bytesSelectScannedBytes

AWS/SES

Function: Email service for sending marketing, notification, and transactional emails

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_ses_bounceBounce
aws_ses_complaintComplaint
aws_ses_deliveryDelivery
aws_ses_rejectReject
aws_ses_sendSend
aws_ses_clicksClicks
aws_ses_opensOpens
aws_ses_rendering_failuresRendering Failures
aws_ses_reputation_bounce_rateReputation.BounceRate
aws_ses_reputation_complaint_rateReputation.ComplaintRate

AWS/SNS

Function: Managed messaging service for sending notifications to mobile devices or other services

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_sns_info
aws_sns_number_of_messages_publishedNumberOfMessagesPublished
aws_sns_number_of_notifications_deliveredNumberOfNotificationsDelivered
aws_sns_number_of_notifications_failedNumberOfNotificationsFailed
aws_sns_number_of_notifications_filtered_outNumberOfNotificationsFilteredOut
aws_sns_number_of_notifications_filtered_out_invalid_attributesNumberOfNotificationsFilteredOut-InvalidAttributes
aws_sns_number_of_notifications_filtered_out_message_bodyNumberOfNotificationsFilteredOut-MessageBody
aws_sns_number_of_notifications_filtered_out_no_message_attributesNumberOfNotificationsFilteredOut-NoMessageAttributes
aws_sns_publish_sizePublishSize
aws_sns_smsmonth_to_date_spent_usdSMSMonthToDateSpentUSD
aws_sns_smssuccess_rateSMSSuccessRate

AWS/SQS

Function: Fully managed message queuing service for decoupling and scaling microservices

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_sqs_info
aws_sqs_approximate_age_of_oldest_messageApproximateAgeOfOldestMessage
aws_sqs_approximate_number_of_messages_delayedApproximateNumberOfMessagesDelayed
aws_sqs_approximate_number_of_messages_not_visibleApproximateNumberOfMessagesNotVisible
aws_sqs_approximate_number_of_messages_visibleApproximateNumberOfMessagesVisible
aws_sqs_number_of_empty_receivesNumberOfEmptyReceives
aws_sqs_number_of_messages_deletedNumberOfMessagesDeleted
aws_sqs_number_of_messages_receivedNumberOfMessagesReceived
aws_sqs_number_of_messages_sentNumberOfMessagesSent
aws_sqs_sent_message_sizeSentMessageSize

AWS/SageMaker

Function: Managed service for building, training, and deploying machine learning models

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_sagemaker_info
aws_sagemaker_invocation4_xxerrorsInvocation4XXErrors
aws_sagemaker_invocation5_xxerrorsInvocation5XXErrors
aws_sagemaker_invocation_model_errorsInvocationModelErrors
aws_sagemaker_invocationsInvocations
aws_sagemaker_invocations_per_copyInvocationsPerCopy
aws_sagemaker_invocations_per_instanceInvocationsPerInstance
aws_sagemaker_model_cache_hitModelCacheHit
aws_sagemaker_model_downloading_timeModelDownloadingTime
aws_sagemaker_model_latencyModelLatency
aws_sagemaker_model_loading_timeModelLoadingTime
aws_sagemaker_model_loading_wait_timeModelLoadingWaitTime
aws_sagemaker_model_setup_timeModelSetupTime
aws_sagemaker_model_unloading_timeModelUnloadingTime
aws_sagemaker_overhead_latencyOverheadLatency

AWS/SageMaker/Endpoints

Function: Provides real-time and batch inference capabilities for deployed machine learning models

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_sagemaker_endpoints_info
aws_sagemaker_endpoints_cpureservationCPUReservation
aws_sagemaker_endpoints_cpuutilizationCPUUtilization
aws_sagemaker_endpoints_cpuutilization_normalizedCPUUtilizationNormalized
aws_sagemaker_endpoints_disk_utilizationDiskUtilization
aws_sagemaker_endpoints_gpumemory_utilizationGPUMemoryUtilization
aws_sagemaker_endpoints_gpumemory_utilization_normalizedGPUMemoryUtilizationNormalized
aws_sagemaker_endpoints_gpureservationGPUReservation
aws_sagemaker_endpoints_gpuutilizationGPUUtilization
aws_sagemaker_endpoints_gpuutilization_normalizedGPUUtilizationNormalized
aws_sagemaker_endpoints_loaded_model_countLoadedModelCount
aws_sagemaker_endpoints_memory_reservationMemoryReservation
aws_sagemaker_endpoints_memory_utilizationMemoryUtilization

AWS/SageMaker/InferenceRecommendationsJobs

Function: Offers guidance on optimizing inference workloads for ML models

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_sagemaker_inferencerecommendationsjobs_info
aws_sagemaker_inferencerecommendationsjobs_client_invocation_errorsClientInvocationErrors
aws_sagemaker_inferencerecommendationsjobs_client_invocationsClientInvocations
aws_sagemaker_inferencerecommendationsjobs_client_latencyClientLatency
aws_sagemaker_inferencerecommendationsjobs_number_of_usersNumberOfUsers

AWS/SageMaker/ModelBuildingPipeline

Function: Managed pipelines to automate model training and deployment processes

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_sagemaker_modelbuildingpipeline_info
aws_sagemaker_modelbuildingpipeline_execution_durationExecutionDuration
aws_sagemaker_modelbuildingpipeline_execution_failedExecutionFailed
aws_sagemaker_modelbuildingpipeline_execution_startedExecutionStarted
aws_sagemaker_modelbuildingpipeline_execution_stoppedExecutionStopped
aws_sagemaker_modelbuildingpipeline_execution_succeededExecutionSucceeded
aws_sagemaker_modelbuildingpipeline_step_durationStepDuration
aws_sagemaker_modelbuildingpipeline_step_failedStepFailed
aws_sagemaker_modelbuildingpipeline_step_startedStepStarted
aws_sagemaker_modelbuildingpipeline_step_stoppedStepStopped
aws_sagemaker_modelbuildingpipeline_step_succeededStepSucceeded

AWS/SageMaker/ProcessingJobs

Function: Managed service for processing and transforming data at scale for machine learning

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_sagemaker_processingjobs_info
aws_sagemaker_processingjobs_cpureservationCPUReservation
aws_sagemaker_processingjobs_cpuutilizationCPUUtilization
aws_sagemaker_processingjobs_cpuutilization_normalizedCPUUtilizationNormalized
aws_sagemaker_processingjobs_disk_utilizationDiskUtilization
aws_sagemaker_processingjobs_gpumemory_utilizationGPUMemoryUtilization
aws_sagemaker_processingjobs_gpumemory_utilization_normalizedGPUMemoryUtilizationNormalized
aws_sagemaker_processingjobs_gpureservationGPUReservation
aws_sagemaker_processingjobs_gpuutilizationGPUUtilization
aws_sagemaker_processingjobs_gpuutilization_normalizedGPUUtilizationNormalized
aws_sagemaker_processingjobs_memory_reservationMemoryReservation
aws_sagemaker_processingjobs_memory_utilizationMemoryUtilization

AWS/SageMaker/TrainingJobs

Function: Managed service for training ML models on large datasets

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_sagemaker_trainingjobs_info
aws_sagemaker_trainingjobs_cpureservationCPUReservation
aws_sagemaker_trainingjobs_cpuutilizationCPUUtilization
aws_sagemaker_trainingjobs_cpuutilization_normalizedCPUUtilizationNormalized
aws_sagemaker_trainingjobs_disk_utilizationDiskUtilization
aws_sagemaker_trainingjobs_gpumemory_utilizationGPUMemoryUtilization
aws_sagemaker_trainingjobs_gpumemory_utilization_normalizedGPUMemoryUtilizationNormalized
aws_sagemaker_trainingjobs_gpureservationGPUReservation
aws_sagemaker_trainingjobs_gpuutilizationGPUUtilization
aws_sagemaker_trainingjobs_gpuutilization_normalizedGPUUtilizationNormalized
aws_sagemaker_trainingjobs_memory_reservationMemoryReservation
aws_sagemaker_trainingjobs_memory_utilizationMemoryUtilization

AWS/SageMaker/TransformJobs

Function: Enables large-scale, batch ML model inferences for data transformations

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_sagemaker_transformjobs_info
aws_sagemaker_transformjobs_cpureservationCPUReservation
aws_sagemaker_transformjobs_cpuutilizationCPUUtilization
aws_sagemaker_transformjobs_cpuutilization_normalizedCPUUtilizationNormalized
aws_sagemaker_transformjobs_disk_utilizationDiskUtilization
aws_sagemaker_transformjobs_gpumemory_utilizationGPUMemoryUtilization
aws_sagemaker_transformjobs_gpumemory_utilization_normalizedGPUMemoryUtilizationNormalized
aws_sagemaker_transformjobs_gpureservationGPUReservation
aws_sagemaker_transformjobs_gpuutilizationGPUUtilization
aws_sagemaker_transformjobs_gpuutilization_normalizedGPUUtilizationNormalized
aws_sagemaker_transformjobs_memory_reservationMemoryReservation
aws_sagemaker_transformjobs_memory_utilizationMemoryUtilization

AWS/Scheduler

Function: Managed service to trigger events or workflows at a scheduled time

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_scheduler_invocation_attempt_countInvocationAttemptCount
aws_scheduler_invocation_dropped_countInvocationDroppedCount
aws_scheduler_invocation_throttle_countInvocationThrottleCount
aws_scheduler_invocations_failed_to_be_sent_to_dead_letter_countInvocationsFailedToBeSentToDeadLetterCount
aws_scheduler_invocations_sent_to_dead_letter_countInvocationsSentToDeadLetterCount
aws_scheduler_invocations_sent_to_dead_letter_count_truncated_message_size_exceededInvocationsSentToDeadLetterCount_Truncated_MessageSizeExceeded
aws_scheduler_target_error_countTargetErrorCount
aws_scheduler_target_error_throttled_countTargetErrorThrottledCount

AWS/States

Function: AWS Step Functions for orchestrating workflows and coordinating services

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_states_info
aws_states_activities_failedActivitiesFailed
aws_states_activities_heartbeat_timed_outActivitiesHeartbeatTimedOut
aws_states_activities_scheduledActivitiesScheduled
aws_states_activities_startedActivitiesStarted
aws_states_activities_succeededActivitiesSucceeded
aws_states_activities_timed_outActivitiesTimedOut
aws_states_activity_run_timeActivityRunTime
aws_states_activity_schedule_timeActivityScheduleTime
aws_states_activity_timeActivityTime
aws_states_consumed_capacityConsumedCapacity
aws_states_execution_throttledExecutionThrottled
aws_states_execution_timeExecutionTime
aws_states_executions_abortedExecutionsAborted
aws_states_executions_failedExecutionsFailed
aws_states_executions_startedExecutionsStarted
aws_states_executions_succeededExecutionsSucceeded
aws_states_executions_timed_outExecutionsTimedOut
aws_states_express_execution_billed_durationExpressExecutionBilledDuration
aws_states_express_execution_billed_memoryExpressExecutionBilledMemory
aws_states_express_execution_memoryExpressExecutionMemory
aws_states_lambda_function_run_timeLambdaFunctionRunTime
aws_states_lambda_function_schedule_timeLambdaFunctionScheduleTime
aws_states_lambda_function_timeLambdaFunctionTime
aws_states_lambda_functions_failedLambdaFunctionsFailed
aws_states_lambda_functions_scheduledLambdaFunctionsScheduled
aws_states_lambda_functions_startedLambdaFunctionsStarted
aws_states_lambda_functions_succeededLambdaFunctionsSucceeded
aws_states_lambda_functions_timed_outLambdaFunctionsTimedOut
aws_states_provisioned_bucket_sizeProvisionedBucketSize
aws_states_provisioned_refill_rateProvisionedRefillRate
aws_states_service_integration_run_timeServiceIntegrationRunTime
aws_states_service_integration_schedule_timeServiceIntegrationScheduleTime
aws_states_service_integration_timeServiceIntegrationTime
aws_states_service_integrations_failedServiceIntegrationsFailed
aws_states_service_integrations_scheduledServiceIntegrationsScheduled
aws_states_service_integrations_startedServiceIntegrationsStarted
aws_states_service_integrations_succeededServiceIntegrationsSucceeded
aws_states_service_integrations_timed_outServiceIntegrationsTimedOut
aws_states_throttled_eventsThrottledEvents

AWS/StorageGateway

Function: Hybrid cloud storage service connecting on-premises software appliances to AWS

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_storagegateway_info
aws_storagegateway_cache_freeCacheFree
aws_storagegateway_cache_hit_percentCacheHitPercent
aws_storagegateway_cache_percent_dirtyCachePercentDirty
aws_storagegateway_cache_percent_usedCachePercentUsed
aws_storagegateway_cache_usedCacheUsed
aws_storagegateway_cloud_bytes_downloadedCloudBytesDownloaded
aws_storagegateway_cloud_bytes_uploadedCloudBytesUploaded
aws_storagegateway_cloud_download_latencyCloudDownloadLatency
aws_storagegateway_queued_writesQueuedWrites
aws_storagegateway_read_bytesReadBytes
aws_storagegateway_read_timeReadTime
aws_storagegateway_time_since_last_recovery_pointTimeSinceLastRecoveryPoint
aws_storagegateway_total_cache_sizeTotalCacheSize
aws_storagegateway_upload_buffer_freeUploadBufferFree
aws_storagegateway_upload_buffer_percent_usedUploadBufferPercentUsed
aws_storagegateway_upload_buffer_usedUploadBufferUsed
aws_storagegateway_working_storage_freeWorkingStorageFree
aws_storagegateway_working_storage_percent_usedWorkingStoragePercentUsed
aws_storagegateway_working_storage_usedWorkingStorageUsed
aws_storagegateway_write_bytesWriteBytes
aws_storagegateway_write_timeWriteTime

AWS/Timestream

Function: Managed time series database for IoT and operational applications

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_timestream_data_scanned_bytesDataScannedBytes
aws_timestream_successful_request_latencySuccessfulRequestLatency
aws_timestream_system_errorsSystemErrors
aws_timestream_user_errorsUserErrors

AWS/TransitGateway

Function: Service for connecting VPCs and on-premises networks through a central hub

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_transitgateway_info
aws_transitgateway_bytes_inBytesIn
aws_transitgateway_bytes_outBytesOut
aws_transitgateway_packet_drop_count_blackholePacketDropCountBlackhole
aws_transitgateway_packet_drop_count_no_routePacketDropCountNoRoute
aws_transitgateway_packets_inPacketsIn
aws_transitgateway_packets_outPacketsOut

AWS/TrustedAdvisor

Function: Provides real-time recommendations to improve AWS resource optimization and security. This service only produces metrics to specific regions in AWS. Any jobs configured with this service will only gather data from the us-east-1 regions.

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_trustedadvisor_green_checksGreenChecks
aws_trustedadvisor_red_checksRedChecks
aws_trustedadvisor_red_resourcesRedResources
aws_trustedadvisor_service_limit_usageServiceLimitUsage
aws_trustedadvisor_yellow_checksYellowChecks
aws_trustedadvisor_yellow_resourcesYellowResources

AWS/Usage

Function: Tracks AWS service usage for cost monitoring and optimization

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_usage_call_countCallCount
aws_usage_resource_countResourceCount

AWS/VPN

Function: Managed VPN service to securely connect on-premises networks to AWS

Scrape interval: 5 minutes

Includes: Out-of-the-box dashboard

MetricCloudwatch Metric
aws_vpn_info
aws_vpn_tunnel_data_inTunnelDataIn
aws_vpn_tunnel_data_outTunnelDataOut
aws_vpn_tunnel_stateTunnelState

AWS/WAFV2

Function: Web application firewall to protect applications from common web exploits

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_wafv2_info
aws_wafv2_allowed_requestsAllowedRequests
aws_wafv2_blocked_requestsBlockedRequests
aws_wafv2_captcha_requestsCaptchaRequests
aws_wafv2_captchas_attemptedCaptchasAttempted
aws_wafv2_captchas_solvedCaptchasSolved
aws_wafv2_challenge_requestsChallengeRequests
aws_wafv2_counted_requestsCountedRequests
aws_wafv2_passed_requestsPassedRequests
aws_wafv2_requests_with_valid_captcha_tokenRequestsWithValidCaptchaToken
aws_wafv2_requests_with_valid_challenge_tokenRequestsWithValidChallengeToken

AWS/WorkSpaces

Function: Managed desktop virtualization service for delivering cloud-based desktops

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_workspaces_info
aws_workspaces_availableAvailable
aws_workspaces_connection_attemptConnectionAttempt
aws_workspaces_connection_failureConnectionFailure
aws_workspaces_connection_successConnectionSuccess
aws_workspaces_in_session_latencyInSessionLatency
aws_workspaces_maintenanceMaintenance
aws_workspaces_session_disconnectSessionDisconnect
aws_workspaces_session_launch_timeSessionLaunchTime
aws_workspaces_stoppedStopped
aws_workspaces_unhealthyUnhealthy
aws_workspaces_user_connectedUserConnected

AmazonMWAA

Function: Managed service for Apache Airflow workflows in the cloud

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_amazonmwaa_info
aws_amazonmwaa_collect_dbdagsCollectDBDags
aws_amazonmwaa_critical_section_busyCriticalSectionBusy
aws_amazonmwaa_critical_section_durationCriticalSectionDuration
aws_amazonmwaa_critical_section_query_durationCriticalSectionQueryDuration
aws_amazonmwaa_dagdependency_checkDAGDependencyCheck
aws_amazonmwaa_dagduration_failedDAGDurationFailed
aws_amazonmwaa_dagduration_successDAGDurationSuccess
aws_amazonmwaa_dagfile_processing_last_durationDAGFileProcessingLastDuration
aws_amazonmwaa_dagfile_processing_last_run_seconds_agoDAGFileProcessingLastRunSecondsAgo
aws_amazonmwaa_dagfile_refresh_errorDAGFileRefreshError
aws_amazonmwaa_dagschedule_delayDAGScheduleDelay
aws_amazonmwaa_dag_bag_sizeDagBagSize
aws_amazonmwaa_dag_callback_exceptionsDagCallbackExceptions
aws_amazonmwaa_exception_failuresExceptionFailures
aws_amazonmwaa_executed_tasksExecutedTasks
aws_amazonmwaa_failed_celery_task_executionFailedCeleryTaskExecution
aws_amazonmwaa_failed_slacallbackFailedSLACallback
aws_amazonmwaa_failed_slaemail_attemptsFailedSLAEmailAttempts
aws_amazonmwaa_file_path_queue_update_countFilePathQueueUpdateCount
aws_amazonmwaa_first_task_scheduling_delayFirstTaskSchedulingDelay
aws_amazonmwaa_import_errorsImportErrors
aws_amazonmwaa_infra_failuresInfraFailures
aws_amazonmwaa_job_endJobEnd
aws_amazonmwaa_job_heartbeat_failureJobHeartbeatFailure
aws_amazonmwaa_job_startJobStart
aws_amazonmwaa_loaded_tasksLoadedTasks
aws_amazonmwaa_manager_stallsManagerStalls
aws_amazonmwaa_open_slotsOpenSlots
aws_amazonmwaa_operator_failuresOperatorFailures
aws_amazonmwaa_operator_successesOperatorSuccesses
aws_amazonmwaa_orphanedOrphaned
aws_amazonmwaa_orphaned_tasks_adoptedOrphanedTasksAdopted
aws_amazonmwaa_orphaned_tasks_clearedOrphanedTasksCleared
aws_amazonmwaa_other_callback_countOtherCallbackCount
aws_amazonmwaa_poked_exceptionsPokedExceptions
aws_amazonmwaa_poked_successPokedSuccess
aws_amazonmwaa_poked_tasksPokedTasks
aws_amazonmwaa_pool_deferred_slotsPoolDeferredSlots
aws_amazonmwaa_pool_failuresPoolFailures
aws_amazonmwaa_pool_open_slotsPoolOpenSlots
aws_amazonmwaa_pool_queued_slotsPoolQueuedSlots
aws_amazonmwaa_pool_running_slotsPoolRunningSlots
aws_amazonmwaa_pool_starving_tasksPoolStarvingTasks
aws_amazonmwaa_processesProcesses
aws_amazonmwaa_processor_timeoutsProcessorTimeouts
aws_amazonmwaa_queued_tasksQueuedTasks
aws_amazonmwaa_running_tasksRunningTasks
aws_amazonmwaa_slamissedSLAMissed
aws_amazonmwaa_scheduler_heartbeatSchedulerHeartbeat
aws_amazonmwaa_scheduler_loop_durationSchedulerLoopDuration
aws_amazonmwaa_sla_callback_countSlaCallbackCount
aws_amazonmwaa_started_task_instancesStartedTaskInstances
aws_amazonmwaa_task_instance_created_using_operatorTaskInstanceCreatedUsingOperator
aws_amazonmwaa_task_instance_durationTaskInstanceDuration
aws_amazonmwaa_task_instance_failuresTaskInstanceFailures
aws_amazonmwaa_task_instance_finishedTaskInstanceFinished
aws_amazonmwaa_task_instance_previously_succeededTaskInstancePreviouslySucceeded
aws_amazonmwaa_task_instance_queued_durationTaskInstanceQueuedDuration
aws_amazonmwaa_task_instance_scheduled_durationTaskInstanceScheduledDuration
aws_amazonmwaa_task_instance_successesTaskInstanceSuccesses
aws_amazonmwaa_task_removed_from_dagTaskRemovedFromDAG
aws_amazonmwaa_task_restored_to_dagTaskRestoredToDAG
aws_amazonmwaa_task_timeout_errorTaskTimeoutError
aws_amazonmwaa_tasks_executableTasksExecutable
aws_amazonmwaa_tasks_killed_externallyTasksKilledExternally
aws_amazonmwaa_tasks_pendingTasksPending
aws_amazonmwaa_tasks_runningTasksRunning
aws_amazonmwaa_tasks_starvingTasksStarving
aws_amazonmwaa_tasks_without_dag_runTasksWithoutDagRun
aws_amazonmwaa_total_parse_timeTotalParseTime
aws_amazonmwaa_trigger_heartbeatTriggerHeartbeat
aws_amazonmwaa_triggered_dag_runsTriggeredDagRuns
aws_amazonmwaa_triggers_blocked_main_threadTriggersBlockedMainThread
aws_amazonmwaa_triggers_failedTriggersFailed
aws_amazonmwaa_triggers_runningTriggersRunning
aws_amazonmwaa_triggers_succeededTriggersSucceeded
aws_amazonmwaa_updatesUpdates
aws_amazonmwaa_zombies_killedZombiesKilled

ECS/ContainerInsights

Function: Provides monitoring and insights for ECS clusters, tasks, and containers

Scrape interval: 5 minutes

MetricCloudwatch Metric
aws_ecs_containerinsights_info
aws_ecs_containerinsights_container_instance_countContainerInstanceCount
aws_ecs_containerinsights_cpu_reservedCpuReserved
aws_ecs_containerinsights_cpu_utilizedCpuUtilized
aws_ecs_containerinsights_deployment_countDeploymentCount
aws_ecs_containerinsights_desired_task_countDesiredTaskCount
aws_ecs_containerinsights_ebsfilesystem_sizeEBSFilesystemSize
aws_ecs_containerinsights_ebsfilesystem_utilizedEBSFilesystemUtilized
aws_ecs_containerinsights_ephemeral_storage_reservedEphemeralStorageReserved
aws_ecs_containerinsights_ephemeral_storage_utilizedEphemeralStorageUtilized
aws_ecs_containerinsights_memory_reservedMemoryReserved
aws_ecs_containerinsights_memory_utilizedMemoryUtilized
aws_ecs_containerinsights_network_rx_bytesNetworkRxBytes
aws_ecs_containerinsights_network_tx_bytesNetworkTxBytes
aws_ecs_containerinsights_pending_task_countPendingTaskCount
aws_ecs_containerinsights_running_task_countRunningTaskCount
aws_ecs_containerinsights_service_countServiceCount
aws_ecs_containerinsights_storage_read_bytesStorageReadBytes
aws_ecs_containerinsights_storage_write_bytesStorageWriteBytes
aws_ecs_containerinsights_task_countTaskCount
aws_ecs_containerinsights_task_set_countTaskSetCount
aws_ecs_containerinsights_instance_cpu_limitinstance_cpu_limit
aws_ecs_containerinsights_instance_cpu_reserved_capacityinstance_cpu_reserved_capacity
aws_ecs_containerinsights_instance_cpu_usage_totalinstance_cpu_usage_total
aws_ecs_containerinsights_instance_cpu_utilizationinstance_cpu_utilization
aws_ecs_containerinsights_instance_filesystem_utilizationinstance_filesystem_utilization
aws_ecs_containerinsights_instance_memory_limitinstance_memory_limit
aws_ecs_containerinsights_instance_memory_reserved_capacityinstance_memory_reserved_capacity
aws_ecs_containerinsights_instance_memory_utilizationinstance_memory_utilization
aws_ecs_containerinsights_instance_memory_working_setinstance_memory_working_set
aws_ecs_containerinsights_instance_network_total_bytesinstance_network_total_bytes
aws_ecs_containerinsights_instance_number_of_running_tasksinstance_number_of_running_tasks
aws_ecs_containerinsights_instance_memory_utliizationinstance_memory_utliization