Copyright (C) The Arvados Authors. All rights reserved.

SPDX-License-Identifier: CC-BY-SA-3.0
Some Arvados services publish Prometheus/OpenMetrics-compatible metrics at @/metrics@, and some provide additional runtime status at @/status.json@. Metrics can help you understand how components perform under load, find performance bottlenecks, and detect and diagnose problems.

To access metrics endpoints, services must be configured with a "management token":management-token.html. When accessing a metrics endpoint, prefix the management token with @"Bearer "@ and supply it in the @Authorization@ request header.

<pre>curl -sfH "Authorization: Bearer your_management_token_goes_here" "https://0.0.0.0:25107/status.json"
</pre>
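The same request can be sketched in Python using only the standard library. This is a minimal illustration, not part of Arvados: the function names @build_status_request@ and @fetch_status@ are hypothetical, and the URL and token are placeholders you would supply yourself.

```python
import urllib.request


def build_status_request(url: str, management_token: str) -> urllib.request.Request:
    """Build a GET request that carries the management token as a Bearer credential."""
    return urllib.request.Request(
        url,
        headers={"Authorization": "Bearer " + management_token},
    )


def fetch_status(url: str, management_token: str, timeout: float = 10.0) -> bytes:
    """Send the request and return the raw response body.

    Raises urllib.error.HTTPError on a non-2xx response, much as curl -f
    exits with an error instead of printing the body.
    """
    request = build_status_request(url, management_token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

Building the request separately from sending it makes it easy to check the header without contacting a server.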
Keep-web exports metrics at @/metrics@ -- e.g., @https://collections.zzzzz.arvadosapi.com/metrics@.

table(table table-bordered table-condensed).
|_. Name|_. Type|_. Description|
|request_duration_seconds|summary|elapsed time between receiving a request and sending the last byte of the response body (segmented by HTTP request method and response status code)|
|time_to_status_seconds|summary|elapsed time between receiving a request and sending the HTTP response status code (segmented by HTTP request method and response status code)|
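A Prometheus summary is normally exposed with companion @_sum@ and @_count@ series, so a mean latency per label set can be derived by dividing one by the other. The sketch below parses the text exposition format with a deliberately simple regular expression; the function name @mean_by_labels@ is hypothetical, and the exact series names and label names that keep-web emits are assumptions, so pass in whatever name appears in your own @/metrics@ output.

```python
import re

# One sample line: metric name, optional {labels}, then a numeric value.
# Assumption: label values contain no "}" characters, which keeps the regex simple.
_SAMPLE = re.compile(r'^([A-Za-z_:][A-Za-z0-9_:]*)(\{[^}]*\})?\s+(\S+)')


def mean_by_labels(metrics_text: str, name: str) -> dict:
    """Return {label_string: sum / count} for the summary called `name`."""
    sums = {}
    counts = {}
    for line in metrics_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        match = _SAMPLE.match(line)
        if match is None:
            continue
        metric, labels, value = match.group(1), match.group(2) or "", float(match.group(3))
        if metric == name + "_sum":
            sums[labels] = value
        elif metric == name + "_count":
            counts[labels] = value
    # Only report label sets that have observed at least one request.
    return {
        labels: sums[labels] / counts[labels]
        for labels in sums
        if counts.get(labels, 0) > 0
    }
```

Because both series are cumulative, this gives the mean since the process started; for a recent window, subtract an earlier scrape from a later one before dividing.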
Metrics in the @arvados_keepweb_collectioncache@ namespace report keep-web's internal cache of Arvados collection metadata.

table(table table-bordered table-condensed).
|_. Name|_. Type|_. Description|
|arvados_keepweb_collectioncache_requests|counter|cache lookups|
|arvados_keepweb_collectioncache_api_calls|counter|outgoing API calls|
|arvados_keepweb_collectioncache_permission_hits|counter|collection-to-permission cache hits|
|arvados_keepweb_collectioncache_pdh_hits|counter|UUID-to-PDH cache hits|
|arvados_keepweb_collectioncache_hits|counter|PDH-to-manifest cache hits|
|arvados_keepweb_collectioncache_cached_manifests|gauge|number of collections in the cache|
|arvados_keepweb_collectioncache_cached_manifest_bytes|gauge|memory consumed by cached collection manifests|
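Counters only ever increase, so a hit ratio is most meaningful when computed over the difference between two scrapes. The sketch below assumes each scrape has already been parsed into a dict of metric name to value, and it assumes that dividing each hit counter by @arvados_keepweb_collectioncache_requests@ is a useful ratio; the source does not say that every lookup consults all three caches, so treat the results as indicative. The function name @hit_ratios@ is hypothetical.

```python
from typing import Dict, Optional

PREFIX = "arvados_keepweb_collectioncache_"
HIT_COUNTERS = ("permission_hits", "pdh_hits", "hits")


def hit_ratios(before: Dict[str, float], after: Dict[str, float]) -> Dict[str, Optional[float]]:
    """Return hits/requests for each hit counter over the interval between two scrapes."""
    lookups = after.get(PREFIX + "requests", 0) - before.get(PREFIX + "requests", 0)
    ratios = {}
    for counter in HIT_COUNTERS:
        delta = after.get(PREFIX + counter, 0) - before.get(PREFIX + counter, 0)
        # No lookups in the interval (or a counter reset) means no meaningful ratio.
        ratios[counter] = delta / lookups if lookups > 0 else None
    return ratios
```

A negative delta would indicate that keep-web restarted between scrapes; a fuller version would detect that and discard the interval.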
Keepstore exports metrics at @/status.json@ -- e.g., @http://keep0.zzzzz.arvadosapi.com:25107/status.json@.

table(table table-bordered table-condensed).
|_. Attribute|_. Type|_. Description|
|Volumes| array of "volumeStatusEnt":#volumeStatusEnt ||
|BufferPool| "PoolStatus":#PoolStatus ||
|PullQueue| "WorkQueueStatus":#WorkQueueStatus ||
|TrashQueue| "WorkQueueStatus":#WorkQueueStatus ||
|RequestsCurrent| int ||
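As a rough illustration of how the top-level attributes above might be consumed, the following sketch parses a @/status.json@ body and reports a few basics. The function name @summarize_keepstore_status@ and the shape of the returned dict are hypothetical; only the attribute names come from the table.

```python
import json


def summarize_keepstore_status(raw_json: str) -> dict:
    """Parse a keepstore status document and pull out a few top-level facts."""
    status = json.loads(raw_json)
    return {
        # Volumes is an array of volumeStatusEnt objects.
        "volume_count": len(status.get("Volumes") or []),
        # RequestsCurrent is a plain integer.
        "requests_current": status.get("RequestsCurrent", 0),
        # Record which of the nested status objects are present at all.
        "sections_present": sorted(
            key for key in ("BufferPool", "PullQueue", "TrashQueue") if key in status
        ),
    }
```

Using @.get@ with defaults keeps the sketch tolerant of attributes that a given keepstore version might omit.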
h3(#volumeStatusEnt). volumeStatusEnt

table(table table-bordered table-condensed).
|_. Attribute|_. Type|_. Description|
|Status| "VolumeStatus":#VolumeStatus ||
|VolumeStats| "ioStats":#ioStats ||
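One use of the per-volume entries is to estimate how full each volume is. The sketch below assumes that the @BytesUsed@ and @BytesFree@ values seen in the sample output live inside each entry's @Status@ object, and that each entry carries a @Label@; both are assumptions about nesting that you should check against your own @/status.json@ output. The function name @volume_usage@ is hypothetical.

```python
def volume_usage(status: dict) -> list:
    """Return one {"label", "used_fraction"} row per entry in status["Volumes"]."""
    rows = []
    for entry in status.get("Volumes") or []:
        volume_status = entry.get("Status") or {}  # assumed location of the byte counts
        used = volume_status.get("BytesUsed", 0)
        free = volume_status.get("BytesFree", 0)
        total = used + free
        rows.append({
            "label": entry.get("Label", ""),
            # None signals that the volume reported no usable byte counts.
            "used_fraction": used / total if total > 0 else None,
        })
    return rows
```

The fraction is computed against used plus free bytes, which may differ from the device's raw capacity if the filesystem reserves space.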
h3(#VolumeStatus). VolumeStatus

table(table table-bordered table-condensed).
|_. Attribute|_. Type|_. Description|

table(table table-bordered table-condensed).
|_. Attribute|_. Type|_. Description|
h3(#PoolStatus). PoolStatus

table(table table-bordered table-condensed).
|_. Attribute|_. Type|_. Description|
|BytesAllocatedCumulative| uint64||
h3(#WorkQueueStatus). WorkQueueStatus

table(table table-bordered table-condensed).
|_. Attribute|_. Type|_. Description|
<pre>
"Label": "[UnixVolume /var/lib/arvados/keep0]",
"MountPoint": "/var/lib/arvados/keep0",
"BytesFree": 222532972544,
"BytesUsed": 435456679936
"BytesAllocatedCumulative": 67108864,
"RequestsCurrent": 1,
</pre>
The node manager status endpoint provides a snapshot of internal status at the time of the most recent wishlist update.

table(table table-bordered table-condensed).
|_. Attribute|_. Type|_. Description|
|nodes_booting|int|Number of nodes in booting state|
|nodes_unpaired|int|Number of nodes in unpaired state|
|nodes_busy|int|Number of nodes in busy state|
|nodes_idle|int|Number of nodes in idle state|
|nodes_fail|int|Number of nodes in fail state|
|nodes_down|int|Number of nodes in down state|
|nodes_shutdown|int|Number of nodes in shutdown state|
|nodes_wish|int|Number of nodes in the current wishlist|
|node_quota|int|Current node count ceiling due to cloud quota limits|
|config_max_nodes|int|Configured max node count|
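A monitoring script might condense these attributes into per-state counts and a check on whether the wishlist exceeds the node ceiling. The sketch below takes the lower of @node_quota@ and @config_max_nodes@ as the effective ceiling; that is an assumption made for illustration, and the source does not state how node manager combines the two. The function name @summarize_node_status@ is hypothetical.

```python
def summarize_node_status(status: dict) -> dict:
    """Condense a node manager status dict into state counts and a ceiling check."""
    # Collect every nodes_<state> counter except the wishlist size.
    by_state = {
        key[len("nodes_"):]: value
        for key, value in status.items()
        if key.startswith("nodes_") and key != "nodes_wish"
    }
    # Assumed effective ceiling: the stricter of the two limits that are present.
    limits = [
        status[key]
        for key in ("node_quota", "config_max_nodes")
        if isinstance(status.get(key), int)
    ]
    ceiling = min(limits) if limits else None
    wish = status.get("nodes_wish", 0)
    return {
        "by_state": by_state,
        "ceiling": ceiling,
        "wishlist_exceeds_ceiling": ceiling is not None and wish > ceiling,
    }
```

A wishlist that stays above the ceiling for long periods suggests that the quota or the configured maximum is limiting throughput.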
<pre>
"actor_exceptions": 0,
"create_node_errors": 0,
"destroy_node_errors": 0,
"config_max_nodes": 8,
"list_nodes_errors": 0,
"Version": "1.1.4.20180719160944",
</pre>