Canton metrics

Piotr_Ostrowski · November 9, 2022, 7:54am

Hello,

We’re setting up monitoring for Canton domain, part of the job is to create dashboards based on the metrics scraped from Canton components. Looking at all the metrics available in docs and wondering whether there is a set of the most valuable/recommended metrics one should include?

Thanks for any suggestions!

Ratko_Veprek · November 10, 2022, 10:06am

Yes, we know. We are working on an improvement that will label “key metrics” that you should observe appropriately, such that you can distinguish them from metrics that are more useful to debug detailed behaviour.

Which metrics you want to monitor depends on your taste. The ones I find important are:

canton.db-storage..executor.queued
daml.commands.failed_command_interpretations
daml.commands.submissions_running
daml.lapi.streams.active
canton..sequencer-client.delay and load
canton.mediator.outstanding-requests
canton.mediator.requests
canton.sequencer.processed
canton.sequencer.subscriptions

But if you are a bit patient, then @simon might point you to a new piece of documentation that gives concrete recommendations.

Curtis_Hrischuk · November 10, 2022, 1:27pm

Hi Piotr. Metris are important for monitoring and there are two levels.

The first level would be infrastructure related metrics like: CPU utilization, virtual memory paging, disk IO rates / latencies, JVM garbage collection, etc. These first level metrics are key to monitoring for capacity and health.

The second level of metrics include what @Ratko_Veprek mentions but these are very fine grained. We are working on metrics that support the SRE Golden signals methodology (aka “REDS”). Development work is progressing quickly and I was wondering if you might be willing to evaluate the Beta version and provide feedback?

Piotr_Ostrowski · November 11, 2022, 4:46am

Hi Curtis,

I gathered the infrastructure level metrics from canton jvm_metrics as well as additional node_exporter for host metrics. The more fine grained like requests, errors, saturation etc. is what i’m after - seems you’re already on that following RED/Golden signals so i’d gladly evaluate your beta version.

Topic		Replies	Views
Is there an example how to use metrics within canton Questions canton	1	177	September 30, 2022
Monitoring Canton using Grafana Questions daml , canton , grafana	1	323	January 24, 2022
Recommended way to measure latency Questions canton	1	240	September 1, 2022
Tools for monitoring and measurement Questions	1	134	April 13, 2022
Performance testing Questions daml , canton	5	604	February 28, 2022

Canton metrics

Related topics