DamlLedgerClient grpc issues

Turan · September 7, 2022, 3:35pm

Hi,

We have begun getting the below error messages when making use of the ledger client. It has manifested itself in a way in that making a query on any ledgerclient we have, results in the error appearing in all ledger clients connected to the same domain. We believe this is a consequence of another issue within canton as certain commands submissions via the client fail silently when we start seeing these.

Versions
canton: 2.3.2
java bindings: 2.3.2

2022-09-07 15:01:19.098 ERROR 12 --- [     parallel-1] i.g.i.ManagedChannelOrphanWrapper        : *~*~*~ Channel ManagedChannelImpl{logId=649, target=HOST:PORT} was not shutdown properly!!! ~*~*~*
    Make sure to call shutdown()/shutdownNow() and wait until awaitTermination() returns true.
java.lang.RuntimeException: ManagedChannel allocation site
	at io.grpc.internal.ManagedChannelOrphanWrapper$ManagedChannelReference.<init>(ManagedChannelOrphanWrapper.java:93) ~[grpc-core-1.44.0.jar!/:1.44.0]
	at io.grpc.internal.ManagedChannelOrphanWrapper.<init>(ManagedChannelOrphanWrapper.java:53) ~[grpc-core-1.44.0.jar!/:1.44.0]
	at io.grpc.internal.ManagedChannelOrphanWrapper.<init>(ManagedChannelOrphanWrapper.java:44) ~[grpc-core-1.44.0.jar!/:1.44.0]
	at io.grpc.internal.ManagedChannelImplBuilder.build(ManagedChannelImplBuilder.java:630) ~[grpc-core-1.44.0.jar!/:1.44.0]
	at io.grpc.internal.AbstractManagedChannelImplBuilder.build(AbstractManagedChannelImplBuilder.java:297) ~[grpc-core-1.44.0.jar!/:1.44.0]
	at com.daml.ledger.rxjava.DamlLedgerClient.<init>(DamlLedgerClient.java:134) ~[bindings-rxjava-2.3.2.jar!/:na]
	at com.daml.ledger.rxjava.DamlLedgerClient$Builder.build(DamlLedgerClient.java:83) ~[bindings-rxjava-2.3.2.jar!/:na]

bernhard · September 7, 2022, 3:50pm

Hi Turan, do you have the logs of the Participant node that ledger client was connected to to hand? It looks to me like the Participant shut down the connection, which suggests some sort of catastrophic failure. Such catastrophic failures can occur for example when running in containers with very little allocated memory and the JVM throws OOMs. But I’m only speculating here. Participant (and ideally also domain) node logs would help diagnose.

Turan · September 7, 2022, 4:27pm

We are indeed running in containers. Will look to extract the logs.

Do you have a recommended core/thread/memory count for canton (or JVM args to override)?

By the way this only began manifesting itself after we went from daml 2.0.0 to 2.3.2

bernhard · September 9, 2022, 3:02pm

If you are running with n CPU cores, try setting 2+n GB memory.

Turan · September 13, 2022, 11:20am

Is there a way to limit a canton node from being taken down by a large number of connections?

bernhard · September 13, 2022, 11:54am

Is this what you are after? Static Configuration — Daml SDK 2.3.4 documentation

Topic		Replies	Views
Facing error in the canton participant node (UNAVAILABLE: SERVICE_NOT_RUNNING(1,0): Ledger API offset dispatcher has been shut down.) Questions daml , canton	2	97	January 29, 2024
Grpcurl accessing ledger-api in daml v1.18.1 and v2.0 Questions daml , ledger-api	1	253	April 26, 2022
Command Service Queue Has Been Shutdown Questions canton	8	352	April 26, 2023
Problems with using DamlLedgerClient Questions	0	127	February 24, 2023
Getting NOT_CONNECTED_TO_ANY_DOMAIN in kubernetes after a couple of hours of the infrastructure running Questions canton	7	644	December 21, 2022

DamlLedgerClient grpc issues

Related topics