[IGNITE-21059] We have upgraded our ignite instance from 2.7.6 to 2.14. Found long running cache operations - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Open
Priority: Critical
Resolution: Unresolved
Affects Version/s: 2.14
Fix Version/s: None
Component/s: binary, clients
Labels:
None

Ignite Flags:

Docs Required, Release Notes Required

Description

We have recently upgraded from 2.7.6 to 2.14 due to the issue observed in production environment where cluster would go in hang state due to partition map exchange.

Please find the below ticket which i created a while back for ignite 2.7.6

https://issues.apache.org/jira/browse/IGNITE-13298

So we migrated the apache ignite version to 2.14 and upgrade happened smoothly but on the third day we could see cluster traffic dip again.

We have 5 nodes in a cluster where we provide 400 GB of RAM and more than 1 TB SDD.

PFB for the attached config.[I have added it as attachment for review]

I have also added the server logs from the same time when issue happened.

We have set txn timeout as well as socket timeout both at server and client end for our write operations but seems like sometimes cluster goes into hang state and all our get calls are stuck and slowly everything starts to freeze our jms listener threads and every thread reaches a choked up state in sometime.

Due to which our read services which does not even use txn to retrieve data also starts to choke. Ultimately leading to end user traffic dip.

We were hoping product upgrade will help but that has not been the case till now.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

digiapi-eventprocessing-app-zone1-696c8c4946-7d57w-jstck.txt2
12/Dec/23 06:41
1.45 MB
Vipul Thakur
digiapi-eventprocessing-app-zone1-696c8c4946-62jbx-jstck.txt1
12/Dec/23 06:41
1.30 MB
Vipul Thakur
digiapi-eventprocessing-app-zone1-696c8c4946-62jbx-jstck.txt2
12/Dec/23 06:41
1.31 MB
Vipul Thakur
digiapi-eventprocessing-app-zone1-696c8c4946-7d57w-jstck.txt1
12/Dec/23 06:41
1.46 MB
Vipul Thakur
digiapi-eventprocessing-app-zone1-696c8c4946-62jbx-jstck.txt3
12/Dec/23 06:42
1.32 MB
Vipul Thakur
cache-config-1.xml
12/Dec/23 06:48
27 kB
Vipul Thakur
ignite-server-nohup.out
12/Dec/23 06:54
12.64 MB
Vipul Thakur
Ignite_server_logs.zip
14/Dec/23 12:42
28.65 MB
Vipul Thakur
client-service.zip
14/Dec/23 12:47
413 kB
Vipul Thakur
long_txn_.png
26/Dec/23 06:07
946 kB
Vipul Thakur
ignite-server-nohup-1.out
26/Dec/23 07:08
12.64 MB
Vipul Thakur
image.png
28/Dec/23 07:41
32 kB
Vipul Thakur
nohup_12.out
29/Dec/23 14:53
7.30 MB
Vipul Thakur
digiapi-eventprocessing-app-zone1-6685b8d7f7-ntw27.log
02/Jan/24 03:32
22.19 MB
Vipul Thakur
ignite_issue_1101.zip
11/Jan/24 16:53
375 kB
Vipul Thakur
image-2024-01-11-22-28-51-501.png
11/Jan/24 16:58
281 kB
Vipul Thakur

Activity

People

Assignee:: Unassigned

Reporter:: Vipul Thakur

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 12/Dec/23 06:49

Updated:: 25/Jan/24 07:08