Killing the primary and backup of the same partition causes the client to hang even though the partition has been relocated elsewhere

Description

Reproduced in 10.2.1 but not in 12.1 M19.
Reproduction:
1. Start gs-agent with 4 GSCs, 2 LUSs and 2 GSMs.
2. Deploy mySpace with SLA 2,1 so that each space instance is hosted on a separate GSC.
3. From the client, call writeMultiple with 1M entries.
4. Wait 5 seconds. In my test, nothing had been written yet at this point.
5. Kill both GSCs that host the primary and backup of the same partition.
6. Take 2 thread dumps from the client. The main thread is stuck at:
at com.gigaspaces.internal.remoting.RemoteOperationFutureListener.waitForCompletion(RemoteOperationFutureListener.java:120)
7. The partitions were relocated to new GSCs.
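
For step 6, the two dumps can also be compared from inside the client JVM. The following is a minimal sketch using only the standard java.lang.management API. The class and method names (StuckThreadCheck, snapshot, looksStuck) are hypothetical and are not part of the test client or of GigaSpaces. It only reports that a thread shows the same stack in two samples, which suggests a hang but does not prove one.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;
import java.util.Arrays;

// Hypothetical helper: samples a thread's stack twice and compares the samples.
class StuckThreadCheck {

    // Returns the current stack of the given thread id,
    // or an empty array if the thread is no longer alive.
    static StackTraceElement[] snapshot(long threadId) {
        ThreadMXBean bean = ManagementFactory.getThreadMXBean();
        ThreadInfo info = bean.getThreadInfo(threadId, Integer.MAX_VALUE);
        return info == null ? new StackTraceElement[0] : info.getStackTrace();
    }

    // True if two samples taken pauseMillis apart show the same non-empty stack.
    static boolean looksStuck(Thread thread, long pauseMillis) throws InterruptedException {
        StackTraceElement[] first = snapshot(thread.getId());
        Thread.sleep(pauseMillis);
        StackTraceElement[] second = snapshot(thread.getId());
        return first.length > 0 && Arrays.equals(first, second);
    }
}
```

A watchdog thread in the client could call looksStuck on the main thread a few seconds after the GSCs are killed and print the stack returned by snapshot when the result is true.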

Stack of the main thread (the full thread dump is attached):

"main" #1 prio=5 os_prio=31 tid=0x00007fc501010800 nid=0x1703 waiting on condition [0x0000700000218000]
   java.lang.Thread.State: WAITING (parking)
    at sun.misc.Unsafe.park(Native Method)
    - parking to wait for <0x0000000766f45c38> (a java.util.concurrent.CountDownLatch$Sync)
    at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
    at java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
    at java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:997)
    at java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
    at java.util.concurrent.CountDownLatch.await(CountDownLatch.java:231)
    at com.gigaspaces.internal.remoting.RemoteOperationFutureListener.waitForCompletion(RemoteOperationFutureListener.java:120)
    at com.gigaspaces.internal.remoting.routing.partitioned.PartitionedClusterRemoteOperationRouter.executeScatterConcurrent(PartitionedClusterRemoteOperationRouter.java:326)
    at com.gigaspaces.internal.remoting.routing.partitioned.PartitionedClusterRemoteOperationRouter.execute(PartitionedClusterRemoteOperationRouter.java:115)
    at com.gigaspaces.internal.client.spaceproxy.router.SpaceProxyRouter.execute(SpaceProxyRouter.java:232)
    at com.gigaspaces.internal.client.spaceproxy.actions.SpaceProxyImplWriteAction.writeMultiple(SpaceProxyImplWriteAction.java:65)
    at com.gigaspaces.internal.client.spaceproxy.actions.SpaceProxyImplWriteAction.writeMultiple(SpaceProxyImplWriteAction.java:29)
    at com.gigaspaces.internal.client.spaceproxy.actions.AbstractSpaceProxyActionManager.writeMultiple(AbstractSpaceProxyActionManager.java:422)
    at com.gigaspaces.internal.client.spaceproxy.AbstractSpaceProxy.writeMultiple(AbstractSpaceProxy.java:483)
    at org.openspaces.core.DefaultGigaSpace.writeMultiple(DefaultGigaSpace.java:1487)
    at org.openspaces.core.DefaultGigaSpace.writeMultiple(DefaultGigaSpace.java:1477)
    at com.gigaspaces.test.MyMain.main(MyMain.java:32)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:497)
    at com.intellij.rt.execution.application.AppMain.main(AppMain.java:144)
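
The dump shows the main thread parked in an untimed CountDownLatch.await() called from waitForCompletion. The sketch below is an illustration only, not GigaSpaces code: it assumes, without confirmation from the source, that a scatter operation waits for one completion signal per partition. Under that assumption, a signal that never arrives for the killed partition leaves an untimed wait parked indefinitely, while a timed wait returns and lets the caller react. The names ScatterWaitSketch and waitForAll are hypothetical.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

// Illustration only: a stand-in for a scatter operation that waits for one
// completion signal per partition. Not the actual router implementation.
class ScatterWaitSketch {

    // Waits until every partition has signaled or the timeout elapses.
    // Returns true only if all signals arrived in time.
    static boolean waitForAll(CountDownLatch pending, long timeoutMillis) throws InterruptedException {
        return pending.await(timeoutMillis, TimeUnit.MILLISECONDS);
    }

    public static void main(String[] args) throws InterruptedException {
        int partitions = 2;
        CountDownLatch pending = new CountDownLatch(partitions);

        // Partition 0 responds normally.
        pending.countDown();
        // Partition 1 lost both primary and backup: its completion is never signaled.

        // An untimed pending.await() would park here forever, as in the dump above.
        boolean completed = waitForAll(pending, 200);
        System.out.println("completed=" + completed + ", still pending=" + pending.getCount());
    }
}
```

Whether the real fix in 12.1 uses a timeout, fails the pending request when the partition's members go down, or retries against the relocated instances is not stated in this report.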

Workaround

None

Acceptance Test

None

Status

Assignee: Unassigned
Reporter: Yuval Dori
Labels: None
Priority: Medium
SalesForce Case ID: None
Fix versions: None
Commitment Version/s: None
Due date: None
Product: None
Edition: None
Platform: All