-
Notifications
You must be signed in to change notification settings - Fork 62
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding provision to invoke stop replication from other plugins #1391
base: main
Are you sure you want to change the base?
Conversation
Signed-off-by: aggarwalShivani <[email protected]>
Signed-off-by: aggarwalShivani <[email protected]>
Signed-off-by: aggarwalShivani <[email protected]>
Also tagging @saikaranam-amazon for review |
Here's the background of making these changes to enable calling action from another plugin |
} catch (e: Exception) { | ||
log.error("Stop replication failed for index[${transformedRequest.indexName}] with error ${e.stackTraceToString()}") | ||
listener?.onFailure(e) | ||
throw e |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Probably no need to rethrow the exception here, as the exception is sent back to the caller to handle?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, you are right. I had corrected that change in my local, but missed pushing it. My bad!
I've some concerns around moving the classes out of cross cluster replication package. |
@Ankit Kala commented on Jul 15, 2024, 7:06 AM PDT:
We didn't try Option 1 and it looks promising, and we may not need a proxy transport class then since ISM base doesn't have to know anything about CCR. @aggarwalShivani some quick notes for trying it out |
Have a PR to publish ISM spi so it can be consumed from central repo And here's the commit of extend ISM in CCR
@ankitkala I tried the method of using ISM spi, but seems still have the same problem If we extend ReplicationPlugin with ISM spi, that means CCR can only be installed after ISM installed (a hard dependency introduced), and there are a lot of jar hells, probably not what we want here. If we create a new plugin in CCR, like CCR-ISM-extension plugin, it probably will also suffer the same problem that one plugin cannot use the TransportAction from another plugin directly. As plugin has their own classloaders, same class become different after being loaded by different classloaders. So I think it's better to stick with the current approach now.
It seems to be a 2 way door, moving classes around probably won't introduce any issues between versions. But let me know any concerns. |
Hi @ankitkala , We see issues with Option 1 ->
Even with option 2, it would reverse the problem on ism. And also, if we reuse classes from one plugin in another, we would see class-loading issue i.e. plugins have their own classloaders, same class become different after being loaded by different classloaders. (explained in opensearch-project/notifications#223 as well) For your concern
To add in addition to Bowen's comment, in our proposed approach, we've ensured that the current REST API users of CCR would not get impacted functionality-wise by any change. ' |
I didn't get this part. All the SPI classes are getting loaded in Opensearch process already, we should just be having it as compile time dependency right?
I went through the issue. I still have questions around why we'd want to load same class into JVM twice through different classloaders.
I definitely understand the issues with the integration and if we really need to move classes out to common utils, we'll do that. But before we do that, i just want to ensure that we've exhausted all options here :-) |
Hi @ankitkala , Thanks for your feedback..
I'll try to answer this from my understanding and @bowenlan-amzn could add-on/correct me as needed :) In the notifications example as well as in the case of this PR, the flow is - ism plugin needs to invoke actions from other plugins, for ex. SendNotificationAction or TransportStopIndexReplicationAction. These are transport actions in their respective plugins, which accept request of fixed type like StopIndexReplicationRequest. To invoke these transport action, ism would need to first construct request object of the required type (ex. StopIndexReplicationRequest), invoke the action from the other plugin and finally expect the response of type returned by that action i.e. the request class object is created in ism, but it has to be used in ccr code. OpenSearch loads each plugin by different class loader separately, which means the same class StopIndexReplicationRequest would have to be recreated/reloaded into ccr's classloader - which causes issues. Same case with response classes too. |
Hey @aggarwalShivani, what i don't understand is why we need to load the ccr classes at runtime. Compile time dependency makes sense, but at runtime, these classes are already loaded into the opensearch jvm(even if its through a different classloader). |
@Ankit Kala commented on Jul 29, 2024, 8:23 PM PDT:
Valid question. @ankitkala can your team help our contributor here to enable this workflow you are thinking? I'm not sure if CCR is publishing any jar for now. Note, based on how class loaded in OpenSearch, we will need the CCR jar load from opensearch core I guess, otherwise it probably not recognizable from other plugin as it's loaded by CCR plugin classloader? |
@Ankit Kala commented on Jul 24, 2024, 8:45 AM PDT:
If we define CCR as extended plugin of ISM (refer here), when we install the plugin, it hit this check of jarhell It requires that the extension plugin CCR to not have any duplicate dependencies as its extended plugin ISM. |
Hi @ankitkala, |
Hi, unfortunately we don't have bandwidth to pick this up. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Few minor comments.
Also, please ensure that we get a clean build by merging the common utils changes followed by rerunning the tests in this PR again.
@@ -68,7 +70,7 @@ class TransportStopIndexReplicationAction @Inject constructor(transportService: | |||
IndexNameExpressionResolver, | |||
val client: Client, | |||
val replicationMetadataManager: ReplicationMetadataManager) : |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we add a comment here explaining that the stop replication action class in moved to common utils and link this PR here?
* While TransportStopIndexReplicationAction is used directly by the _stop replication REST API, | ||
* this action is used for inter-plugin communication by ism plugin to unfollow i.e. stop replication. | ||
*/ | ||
class TransportUnfollowIndexReplicationAction @Inject constructor ( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
let's rename is to something like TransportIsmStopIndexReplicationAction
. Basically remove unfollow
and add ISM in action name to distinct it from other supported actions.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure thanks. Suggestions were given in the ISM PR to name this class as InternalTransportStopIndexReplicationAction and the action name as "indices:internal/plugins/replication/index/stop", so the name is differentiated to be something internally invoked from plugins. We hadn't added the name ism, so it remains generic to be used by something else too in future.
I hope these names shall be okay?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
TransportInternalStopIndexReplicationAction
should be fine
Hi @ankitkala , @bowenlan-amzn I'm back now and trying to revive this change request 🤞 There have been multiple changes in the three projects in this while, and I will have to validate my changes on the latest. One issue I'm facing is -> This basically happens when there is a mismatch of jdk in project and its dependency. |
Description
This is related to the feature unfollow-action #726 and is the 2nd PR in continuation of the proposed solution mentioned in common-utils-PR. Detailed information is explained there.
Change description:
Issues Resolved
Related Issues
unfollow-action #726
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.