Cluster state refresher #2429

Closed
wants to merge 3 commits

Conversation

@muhamadazmy (Contributor) commented on Dec 16, 2024

Cluster state refresher

Summary:
Simple ping mechanism to collect and maintain a local
view of cluster liveness state


Stack created with Sapling. Best reviewed with ReviewStack.
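
To make the summary concrete, here is a very rough, self-contained sketch of the idea under hypothetical names (the real implementation is the ClusterStateRefresher reviewed below): peers that have not been heard from within a few heartbeat intervals are considered dead.

use std::collections::BTreeMap;
use std::time::{Duration, Instant};

// Hypothetical, simplified stand-in for the PR's NodeTracker.
#[derive(Default)]
struct PeerTracker {
    last_seen: Option<Instant>,
}

// Derive a liveness view from the last time each peer was heard from.
// The 3x-heartbeat threshold is an arbitrary choice for this sketch.
fn liveness_view(
    peers: &BTreeMap<u64, PeerTracker>,
    heartbeat_interval: Duration,
) -> BTreeMap<u64, bool> {
    let threshold = heartbeat_interval * 3;
    peers
        .iter()
        .map(|(id, tracker)| {
            let alive = tracker
                .last_seen
                .map(|seen| seen.elapsed() < threshold)
                .unwrap_or(false);
            (*id, alive)
        })
        .collect()
}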

@muhamadazmy mentioned this pull request on Dec 16, 2024
@muhamadazmy force-pushed the pr2429 branch 6 times, most recently from fae0c62 to 0e90b2e on December 17, 2024 at 13:18
@muhamadazmy force-pushed the pr2429 branch 7 times, most recently from 835d7da to 1bf7793 on December 20, 2024 at 13:13
@muhamadazmy changed the title from "Cluster state gossiping" to "Cluster state refresher" on Dec 20, 2024
@muhamadazmy force-pushed the pr2429 branch 2 times, most recently from 3f976c4 to 9490ed2 on December 20, 2024 at 14:57
Commits:

Summary:
Deprecate the old cluster state that included information about
partition state. A new ClusterState object is introduced that only has
liveness information.

Summary:
Types used by nodes to share cluster state.

Summary:
Simple ping mechanism to collect and maintain a local
view of cluster liveness state.
networking: Networking<T>,
nodes: BTreeMap<PlainNodeId, NodeTracker>,
heartbeat_interval: Duration,
cluster_state_watch_tx: watch::Sender<Arc<ClusterState>>,
Contributor:
Do you imagine situations where users of ClusterState would be interested in waiting for a state change of the entire ClusterState, or is it more likely that they'd be interested in a certain node?

And with respect to the latter, how can this be achieved with this design?

Contributor (author):
It's actually a pretty common pattern here to wait for a certain state of the entire cluster or of a single node. While this can be accomplished by waiting on changes of the cluster state in a state machine, we could make the experience a little better by providing a wrapper on top of the watch that makes it easier to wait on more complex conditions.
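
For illustration, such a wrapper could look roughly like this. This is only a sketch: is_alive is a hypothetical accessor on the ClusterState introduced in this PR, and tokio's watch::Receiver::wait_for does the actual waiting.

use std::sync::Arc;
use tokio::sync::watch;

// Sketch: resolve once the latest published ClusterState reports the given
// node as alive. ClusterState and PlainNodeId are the types from this PR;
// is_alive(PlainNodeId) -> bool is assumed for the example.
async fn wait_for_node_alive(
    rx: &mut watch::Receiver<Arc<ClusterState>>,
    node: PlainNodeId,
) -> Result<(), watch::error::RecvError> {
    rx.wait_for(|state| state.is_alive(node)).await?;
    Ok(())
}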

Contributor:
Do you have an example use-case?

last_attempt_at: Option<MillisSinceEpoch>,
}

pub struct ClusterStateRefresher<T> {
Contributor:
It seems that this is what we traditionally call a "FailureDetector"; perhaps we can call it that to avoid confusion?

Contributor (author):
Sounds better :)

Comment on lines +27 to +28
#[derive(Debug, Clone, Serialize, Deserialize)]
pub struct NodePong {}
Contributor:
What do you imagine will be in the response, and do we actually need a response?

Contributor (author):
I see your point :) Pongs are definitely not needed as their own message type and can be dropped. The failure detector still treats it as a ping anyway.

Comment on lines +108 to +110
_ = &mut cancelled => {
break;
}
Contributor:
todo: this is possibly a good place to inform peers that you are shutting down?

Comment on lines +75 to +81
tokio::select! {
result = cluster_state_refresher.run() => {
result
}
_ = cancelled => {
Ok(())
}
Contributor:
doesn't the refresher itself monitor the cancellation token?

Contributor (author):
Yes, true. This is a mistake.

networking,
nodes: BTreeMap::default(),
heartbeat_interval: config.common.heartbeat_interval.into(),
cluster_state_watch_tx: watch::Sender::new(Arc::new(ClusterState::empty())),
Contributor:
Is there a reason why ClusterState is in an Arc?

Contributor (author):
Borrowed values from a watch hold a read lock, which means you should only borrow for very short periods of time and never across await points. Hence I think using an Arc is safer: we can cheaply clone the cluster state and release the borrow, and then pass this state snapshot around or use it across await points.
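
As a small illustration of that pattern (a sketch, assuming only the ClusterState type from this PR and tokio's watch channel):

use std::sync::Arc;
use tokio::sync::watch;

// Clone the Arc out of the watch borrow; the internal read lock is released
// as soon as the temporary borrow is dropped, and the returned snapshot is
// cheap to pass around or hold across await points.
fn cluster_state_snapshot(rx: &watch::Receiver<Arc<ClusterState>>) -> Arc<ClusterState> {
    Arc::clone(&rx.borrow())
}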

Comment on lines +267 to +276
async fn on_pong(&mut self, mut msg: Incoming<NodePong>) -> Result<(), ShutdownError> {
msg.follow_from_sender();

trace!("Handling pong response");

let tracker = self.nodes.entry(msg.peer().as_plain()).or_default();
tracker.seen = Some(SeenState::new(msg.peer()));

Ok(())
}
Contributor:
I'm really not sure if pong is needed. Nodes ping other nodes, and each node makes its own view based on the pings it has received.

Contributor (author):
Yes, makes sense!
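
For illustration, with NodePong dropped, the handler above could shrink to a ping-only variant along these lines (a sketch mirroring the on_pong code in the diff; NodePing is assumed here to be the ping message type used by the refresher):

// Sketch: the incoming ping itself is the liveness signal, so we only record
// when we last heard from the peer and send nothing back.
async fn on_ping(&mut self, mut msg: Incoming<NodePing>) -> Result<(), ShutdownError> {
    msg.follow_from_sender();

    trace!("Handling ping");

    let tracker = self.nodes.entry(msg.peer().as_plain()).or_default();
    tracker.seen = Some(SeenState::new(msg.peer()));

    Ok(())
}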


pub struct BaseRole {
processor_manager_handle: Option<ProcessorsManagerHandle>,
incoming_node_state: MessageStream<GetNodeState>,
processors_state_request_stream: MessageStream<GetPartitionsProcessorsState>,
Contributor:
Should this move to PPM?

Contributor:
If so, do we still need BaseRole or should we remove it?

Contributor (author), @muhamadazmy, Dec 23, 2024:
It will eventually be deleted, but right now it still handles GetPartitionsProcessorsState messages needed for CC operation.

@muhamadazmy deleted the pr2429 branch on December 26, 2024 at 14:23