Add scaling docs for V2 #4661

ukclivecox · 2023-02-13T09:14:22Z

Adds initial scaling docs section for V2.

sakoush

LGTM. Left minor comments. Should we state which components can be scaled dynamically during operations if not all?

sakoush · 2023-02-13T10:20:34Z

docs/source/contents/kubernetes/scaling/index.md

+  replicas: 3
+```
+
+The number of replicas will need not to exceed the replicas of the Server the model is scheduled to.


Suggested change

The number of replicas will need not to exceed the replicas of the Server the model is scheduled to.

Currently, the number of replicas will need not to exceed the replicas of the Server the model is scheduled to.

sakoush · 2023-02-13T10:22:23Z

docs/source/contents/kubernetes/scaling/index.md

+  serverConfig: mlserver
+```
+
+Models scheduled to a server can only scale up to the server replica count.


Suggested change

Models scheduled to a server can only scale up to the server replica count.

Currently, models scheduled to a server can only scale up to the server replica count.

sakoush · 2023-02-13T10:23:18Z

docs/source/contents/kubernetes/scaling/index.md

+
+## Internal Components
+
+Seldon core v2 runs with several control and dataplane components. The scaling of these resurces is discussed below:


Suggested change

Seldon core v2 runs with several control and dataplane components. The scaling of these resurces is discussed below:

Seldon Core v2 runs with several control and dataplane components. The scaling of these resurces is discussed below:

sakoush · 2023-02-13T10:24:49Z

docs/source/contents/kubernetes/scaling/index.md

+- Model gateway.
+    - This component pulls model requests from Kafka and sends them to inference servers. It can be scaled up to the partition factor of your Kafka topics. At present we set a uniform partition factor for all topics in one installation of Seldon Core V2.
+- Dataflow engine.
+    - The dataflow engine runs KStream topologies to manage Pipelines. It can run as multiple replicas and the scheduler will balance Pipelines to run across it with a consistent hashing load balancer with each Pipeline managed up to the partition factor of Kafka (presently hardwired to one).


Suggested change

- The dataflow engine runs KStream topologies to manage Pipelines. It can run as multiple replicas and the scheduler will balance Pipelines to run across it with a consistent hashing load balancer with each Pipeline managed up to the partition factor of Kafka (presently hardwired to one).

- The dataflow engine runs KStream topologies to manage Pipelines. It can run as multiple replicas and the scheduler will balance Pipelines to run across it with a consistent hashing load balancer. Each Pipeline is managed up to the partition factor of Kafka (presently hardwired to one).

sakoush · 2023-02-13T10:26:02Z

docs/source/contents/kubernetes/scaling/index.md

+- Scheduler.
+    - This manages the control plane operations. It is presently required to be one replica as it maintains internal state within a BadgerDB held on local persistent storage (stateful set in Kubernetes). Performance tests have shown this not to be a bottleneck at present.
+- Kubernetes Controller.
+    - The Kubernetes controller manages resources updates on the cluster which it passes on to the Scheduler, It is by default one replica but has the ability to scale.


Suggested change

- The Kubernetes controller manages resources updates on the cluster which it passes on to the Scheduler, It is by default one replica but has the ability to scale.

- The Kubernetes controller manages resources updates on the cluster which it passes on to the Scheduler. It is by default one replica but has the ability to scale.

sakoush · 2023-02-13T10:28:44Z

Should we also add a todo section about any upcoming work that can help with scalability?

Add scaling docs

a2dfde1

ukclivecox added the v2 label Feb 13, 2023

ukclivecox requested review from sakoush and agrski February 13, 2023 09:14

sakoush approved these changes Feb 13, 2023

View reviewed changes

review comments

551acca

ukclivecox merged commit e000c05 into SeldonIO:v2 Feb 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add scaling docs for V2 #4661

Add scaling docs for V2 #4661

ukclivecox commented Feb 13, 2023

sakoush left a comment

sakoush Feb 13, 2023

sakoush Feb 13, 2023

sakoush Feb 13, 2023

sakoush Feb 13, 2023

sakoush Feb 13, 2023

sakoush commented Feb 13, 2023

	The number of replicas will need not to exceed the replicas of the Server the model is scheduled to.
	Currently, the number of replicas will need not to exceed the replicas of the Server the model is scheduled to.

	Models scheduled to a server can only scale up to the server replica count.
	Currently, models scheduled to a server can only scale up to the server replica count.


		## Internal Components

		Seldon core v2 runs with several control and dataplane components. The scaling of these resurces is discussed below:

	- The dataflow engine runs KStream topologies to manage Pipelines. It can run as multiple replicas and the scheduler will balance Pipelines to run across it with a consistent hashing load balancer with each Pipeline managed up to the partition factor of Kafka (presently hardwired to one).
	- The dataflow engine runs KStream topologies to manage Pipelines. It can run as multiple replicas and the scheduler will balance Pipelines to run across it with a consistent hashing load balancer. Each Pipeline is managed up to the partition factor of Kafka (presently hardwired to one).

	- The Kubernetes controller manages resources updates on the cluster which it passes on to the Scheduler, It is by default one replica but has the ability to scale.
	- The Kubernetes controller manages resources updates on the cluster which it passes on to the Scheduler. It is by default one replica but has the ability to scale.

Add scaling docs for V2 #4661

Add scaling docs for V2 #4661

Conversation

ukclivecox commented Feb 13, 2023

sakoush left a comment

Choose a reason for hiding this comment

sakoush Feb 13, 2023

Choose a reason for hiding this comment

sakoush Feb 13, 2023

Choose a reason for hiding this comment

sakoush Feb 13, 2023

Choose a reason for hiding this comment

sakoush Feb 13, 2023

Choose a reason for hiding this comment

sakoush Feb 13, 2023

Choose a reason for hiding this comment

sakoush commented Feb 13, 2023