
FEAT-#7202: Use custom resources for Ray #7205

Merged: 7 commits merged into modin-project:main on Apr 23, 2024

Conversation

@YarShev (Collaborator) commented Apr 19, 2024

What do these changes do?

  • first commit message and PR title follow format outlined here

    NOTE: If you edit the PR title to match this format, you need to add another commit (even if it's empty) or amend your last commit for the CI job that checks the PR title to pick up the new PR title.

  • passes flake8 modin/ asv_bench/benchmarks scripts/doc_checker.py
  • passes black --check modin/ asv_bench/benchmarks scripts/doc_checker.py
  • signed commit with git commit -s
  • Resolves Use custom resources for Ray to schedule a task on a concrete node #7202
  • tests passing
  • module layout described at docs/development/architecture.rst is up-to-date

@anmyachev (Collaborator)

@YarShev do we need to adjust the procedure for determining the total number of available cores, depending on these custom resources?

@YarShev (Collaborator, Author) commented Apr 20, 2024

Custom resources have nothing to do with num_cpus, so no adjustment is needed.

@anmyachev (Collaborator)

> Custom resources have nothing to do with num_cpus, so no adjustment is needed.

Isn’t it possible to use these resources to limit the number of nodes on which computations will run? If so, we would end up splitting the data into far more partitions than can be executed in parallel.
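
To make the concern concrete, here is a minimal pure-Ray sketch (not Modin code; the resource name and numbers are illustrative) of how a per-task custom-resource requirement caps concurrency no matter how many partitions exist:

import time

import ray

# The node advertises 8 CPUs but only 1 unit of "special_hardware".
ray.init(num_cpus=8, resources={"special_hardware": 1})

@ray.remote(resources={"special_hardware": 1})
def work(i):
    time.sleep(1)
    return i

# All 8 CPUs are idle, yet only one task can hold the whole "special_hardware"
# unit at a time, so these 8 tasks run one after another (~8 seconds total).
start = time.time()
ray.get([work.remote(i) for i in range(8)])
print(f"elapsed: {time.time() - start:.1f}s")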

@YarShev (Collaborator, Author) commented Apr 22, 2024

Your comment pointed me to a problem in the current setup: if the user sets resources={"special_hardware": 1}, we pass this parameter as is to remote functions, which limits parallelism to a single remote task at a time. I would preprocess the resources as follows before passing them to remote functions. What do you think?

resources_per_task = {}

# Iterate over the configured custom resources; v / v / num_cpus == 1 / num_cpus
# of each resource per task, so up to num_cpus such tasks can run concurrently.
for k, v in RayCustomResources.get().items():
    resources_per_task[k] = v / v / num_cpus
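
A self-contained sketch of how this preprocessing plays out in plain Ray (a dict stands in for the config; every name here is illustrative): each task requests only 1 / num_cpus of every configured resource, so up to num_cpus tasks can hold the resource at once instead of serializing.

import ray

num_cpus = 8
user_resources = {"special_hardware": 1}  # what the user configured

ray.init(num_cpus=num_cpus, resources=user_resources)

# Per-task share: v / v / num_cpus == 1 / num_cpus of each resource.
resources_per_task = {k: v / v / num_cpus for k, v in user_resources.items()}

@ray.remote
def work(i):
    return i

# Each task asks for 0.125 of "special_hardware", so 8 of them fit on the
# node concurrently rather than one at a time.
futures = [work.options(resources=resources_per_task).remote(i) for i in range(32)]
print(ray.get(futures))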

modin/config/envvars.py (outdated review thread, resolved)
anmyachev previously approved these changes Apr 22, 2024
@@ -126,6 +127,7 @@ def initialize_ray(
     "object_store_memory": object_store_memory,
     "_redis_password": redis_password,
     "_memory": object_store_memory,
+    "resources": RayInitCustomResources.get(),
anmyachev (Collaborator):

Can we add a test for this case?

YarShev (Collaborator, Author):

What exactly would you like to test with this?

anmyachev (Collaborator):

We do not currently test the situation where this config is set to something other than None.
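
A hedged sketch of what such a test might look like, assuming the config is exposed from modin.config with the usual .put() setter and that Modin initializes Ray lazily on first use (both are assumptions here, not verified against the final code):

import ray

import modin.config as cfg
import modin.pandas as pd

# Set the custom resources before Ray is initialized by Modin (assumed config name).
cfg.RayInitCustomResources.put({"special_hardware": 1})

# Creating a DataFrame triggers engine initialization.
df = pd.DataFrame({"a": [1, 2, 3]})

# The custom resource should now be advertised by the cluster.
assert "special_hardware" in ray.cluster_resources()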

Comment on lines +331 to +332
>>> with context(RayTaskCustomResources={"special_hardware": 0.001}):
... df.<op>
anmyachev (Collaborator):

It's good that there is now an option to limit concurrency, but it only works for Ray. Let's create an issue for the rest of the engines.
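
For the arithmetic behind the docstring example above: if a node advertises 1 unit of "special_hardware" and each task requests 0.001 units, at most 1000 such tasks can run on that node at once. A hedged usage sketch (the exact import paths and the .put() setter are assumptions):

import modin.config as cfg
import modin.pandas as pd
from modin.config import context

# Nodes must advertise the resource for the per-task requests to be satisfiable.
cfg.RayInitCustomResources.put({"special_hardware": 1})

df = pd.DataFrame({"a": range(10_000)})

# 1 unit available / 0.001 per task => at most 1000 concurrent tasks.
with context(RayTaskCustomResources={"special_hardware": 0.001}):
    result = df + 1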

YarShev (Collaborator, Author):

This config is not generic but specific to Ray, and I am not sure there is a way to limit concurrency for the other engines. If we want something similar for them, we can open an issue and explore the options then. Do you still think we should create an issue now?

anmyachev (Collaborator):

Limiting concurrency via context looks like a good feature for an advanced user. We can create a low-priority issue for now.

self.func, self.data, *self.args, **self.kwargs
)
result, length, width, ip = remote_exec_func.options(
resources=RayTaskCustomResources.get()
anmyachev (Collaborator):

I wonder if we should just call it RayTaskResources (and likewise RayInitResources), since this config is used to pass values to resources.

YarShev (Collaborator, Author):

I would prefer to be explicit here, as Ray itself calls these custom resources - https://docs.ray.io/en/latest/ray-core/scheduling/resources.html#custom-resources.

anmyachev merged commit 71b8da4 into modin-project:main on Apr 23, 2024.
46 checks passed.