@tphung3 tphung3 commented Dec 16, 2024

Description

This PR introduces TaskVine's function context feature to the TaskVineExecutor. In short, a regular function can now specify a computational context to be shared across multiple invocations of the same function, allowing drastic improvements in execution performance.

For example, machine learning models, especially LLMs, incur a large model-creation overhead for each inference. Instead of coupling model creation and inference in the same function, a user can now specify the model creation as the context of the actual inference function, so the model-creation cost is paid once rather than on every invocation.
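As a rough illustration of the idea in plain Python (this is not the actual TaskVine API; all names here are hypothetical), the expensive setup is factored out as a "context" that is created once and reused by every invocation:

```python
# Illustrative sketch only -- not the real TaskVine function-context API.
# The expensive setup (e.g. loading a model) is factored out as a
# "context" that is created once and shared by all invocations.

creation_count = 0  # track how many times the expensive setup runs

def load_model():
    """Expensive one-time setup, e.g. loading LLM weights."""
    global creation_count
    creation_count += 1
    return {"weights": [1, 2, 3]}

_context = None  # shared across invocations of the same function

def infer(x):
    """Cheap per-call work that reuses the shared context."""
    global _context
    if _context is None:
        _context = load_model()  # paid only on the first call
    return sum(_context["weights"]) + x

results = [infer(i) for i in range(5)]
# load_model() ran once, even though infer() ran five times.
```

With the context feature, TaskVine plays the role of the `_context` cache above: the context lives on the worker and is shared across invocations of the same function.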

Helpful blog: https://cclnd.blogspot.com/2025/10/reducing-overhead-of-llm-integrated.html.

Tests are added to make sure the feature works as intended.

Changed Behaviour

TaskVineExecutor now has a new feature allowing functions to specify computational contexts to be shared.

Type of change

  • New feature

@tphung3 tphung3 marked this pull request as ready for review November 9, 2025 04:20
@tphung3 tphung3 requested a review from benclifford November 9, 2025 04:21
@tphung3 tphung3 changed the title from "WIP: Optimize TaskVineExecutor" to "Introduce Function Context Feature to TaskVineExecutor" on Nov 9, 2025


@require_taskvine
@pytest.mark.taskvine
Collaborator

this mark here is what lets you specify you don't want to test taskvine

Contributor Author

Can you please clarify? I thought @pytest.mark.taskvine specifies that this test is only to be run with TaskVineExecutor, or does it have other meanings?

@pytest.mark.taskvine
@pytest.mark.parametrize('num_tasks', (1, 50))
def test_function_context_computation(num_tasks, current_config_name):
    if current_config_name != 'taskvine_ex':
Collaborator

if you want to test against a specific configuration, have a look at tests that are marked @pytest.mark.local and call parsl.load() with their own configuration, rather than relying on some ambient environment we don't expect the feature to work in.

That would be more consistent with existing tests. parsl/tests/test_monitoring/test_basic.py is a complicated example. or parsl/tests/test_htex/test_priority_queue.py

Contributor Author

There's nothing special about the configuration: this test runs with parsl/tests/configs/taskvine_ex.py. The check is my way of saying that this test should only run with the TaskVineExecutor rather than with the thread pool, htex, etc. Using only @pytest.mark.taskvine didn't work for me.

        while written < len(serialized_obj):
            written += f_out.write(serialized_obj[written:])

    def _cloudpickle_serialize_object_to_file(self, path, obj):
Collaborator

we talked about this somewhere before but I can't remember where: you should be using the parsl serialization libraries, not cloudpickle, unless you have a specific reason that needs different serialization.

Contributor Author

The object I serialize is a list containing a function and other Python objects. https://github.com/Parsl/parsl/pull/3724/files#diff-c5ce2bce42f707d31639e986d8fea5c00d31b5eead8fa510f7fe7e3181e67ccfR458-R461

Because it is a list, Parsl's serializer uses methods_for_data to serialize it, which eventually falls back to pickle, and pickle can't serialize a function by value. So I'm using cloudpickle serialization only for this case. What do you think?
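A minimal stdlib-only illustration of the limitation being discussed (the function names here are made up for the example): pickle serializes functions by reference, i.e. by module and qualified name, so a function defined inside another function cannot be pickled at all, whereas cloudpickle would serialize its code and closure by value.

```python
import pickle

def make_adder(n):
    # A locally defined function: pickle stores functions by reference
    # (module + qualified name), and "make_adder.<locals>.adder" is not
    # importable, so pickling it fails.
    def adder(x):
        return x + n
    return adder

f = make_adder(3)

try:
    pickle.dumps(f)
    pickled_ok = True
except (pickle.PicklingError, AttributeError):
    pickled_ok = False
# cloudpickle.dumps(f) would succeed here, because cloudpickle
# serializes the function's code object and closure by value.
```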

-if not lib_installed:
-    # Declare and install common library for serverless tasks.
+if task.func_name not in libs_installed:
+    # Declare and install one library for serverless tasks per category, and vice versa.
Collaborator

is this one library per function, not per category?

Contributor Author

Yes, one library containing one function, not one per category. I think many functions per library also works in certain cases, but there are cases where it doesn't work naively: for example, functions A and B each load a huge LLM onto a GPU, and the node has only one GPU, so the library can't host both A and B simultaneously.

@benclifford
Collaborator

This runs serverless functions several times faster than current Parsl master, when I measure with parsl-perf. 722 tasks per second vs 240 tasks per second on a 10000 task batch. I'm not clear why though.

@tphung3
Contributor Author

tphung3 commented Nov 20, 2025

> This runs serverless functions several times faster than current Parsl master, when I measure with parsl-perf. 722 tasks per second vs 240 tasks per second on a 10000 task batch. I'm not clear why though.

This bypasses the overhead of run_parsl_function, and the library hosts a given function in its address space on the remote node. So a function is now serialized, shipped, and deserialized on the remote node once, then invoked multiple times, instead of paying one serialization/deserialization per invocation.

https://github.com/Parsl/parsl/pull/3724/files#diff-394c24a1ea1b5e8b91de1f0725846f311d12ed8ef0dd496360335078855b72acL288-R336

This also caches some of the serialization cost.

https://github.com/Parsl/parsl/pull/3724/files#diff-c5ce2bce42f707d31639e986d8fea5c00d31b5eead8fa510f7fe7e3181e67ccfL413-R476
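The caching idea can be sketched in a few lines (illustrative only; the PR itself uses TaskVine's machinery and cloudpickle, and `serialize_once` is a made-up name for this sketch):

```python
import pickle

# Illustrative sketch of caching serialization cost: serialize each
# function once and reuse the bytes for every subsequent submission,
# instead of re-serializing it per invocation.

_serialized = {}  # (module, qualname) -> serialized bytes

def serialize_once(fn):
    key = (fn.__module__, fn.__qualname__)
    if key not in _serialized:
        _serialized[key] = pickle.dumps(fn)  # cost paid only once
    return _serialized[key]

def square(x):
    return x * x

a = serialize_once(square)  # serializes
b = serialize_once(square)  # cache hit: returns the same bytes object
```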

@tphung3 tphung3 requested a review from benclifford November 20, 2025 17:47