Spin Improvement Proposal: Modularizing spin.toml #3073

tschneidereit · 2025-04-01T17:10:28Z

This PR adds a SIP describing changes that'd allow developers to split the spin.toml manifest up into a main application manifest and sub-manifests for individual contained components.

Signed-off-by: Till Schneidereit [email protected]

This PR adds a SIP describing changes that'd allow developers to split the `spin.toml` manifest up into a main application manifest and sub-manifests for individual contained components. Signed-off-by: Till Schneidereit <[email protected]>

docs/content/sips/022-modularizing-spin-manifests.md

itowlson

My sense here is of a slightly uneasy tension between "I'm just providing a handy-dandy way to split an app across files" vs "I'm providing a way to define rich components in isolation from an app." There are a few things in the proposal that say "rich component in isolation" but there are others that say "just splitting an app." The reason this matters is that isolated components imply different sandboxing behaviours from split apps - e.g. should a sub-manifest have access to application variables.

It might also be good to reorganise a bit so that the motivation for some of the design decisions is more visible from where the design decisions are listed.

itowlson · 2025-04-03T20:45:25Z

docs/content/sips/022-modularizing-spin-manifests.md

+- Triggers must be defined in the top-level manifest
+- Variables can be contained in all manifests
+    - A variable defined in the top-level manifest is visible in all manifests
+    - A variable defined in a sub-manifest is only visible in that manifest


It would be good to elaborate on this scoping rule, as it requires Spin to track visibility in a way we currently don't. At the moment, spin up assembles everything into a single lockfile, which is passed to the trigger: components' template values are resolved when the trigger loads the lockfile. Enforcing visibility rules seems like it would mean either multiple lockfiles or tracking provenance in the lockfile. Maybe we can get around it by munging names, but then we need to map them back again to get values from providers.

Another facet of this (and, I'd guess, part of the motivation) is what happens if two sub-manifests define the same variable name. The proposed scoping rule appears to allow this, which is good because it means I can pull in off-the-shelf sub-manifests and not worry about them clashing. But again we need to better understand how we envisage representing that name in the lockfile.

In theory, I gues we could unify variables with the same name, since they will end up backing onto the same provider-level value (e.g. environment variable or Vault key). But in practice different defaults will make this tricky.

What is our collective appetite 🌯 for revving the locked app format to codify these visibility rules ? In any case, I agree that this rule needs to be ironed out a bit.

I think my vote would be to drop these scoping rules for now and require variables be defined in the root manifest.

@fibonacci1729 I think @tschneidereit is working on something that elaborates on his long-term "north star" thinking, which I believe is what is motivating these rules. I feel we should hold off until we see that and can decide whether to aim for it directly (at the cost of possibly revving the lockfile) or to do something expedient to solve the immediate problem (at the expense of possible rework and possible compatibility quirks when we do move towards the aspiration).

Specifically I get hints from this document of a vision of sub-manifested components as units of packaging, so that this becomes more than spicy lexical inclusion. And if so (big if!), there is some conceptual stuff we need to understand about sandboxing and permissions and scoping (e.g. can a component sniff values from the top level without an explicit grant, how do we namespace stuff so components don't need to coordinate on naming, etc.). I may be over-inferring though!

itowlson · 2025-04-03T20:46:18Z

docs/content/sips/022-modularizing-spin-manifests.md

+    - A variable defined in the top-level manifest is visible in all manifests
+    - A variable defined in a sub-manifest is only visible in that manifest
+- Components can be defined in the top-level, and in sub-manifests
+    - Only a single component can be defined in each sub-manifest


It would be useful to say more about the reason for this restriction.

(ETA: you do, but much later - consider a forward link.)

itowlson · 2025-04-03T20:48:09Z

docs/content/sips/022-modularizing-spin-manifests.md

+
+- Manifests can only be nested one level deep: an application can have a single top-level manifest, and zero or more direct sub-manifests, which cannot have any sub-manifests themselves
+- Application-level metadata must fully be contained in the application’s top-level manifest
+- Triggers must be defined in the top-level manifest


It would be useful to say more about the reason for this restriction.

I'm getting the sense that you have a mental model for a concrete thing that a sub-manifest represents, that motivates some of the decisions here? If so it would be useful to talk about it here, even if it is distant and aspirational!

docs/content/sips/022-modularizing-spin-manifests.md

itowlson · 2025-04-03T20:53:35Z

docs/content/sips/022-modularizing-spin-manifests.md

+source = "target/spin-http-js.wasm"
+allowed_outbound_hosts = []
+exclude_files = ["**/node_modules"]
+key_value_stores = ["default"]


Permissions is an interesting one and this is another case where I'd find it useful to hear your long-term vision for this. If this is purely, purely, to help a developer organise a big file, then this is not a worry. But talking about "authors" and "version" fields implies that you envision an ecosystem and distribution mechanism for sub-manifests (dare I call them... components). And in that case it maybe feels like the application should be in charge of granting permissions. I don't want to upgrade "EvilComponent" and discover too late that it's invited itself to my production database...!

itowlson · 2025-04-03T21:00:12Z

docs/content/sips/022-modularizing-spin-manifests.md

+
+1. Triggers are application-level concerns; they describe the application’s structure and high-level organization. As such, keeping them “at the surface” seems right.
+2. Trigger definitions tend to be very short, so moving them into sub-manifests individually as is done for components seems very boilerplate-y.
+3. Trigger definitions don’t come with their own assets, so any folder containing a sub-manifest for them would only ever contain the manifest, and nothing else.


Mostly triggers don't come with component-related config, but we do have an oddity at the moment where WAGI execution has to be specified on the trigger not the component even though it's the component interface that mandates it. (I think this is to allow different trigger to set different argvs on the same component. Not sure though.)

itowlson · 2025-04-03T21:06:06Z

docs/content/sips/022-modularizing-spin-manifests.md

+
+### Inheriting definitions from the application manifest
+
+Similar to [crates in a Cargo workspace](https://doc.rust-lang.org/1.85.1/cargo/reference/workspaces.html#the-package-table), component manifests can inherit certain definitions from the application manifest, such as the version, description, and authors:


Perhaps say more about why you feel this would be useful? Cargo cares because crates are units of distribution and deployment.

itowlson · 2025-04-03T21:08:22Z

docs/content/sips/022-modularizing-spin-manifests.md

+...
+```
+
+Any uses in template strings are adapted to use the prefixed variable names.


Is this saying that Spin has to rewrite template strings to the new names as part of lockfile creation?

How does this affect providers? Do secrets in Vault need to use the scoped name?

itowlson · 2025-04-03T21:09:37Z

docs/content/sips/022-modularizing-spin-manifests.md

+...
+# Inherit the `my_variable` variable from the application
+[variables]
+my_variable.application = true


Why? This seems like busywork.

itowlson · 2025-04-03T21:19:41Z

docs/content/sips/022-modularizing-spin-manifests.md

+
+In component manifests, the role of the latter is taken by the new `[variables.exposed]` table.
+
+*Note: we should evaluate whether this table is really needed. Can we instead just expose all variables in a component manifest’s `[variables]` table to content directly?*


I don't think we can. The term "variable" has two meanings:

A configuration knob exposed to the operator

A value exposed to the guest

The decoupling is not just for sandboxing, but so that configuration knobs are not required to line up with what the component sees. E.g. if I publish a component which looks at a "db-url" value, and you publish a component that looks at "db-host" and "db-port" values, the decoupling allows the app developer to define two knobs for the operator, which the application surfaces in different ways to meet different component expectations.

This may have turned out to be generality that we don't need (or won't need until a broader component ecosystem evolves and uses wasi-config), but I'd be cautious about backing it out, and doubly cautious about backing it out for the special case of sub-manifests. It may be that sub-manifest local variables are so tightly internally coupled to their components that it's a reasonable call though - just want to be cautious in the evaluation!

itowlson · 2025-04-03T21:33:36Z

I should add that I do like the implications of isolation rather than mere textual splitting. One of the challenges I ran into with the previous attempt at this (which was basically naive inclusion) was providing good locations for errors. Being able to parse and analyse sub-manifests separately, and combine them mostly-reliably only once parsed, should address that nicely. I am very keen that we think about how to give good pointers in failure cases in this work!

seungjin · 2025-04-03T23:16:24Z

Once I switched my app to a microservice-style structure, my spin.toml became bloated and complicated.
I even considered creating my own spin.toml compiler to generate a final configuration from smaller spin.toml files at build time.

docs/content/sips/022-modularizing-spin-manifests.md

kate-goldenring · 2025-04-14T17:43:46Z

docs/content/sips/022-modularizing-spin-manifests.md

+description = "A sentiment analysis API that demonstrates using LLM inference and
+
+# Components defined in sub-manifests must be included in this list.
+# Entries are treated as folder names, relative to the folder containing the application manifest.


So the directory would be laid out like follows?

├── spin.toml ├── sentiment-analysis │ └─── spin.toml └── kv-explorer └─── spin.toml

Is there a way we could allow for file exact paths, too?

kate-goldenring · 2025-04-14T17:45:22Z

docs/content/sips/022-modularizing-spin-manifests.md

+# Certain keys can be inherited from the application manifest
+authors.application = true
+version.application = true
+source = "target/spin-http-js.wasm"


It feels weird to me to have this be relative to the sub manifest. Shouldn't this already be defined in the sub manifest?

kate-goldenring · 2025-04-14T17:49:38Z

docs/content/sips/022-modularizing-spin-manifests.md

+version.application = true
+```
+
+### Handling of variables


I am having a hard time understanding what make variables unique over other resources, such as environment variables, files, and kv stores. Can you explain why variables are component scoped but other resources are not?

fibonacci1729 · 2025-04-16T16:54:52Z

I'm going to start an in-parallel WIP for this SIP as it currently stands.

Co-authored-by: Kate Goldenring <[email protected]>

fibonacci1729 · 2025-04-22T21:14:42Z

docs/content/sips/022-modularizing-spin-manifests.md

+
+`spin.toml` files contain multiple different kinds of items: application-level metadata, variables, trigger definitions, and component definitions. I propose to support modularizing manifests in the following way:
+
+- Manifests can only be nested one level deep: an application can have a single top-level manifest, and zero or more direct sub-manifests, which cannot have any sub-manifests themselves


What's the motivation for restricting to 1-level nesting?

tschneidereit force-pushed the sip-modular-manifests branch from 0bbb647 to 3d0edb8 Compare April 1, 2025 17:13

fibonacci1729 reviewed Apr 3, 2025

View reviewed changes

docs/content/sips/022-modularizing-spin-manifests.md Outdated Show resolved Hide resolved

itowlson requested changes Apr 3, 2025

View reviewed changes

kate-goldenring reviewed Apr 14, 2025

View reviewed changes

fibonacci1729 and others added 2 commits April 22, 2025 13:45

Update docs/content/sips/022-modularizing-spin-manifests.md

29c0a62

Co-authored-by: Kate Goldenring <[email protected]>

Update docs/content/sips/022-modularizing-spin-manifests.md

3833735

Co-authored-by: Kate Goldenring <[email protected]>

fibonacci1729 reviewed Apr 22, 2025

View reviewed changes

itowlson mentioned this pull request May 2, 2025

Support workflows for creating library components inside a Spin application using them #2982

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spin Improvement Proposal: Modularizing spin.toml #3073

Spin Improvement Proposal: Modularizing spin.toml #3073

tschneidereit commented Apr 1, 2025

itowlson left a comment

itowlson Apr 3, 2025

fibonacci1729 Apr 22, 2025 •

edited

Loading

fibonacci1729 Apr 22, 2025 •

edited

Loading

itowlson Apr 27, 2025

itowlson Apr 3, 2025

itowlson Apr 3, 2025

itowlson Apr 3, 2025

itowlson Apr 3, 2025

itowlson Apr 3, 2025

itowlson Apr 3, 2025

itowlson Apr 3, 2025

itowlson Apr 3, 2025

itowlson commented Apr 3, 2025

seungjin commented Apr 3, 2025

kate-goldenring Apr 14, 2025

kate-goldenring Apr 14, 2025

kate-goldenring Apr 14, 2025

fibonacci1729 commented Apr 16, 2025

fibonacci1729 Apr 22, 2025


		### Inheriting definitions from the application manifest

		Similar to [crates in a Cargo workspace](https://doc.rust-lang.org/1.85.1/cargo/reference/workspaces.html#the-package-table), component manifests can inherit certain definitions from the application manifest, such as the version, description, and authors:


		In component manifests, the role of the latter is taken by the new `[variables.exposed]` table.

		Note: we should evaluate whether this table is really needed. Can we instead just expose all variables in a component manifest’s `[variables]` table to content directly?


		`spin.toml` files contain multiple different kinds of items: application-level metadata, variables, trigger definitions, and component definitions. I propose to support modularizing manifests in the following way:

		- Manifests can only be nested one level deep: an application can have a single top-level manifest, and zero or more direct sub-manifests, which cannot have any sub-manifests themselves

Spin Improvement Proposal: Modularizing spin.toml #3073

Are you sure you want to change the base?

Spin Improvement Proposal: Modularizing spin.toml #3073

Conversation

tschneidereit commented Apr 1, 2025

itowlson left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fibonacci1729 Apr 22, 2025 • edited Loading

Choose a reason for hiding this comment

fibonacci1729 Apr 22, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

itowlson commented Apr 3, 2025

seungjin commented Apr 3, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fibonacci1729 commented Apr 16, 2025

Choose a reason for hiding this comment

fibonacci1729 Apr 22, 2025 •

edited

Loading

fibonacci1729 Apr 22, 2025 •

edited

Loading