Commit 5b4fbb9

removed InferenceModel section from EPP protocol (#1389)
Signed-off-by: Nir Rozenbaum <[email protected]>
1 parent 83e9cae commit 5b4fbb9

File tree

1 file changed: +0 -3 lines changed
  • docs/proposals/004-endpoint-picker-protocol

docs/proposals/004-endpoint-picker-protocol/README.md

Lines changed: 0 additions & 3 deletions
@@ -82,6 +82,3 @@ This metadata is required because the EPP provides a list of endpoints to the da
 
 ### Why envoy.lb namespace as a default?
 The `envoy.lb` namespace is a predefined namespace. One common way to use the selected endpoint returned from the server, is [envoy subsets](https://www.envoyproxy.io/docs/envoy/latest/intro/arch_overview/upstream/load_balancing/subsets) where host metadata for subset load balancing must be placed under `envoy.lb`. Note that this is not related to the subsetting feature discussed above, this is an enovy implementation detail.
-
-## Matching An InferenceModel
-The model name of a request MUST match the `Spec.ModelName` parameter of one of the `InferenceModels` referencing the `InferencePool` managed by the EPP. Otherwise, the EPP MUST return a 404 status code.
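
For readers who want a concrete picture of the requirement this commit removes, the following is a minimal sketch of the matching rule as the deleted text words it. It is not taken from the EPP codebase: the `InferenceModel` struct, its `PoolRef` field, and the `matchModel` helper are hypothetical stand-ins introduced only for illustration.

```go
package main

import (
	"fmt"
	"net/http"
)

// InferenceModel is a hypothetical, trimmed-down stand-in for the real
// resource. Only the two fields the removed rule talks about are modeled.
type InferenceModel struct {
	ModelName string // plays the role of Spec.ModelName
	PoolRef   string // assumed: name of the InferencePool this model references
}

// matchModel returns the HTTP status implied by the removed rule:
// 200 if some InferenceModel referencing the pool has the requested
// model name, 404 otherwise.
func matchModel(models []InferenceModel, pool, requested string) int {
	for _, m := range models {
		if m.PoolRef == pool && m.ModelName == requested {
			return http.StatusOK
		}
	}
	return http.StatusNotFound
}

func main() {
	// Illustrative data only; the names are made up.
	models := []InferenceModel{
		{ModelName: "model-a", PoolRef: "pool-1"},
		{ModelName: "model-b", PoolRef: "pool-2"},
	}
	fmt.Println(matchModel(models, "pool-1", "model-a"))
	fmt.Println(matchModel(models, "pool-1", "model-b"))
}
```

Under these assumptions, the first call finds a match in `pool-1`, while the second does not because `model-b` references a different pool. After this commit the protocol document no longer mandates this check.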
