Out of the box, the LLM xpack provides wrappers for text generation and embedding LLMs. For text generation, you can use native wrappers for the OpenAI chat model and HuggingFace models running locally. Many other popular models, including Azure OpenAI, HuggingFace (when using their API) or Gemini, can be used with the [`LiteLLM`](/developers/user-guide/llm-xpack/llm-chats#litellm) wrapper. For the full list of providers supported by LiteLLM, check the [LiteLLM documentation](https://docs.litellm.ai/docs/providers).
::if{path="/llm-xpack/"}
Currently, Pathway provides wrappers for the following LLMs:
- [Hugging Face Pipeline](/developers/user-guide/llm-xpack/llm-chats#hugging-face-pipeline)
::
::if{path="/llm-xpack/"}
To use a wrapper, first create an instance of the wrapper, which you can then apply to a column containing prompts.
We create a Pathway table to be used in the examples below:
```python
import pathway as pw

query_table = pw.debug.table_from_markdown(
    """
prompt                                  | max_tokens
How many 'r' there are in 'strawberry'? | 400
""",
    split_on_whitespace=False,
)
```
::
::if{path="/llm-xpack/"}
## UDFs
Each wrapper is a [UDF](/developers/api-docs/pathway#pathway.UDF) (User Defined Function), which allows users to define their own functions to interact with Pathway objects. A UDF, in general, is any function that takes some input, processes it, and returns an output. In the context of the Pathway library, UDFs enable seamless integration of custom logic, such as invoking LLMs for specific tasks.
In particular, a UDF can serve as a wrapper for LLM calls, allowing users to pass prompts or other inputs to a model and retrieve the corresponding outputs. This design makes it easy to interact with Pathway tables and columns while incorporating the power of LLMs.
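To make this concrete, here is a toy UDF (purely illustrative, not part of the xpack) applied the same way the LLM wrappers are:

```python
import pathway as pw

# A toy UDF that builds a prompt from a column value.
# LLM wrappers follow the same pattern, but call a model inside.
@pw.udf
def make_prompt(question: str) -> str:
    return f"Answer briefly: {question}"

# Applied to a table column like any other expression:
# prompts = query_table.select(prompt=make_prompt(pw.this.prompt))
```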
::
## OpenAIChat
For OpenAI, you create a wrapper using the [`OpenAIChat` class](/developers/api-docs/pathway-xpacks-llm/llms#pathway.xpacks.llm.llms.OpenAIChat).
```python
from pathway.xpacks.llm import llms

# Create an instance of the wrapper with the chosen OpenAI model.
model = llms.OpenAIChat(model="gpt-4o-mini")

# Ask one question per row; prompt_chat_single_qa wraps each prompt
# in the message format expected by the OpenAI API.
responses = query_table.select(
    response=model(
        llms.prompt_chat_single_qa(pw.this.prompt),
        max_tokens=pw.this.max_tokens,
    )
)

# Run the computations (including sending requests to OpenAI) and print the output table
pw.debug.compute_and_print(responses)
```
::
::if{path="/ai-pipelines/"}
```yaml
chat: !pw.xpacks.llm.llms.OpenAIChat
  model: "gpt-4o-mini"
  api_key: $OPENAI_API_KEY
```
::
::if{path="/llm-xpack/"}
### Message format
`OpenAIChat` expects messages to be in the format required by the [OpenAI API](https://platform.openai.com/docs/api-reference/chat/create) - that is, a list of dictionaries, where each dictionary is one message in the conversation so far. For asking a single question, you can use [`pw.xpacks.llm.llms.prompt_chat_single_qa`](/developers/api-docs/pathway-xpacks-llm/llms#pathway.xpacks.llm.llms.prompt_chat_single_qa) to wrap a string so that it matches the format expected by the OpenAI API. Our example above presents that use case.
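For illustration, a single-question conversation in this format looks as follows (a hand-written sketch of the message format, not the output of a specific Pathway helper):

```python
# One conversation in the OpenAI chat format: a list of message dicts.
messages = [
    {"role": "user", "content": "How many 'r' there are in 'strawberry'?"},
]
```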
## LiteLLM

Pathway has a wrapper for LiteLLM - [`LiteLLMChat`](/developers/api-docs/pathway-xpacks-llm/llms#pathway.xpacks.llm.llms.LiteLLMChat). For example, to use Gemini with LiteLLM, create an instance of `LiteLLMChat` and then apply it to the column with messages to be sent over the API.
model: "gemini/gemini-pro", # Choose the model you want
138
+
```
139
+
::
With the wrapper for LiteLLM, Pathway allows you to use many popular LLMs.
## Hugging Face pipeline
For models from Hugging Face that you want to run locally, Pathway gives a separate wrapper called `HFPipelineChat` (to call Hugging Face models through their API, use the LiteLLM wrapper). When an instance of this wrapper is created, it initializes a HuggingFace `pipeline`, so any [arguments to the `pipeline`](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.pipeline) - including the name of the model - must be set during the initialization of `HFPipelineChat`. Any parameters to `pipeline.__call__` can, as before, be set during initialization or overridden when the wrapper is applied.
::if{path="/llm-xpack/"}
```python
from pathway.xpacks.llm import llms
model = llms.HFPipelineChat(
    model="gpt2",  # Choose the model you want
)

responses = query_table.select(response=model(pw.this.prompt))
```
::

::if{path="/ai-pipelines/"}
```yaml
chat: !pw.xpacks.llm.llms.HFPipelineChat
  model: "TinyLlama/TinyLlama-1.1B-Chat-v1.0" # Choose the model you want
```
::
Note that the format of questions used in the Hugging Face pipeline depends on the model. Some models, like [`gpt2`](https://huggingface.co/openai-community/gpt2), expect a prompt string, whereas conversation models also accept messages as a list of dicts, in which case the model's prompt template is applied.
::if{path="/ai-pipelines/"}
Note that Pathway AI pipelines expect conversation models, so models like `gpt2` cannot be used.
::
For more information, see [pipeline docs](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.TextGenerationPipeline.__call__.text_inputs).
::if{path="/llm-xpack/"}
For example, for the model [`TinyLlama/TinyLlama-1.1B-Chat-v1.0`](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0), you can use it as follows:
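A minimal sketch of such a call (reusing the `query_table` from earlier; the exact code in the original example may differ):

```python
from pathway.xpacks.llm import llms

model = llms.HFPipelineChat(
    model="TinyLlama/TinyLlama-1.1B-Chat-v1.0",
)

# Conversation models accept messages as a list of dicts,
# so the single-question helper works here as well.
responses = query_table.select(
    response=model(llms.prompt_chat_single_qa(pw.this.prompt))
)
```
::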
## Cohere

Pathway has a wrapper for the [Cohere Chat Services](https://docs.cohere.com/docs/command-beta). The wrapper allows for augmenting the query with documents; the result contains cited documents along with the response.
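A hedged sketch of how this could look with the `CohereChat` wrapper from `pathway.xpacks.llm.llms` (the `documents` argument and the `docs` column here are illustrative assumptions, not confirmed API details):

```python
from pathway.xpacks.llm import llms

chat = llms.CohereChat()  # assumes a Cohere API key is configured in the environment

# Illustrative only: augment each query with retrieved documents so the
# response can cite them (argument and column names are assumptions).
responses = query_table.select(
    response=chat(
        llms.prompt_chat_single_qa(pw.this.prompt),
        documents=pw.this.docs,
    )
)
```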
## Asynchrony

Wrappers for OpenAI and LiteLLM, both for chat and embedding, are asynchronous, and Pathway allows you to set three parameters to control their behavior. These are:
- `capacity`, which limits the number of concurrent requests,
- `retry_strategy`, which sets the strategy for handling retries in case of failures,
- `cache_strategy`, which allows you to cache results of the calls.
These three parameters need to be set during the initialization of the wrapper. You can read more about them in the [UDFs guide](/developers/user-guide/data-transformation/user-defined-functions#asyncexecutor).
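For instance, a configuration sketch (assuming the retry and cache strategies exposed in `pathway.udfs`):

```python
from pathway import udfs
from pathway.xpacks.llm import llms

model = llms.OpenAIChat(
    model="gpt-4o-mini",
    capacity=8,  # at most 8 concurrent requests
    retry_strategy=udfs.ExponentialBackoffRetryStrategy(max_retries=4),
    cache_strategy=udfs.DiskCache(),
)
```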
# Embedders

The following embedding wrappers are available through the Pathway xpack:

- [`OpenAIEmbedder`](/developers/api-docs/pathway-xpacks-llm/embedders/#pathway.xpacks.llm.embedders.OpenAIEmbedder)
- [`LiteLLMEmbedder`](/developers/api-docs/pathway-xpacks-llm/embedders/#pathway.xpacks.llm.embedders.LiteLLMEmbedder)
- [`SentenceTransformerEmbedder`](/developers/api-docs/pathway-xpacks-llm/embedders/#pathway.xpacks.llm.embedders.SentenceTransformerEmbedder)
- [`GeminiEmbedder`](/developers/api-docs/pathway-xpacks-llm/embedders/#pathway.xpacks.llm.embedders.GeminiEmbedder)
## OpenAIEmbedder
The default model for [`OpenAIEmbedder`](/developers/api-docs/pathway-xpacks-llm/embedders/#pathway.xpacks.llm.embedders.OpenAIEmbedder) is `text-embedding-3-small`.
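A short sketch of its use (the `docs` table and its `text` column are illustrative):

```python
from pathway.xpacks.llm import embedders

embedder = embedders.OpenAIEmbedder()  # defaults to text-embedding-3-small

# Illustrative application to a table with a `text` column:
# embeddings = docs.select(embedding=embedder(pw.this.text))
```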
## LiteLLMEmbedder

The model for [`LiteLLMEmbedder`](/developers/api-docs/pathway-xpacks-llm/embedders/#pathway.xpacks.llm.embedders.LiteLLMEmbedder) has to be specified during initialization. No default is provided.
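For example (a sketch; the model string follows LiteLLM's naming convention for the chosen provider):

```python
from pathway.xpacks.llm import embedders

embedder = embedders.LiteLLMEmbedder(model="text-embedding-3-small")
```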
## SentenceTransformerEmbedder

The [`SentenceTransformerEmbedder`](/developers/api-docs/pathway-xpacks-llm/embedders/#pathway.xpacks.llm.embedders.SentenceTransformerEmbedder) allows you to use models from the Hugging Face Sentence Transformers library.
The model is specified during initialization. Here is a list of [`available models`](https://www.sbert.net/docs/sentence_transformer/pretrained_models.html).
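For instance (a sketch; `all-MiniLM-L6-v2` is one of the pretrained Sentence Transformers models):

```python
from pathway.xpacks.llm import embedders

embedder = embedders.SentenceTransformerEmbedder(model="all-MiniLM-L6-v2")
```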
## GeminiEmbedder
[`GeminiEmbedder`](/developers/api-docs/pathway-xpacks-llm/embedders/#pathway.xpacks.llm.embedders.GeminiEmbedder) is the embedder for Google's Gemini Embedding Services. Available models can be found [`here`](https://ai.google.dev/gemini-api/docs/models/gemini#text-embedding-and-embedding).
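A brief sketch (the model name here is an assumption; pick one from the list linked above):

```python
from pathway.xpacks.llm import embedders

embedder = embedders.GeminiEmbedder(model="models/text-embedding-004")
```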
# Splitters

A better method is to chunk the text by tokens, ensuring each chunk makes sense.
## TokenCountSplitter
Pathway offers a [`TokenCountSplitter`](/developers/api-docs/pathway-xpacks-llm/splitters#pathway.xpacks.llm.splitters.TokenCountSplitter) for token-based chunking. Here's how to use it:
::if{path="/llm-xpack/"}
```python
from pathway.xpacks.llm.splitters import TokenCountSplitter

splitter = TokenCountSplitter(
    min_tokens=100,
    max_tokens=500,
    encoding_name="cl100k_base",
)

# Illustrative application to a table with a `text` column:
# chunks = docs.select(chunks=splitter(pw.this.text))
```
::
This configuration creates chunks of 100–500 tokens using the `cl100k_base` tokenizer, compatible with OpenAI's embedding models.
## RecursiveSplitter

Like `TokenCountSplitter`, it measures chunk length in tokens. However, the way it determines split points differs.
The splitter continues this process until all chunks are smaller than `chunk_size`.
Additionally, you can introduce overlapping chunks by setting the `chunk_overlap` parameter. This is particularly useful if you want to capture different contexts in your chunks. However, keep in mind that enabling overlap increases the total number of chunks retrieved, which could impact performance.
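A configuration sketch (values are arbitrary; `chunk_size` and `chunk_overlap` are the parameters described above):

```python
from pathway.xpacks.llm.splitters import RecursiveSplitter

splitter = RecursiveSplitter(
    chunk_size=400,    # maximum chunk size, in tokens
    chunk_overlap=50,  # tokens shared between consecutive chunks
)
```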