
(Community): Adding Structured Support for ChatPerplexity #29361

Open · wants to merge 20 commits into master
Conversation

@keenborder786 (Contributor) commented Jan 23, 2025

vercel bot commented Jan 23, 2025

1 Skipped Deployment: langchain ⬜️ Ignored, updated Feb 7, 2025 9:00pm (UTC)

@keenborder786 keenborder786 marked this pull request as ready for review January 25, 2025 19:21
@dosubot dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. community Related to langchain-community labels Jan 25, 2025
@keenborder786 (Contributor, Author) commented:

@ccurme

@chain
def _oai_structured_outputs_parser(ai_msg: AIMessage) -> PydanticBaseModel:
    if ai_msg.additional_kwargs.get("parsed"):
        return ai_msg.additional_kwargs["parsed"]
Collaborator commented:

Is a BaseModel instance getting populated under "parsed" in .additional_kwargs?

@keenborder786 (Contributor, Author) commented:

@ccurme please see now. I have double-checked and tested it against the Perplexity docs as well.

@keenborder786 (Contributor, Author) commented:

@ccurme looking all good, please review

@keenborder786 (Contributor, Author) commented:

@ccurme

1 similar comment
@keenborder786 (Contributor, Author) commented:

@ccurme

@ccurme (Collaborator) left a comment:

I enabled standard tests for perplexity to pick up tests for structured output. It's currently failing; we expect to handle TypedDict, Pydantic, and JSON schema.

More importantly, this doesn't appear to work for any input type. Let me know if I'm doing something wrong.

from langchain_community.chat_models import ChatPerplexity
from pydantic import BaseModel, Field

class Joke(BaseModel):
    """Joke to tell user."""

    setup: str = Field(description="question to set up a joke")
    punchline: str = Field(description="answer to resolve the joke")

llm = ChatPerplexity(model="sonar").with_structured_output(Joke)
result = llm.invoke("Tell me a joke about cats.")

BadRequestError: Error code: 400 - {'error': {'message': '["At body -> response_format -> ResponseFormatText -> type: Input should be 'text'", "At body -> response_format -> ResponseFormatJSONSchema -> type: Input should be 'json_schema'", "At body -> response_format -> ResponseFormatJSONSchema -> json_schema: Field required", "At body -> response_format -> ResponseFormatRegex -> type: Input should be 'regex'", "At body -> response_format -> ResponseFormatRegex -> regex: Field required"]', 'type': 'bad_request', 'code': 400}}

@keenborder786 (Contributor, Author) commented:

okay @ccurme

@keenborder786 (Contributor, Author) commented:

@ccurme I have ensured that we are handling TypedDict, Pydantic, and JSON Schema. To clarify, currently, Perplexity only supports JSON Schema for structured output. Additionally, I have accounted for both Pydantic V1 and Pydantic V2 when converting schemas to JSON.
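Since Perplexity only accepts JSON Schema, the other input types have to be normalized to JSON Schema first. A minimal sketch of what such normalization could look like, covering Pydantic v1 and v2 (illustrative only; `to_json_schema` is a hypothetical helper, not the PR's actual code, and TypedDicts would need an additional conversion step, e.g. via pydantic's `TypeAdapter`):

```python
def to_json_schema(schema) -> dict:
    """Normalize a schema input to a plain JSON Schema dict (hypothetical helper)."""
    if isinstance(schema, dict):
        # Assume a dict is already JSON Schema.
        return schema
    if hasattr(schema, "model_json_schema"):
        # Pydantic v2 models expose model_json_schema().
        return schema.model_json_schema()
    if hasattr(schema, "schema"):
        # Pydantic v1 models expose schema().
        return schema.schema()
    raise TypeError(f"Unsupported schema type: {type(schema)!r}")
```

Dispatching on the methods the object actually exposes avoids importing both Pydantic major versions just for an `isinstance` check.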

@keenborder786 (Contributor, Author) commented:

@ccurme

@ccurme (Collaborator) commented Feb 4, 2025:

Thanks for the update. I'm still getting the same error though

from langchain_community.chat_models import ChatPerplexity
from pydantic import BaseModel, Field

class Joke(BaseModel):
    """Joke to tell user."""

    setup: str = Field(description="question to set up a joke")
    punchline: str = Field(description="answer to resolve the joke")

llm = ChatPerplexity(model="sonar").with_structured_output(Joke)
result = llm.invoke("Tell me a joke about cats.")

Are you able to reproduce the issue?

@keenborder786 (Contributor, Author) commented:

@ccurme no

@keenborder786 (Contributor, Author) commented:

What is the exact error you are facing?

@ccurme (Collaborator) commented Feb 4, 2025:

What is the exact error you are facing?

BadRequestError: Error code: 400 - {'error': {'message': '["At body -> response_format -> ResponseFormatText -> type: Input should be 'text'", "At body -> response_format -> ResponseFormatJSONSchema -> type: Input should be 'json_schema'", "At body -> response_format -> ResponseFormatJSONSchema -> json_schema: Field required", "At body -> response_format -> ResponseFormatRegex -> type: Input should be 'regex'", "At body -> response_format -> ResponseFormatRegex -> regex: Field required"]', 'type': 'bad_request', 'code': 400}}

Here is the relevant key according to the docs:

"response_format": {
    "type": "json_schema",
    "json_schema": {"schema": AnswerFormat.model_json_schema()},
},
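Restating the documented envelope as a tiny helper makes the shape of the fix clearer. This is a sketch (`wrap_for_perplexity` is a hypothetical name, not library API); the 400 error above is consistent with the bare schema being sent as `response_format` instead of this tagged wrapper:

```python
def wrap_for_perplexity(json_schema: dict) -> dict:
    """Wrap a raw JSON Schema dict in the tagged envelope shown in the docs."""
    return {"type": "json_schema", "json_schema": {"schema": json_schema}}

# e.g. wrap_for_perplexity(AnswerFormat.model_json_schema()) produces the
# documented {"type": "json_schema", "json_schema": {"schema": ...}} shape.
```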

else:
    response_format = schema.schema()  # type: ignore[union-attr]
    llm = self.bind(response_format=response_format)
    output_parser = JsonOutputParser()
Collaborator commented:

If a Pydantic object is passed in for the schema, we should return a Pydantic object.

Contributor Author replied:

@ccurme I accidentally forgot to add the schema. It has been fixed now and I have tested it with a testing account as well.

@keenborder786 (Contributor, Author) commented:

@ccurme

@ccurme (Collaborator) left a comment:

Thanks @keenborder786, there's still a bit of work to do on this one.

Here are the test cases to get passing:

from langchain_community.chat_models import ChatPerplexity
from pydantic import BaseModel, Field


query = "Tell me a joke about cats. Output a json object."
llm = ChatPerplexity(model="sonar")


# Pydantic
class Joke(BaseModel):
    """Joke to tell user."""

    setup: str = Field(description="question to set up a joke")
    punchline: str = Field(description="answer to resolve the joke")

structured_llm = llm.with_structured_output(Joke)
result = structured_llm.invoke(query)
assert isinstance(result, Joke)

## Streaming
for chunk in structured_llm.stream(query):
    assert isinstance(chunk, Joke)

# JSON schema

structured_llm = llm.with_structured_output(Joke.model_json_schema())
result = structured_llm.invoke(query)
assert isinstance(result, dict)
assert isinstance(result["setup"], str)
assert isinstance(result["punchline"], str)


for chunk in structured_llm.stream(query):
    assert isinstance(chunk, dict)

assert isinstance(chunk["setup"], str)
assert isinstance(chunk["punchline"], str)

# TypedDict

from typing_extensions import Annotated, TypedDict

class JokeDict(TypedDict):
    """Joke to tell user."""

    setup: Annotated[str, ..., "question to set up a joke"]
    punchline: Annotated[str, ..., "answer to resolve the joke"]


structured_llm = llm.with_structured_output(JokeDict)
result = structured_llm.invoke(query)
assert isinstance(result, dict)
assert isinstance(result["setup"], str)
assert isinstance(result["punchline"], str)


for chunk in structured_llm.stream(query):
    assert isinstance(chunk, dict)

assert isinstance(chunk["setup"], str)
assert isinstance(chunk["punchline"], str)

These are essentially our standard tests for structured output. We cannot use them out of the box because Perplexity's feature is different in that it appears as though you need to specifically prompt it to return a JSON object.
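Since the model apparently needs that explicit instruction (note the "Output a json object." suffix on the test query above), one way to adapt the standard tests would be a small prompt wrapper. A sketch under that assumption (`ensure_json_instruction` is a hypothetical name):

```python
def ensure_json_instruction(query: str) -> str:
    """Append an explicit JSON instruction if the prompt lacks one (sketch)."""
    instruction = "Output a json object."
    if instruction.lower() not in query.lower():
        query = f"{query} {instruction}"
    return query
```

The containment check keeps the wrapper idempotent, so queries that already ask for JSON pass through unchanged.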
