Description
We recently added polar in our list of our tested third party libraries, to better prevent regressions in future versions of Pydantic.
To improve build performance, we are going to make some internal changes to the handling of __get_pydantic_core_schema__ and Pydantic models in pydantic/pydantic#10863. As a consequence, the __get_pydantic_core_schema__ method of the BaseModel class was going to be removed, but turns out that some projects (including polar) are relying on this method, e.g. in the ListResource model:
|
@classmethod |
|
def __get_pydantic_core_schema__( |
|
cls, source: type[BaseModel], handler: GetCoreSchemaHandler, / |
|
) -> CoreSchema: |
|
""" |
|
Override the schema to set the `ref` field to the overridden class name. |
|
""" |
|
result = super().__get_pydantic_core_schema__(source, handler) |
|
result["ref"] = cls.__name__ # type: ignore |
|
return result |
As a consequence, we are going to raise a deprecation warning when super().__get_pydantic_core_schema__ is being called to ease transition. In the future, this can be fixed by directly calling handler(source) instead. However, I wouldn't recommend implementing __get_pydantic_core_schema__ on Pydantic models, as it can lead to unexpected behavior.
In the case of ListResource, you are mutating the core schema reference, which is crashing the core schema generation logic in some cases:
class ListResource[T](BaseModel):
@classmethod
def __get_pydantic_core_schema__(
cls, source: type[BaseModel], handler: GetCoreSchemaHandler, /
) -> CoreSchema:
"""
Override the schema to set the `ref` field to the overridden class name.
"""
result = super().__get_pydantic_core_schema__(source, handler)
result["ref"] = cls.__name__ # type: ignore
return result
class Model(BaseModel):
a: ListResource[int]
b: ListResource[int]
# Crash with a KeyError when the schema for `Model` is generated
The reason for this is that internally, references are used to avoid generating a core schema twice for the same object (in the case of Model, the core schema for ListResource[int] is only generated once). To do so, we generate a reference for the object and compare it with the already generated definitions. But because the "ref" was dynamically changed, Pydantic is not able to retrieve the already generated schema and this breaks a lot of things.
It seems that changing the ref was made in order to simplify the generated JSON Schema names in #3833. Instead, I would suggest using a custom GenerateJsonSchema class, and overriding the relevant method (probably get_defs_ref). I know it may be more tedious to do so, but altering the core schema ref directly is never going to play well 1
As a side note, I also see you are using the internal display_as_type function:
|
@classmethod |
|
def model_parametrized_name(cls, params: tuple[type[Any], ...]) -> str: |
|
""" |
|
Override default model name implementation to detect `ClassName` metadata. |
|
|
|
It's useful to shorten the name when a long union type is used. |
|
""" |
|
param_names = [] |
|
for param in params: |
|
if hasattr(param, "__metadata__"): |
|
for metadata in param.__metadata__: |
|
if isinstance(metadata, ClassName): |
|
param_names.append(metadata.name) |
|
else: |
|
param_names.append(display_as_type(param)) |
Because ListResource is defined with a single type variable, I can suggest using the following instead:
@classmethod
def model_parametrized_name(cls, params: tuple[type[Any]]) -> str: # Guaranteed to be of length 1
"""
Override default model name implementation to detect `ClassName` metadata.
It's useful to shorten the name when a long union type is used.
"""
param = params[0]
if hasattr(param, "__metadata__"):
for metadata in param.__metadata__:
if isinstance(metadata, ClassName):
return f"{cls.__name__}[{metadata.name}]"
return super().model_parametrized_name(params)
But, again, if this is done for JSON Schema generation purposes, it might be best to leave the model name unchanged and define a custom GenerateJsonSchema class.
Description
We recently added polar in our list of our tested third party libraries, to better prevent regressions in future versions of Pydantic.
To improve build performance, we are going to make some internal changes to the handling of
__get_pydantic_core_schema__and Pydantic models in pydantic/pydantic#10863. As a consequence, the__get_pydantic_core_schema__method of theBaseModelclass was going to be removed, but turns out that some projects (including polar) are relying on this method, e.g. in theListResourcemodel:polar/server/polar/kit/pagination.py
Lines 146 to 155 in ae2c70a
As a consequence, we are going to raise a deprecation warning when
super().__get_pydantic_core_schema__is being called to ease transition. In the future, this can be fixed by directly callinghandler(source)instead. However, I wouldn't recommend implementing__get_pydantic_core_schema__on Pydantic models, as it can lead to unexpected behavior.In the case of
ListResource, you are mutating the core schema reference, which is crashing the core schema generation logic in some cases:The reason for this is that internally, references are used to avoid generating a core schema twice for the same object (in the case of
Model, the core schema forListResource[int]is only generated once). To do so, we generate a reference for the object and compare it with the already generated definitions. But because the"ref"was dynamically changed, Pydantic is not able to retrieve the already generated schema and this breaks a lot of things.It seems that changing the ref was made in order to simplify the generated JSON Schema names in #3833. Instead, I would suggest using a custom
GenerateJsonSchemaclass, and overriding the relevant method (probablyget_defs_ref). I know it may be more tedious to do so, but altering the core schema ref directly is never going to play well 1As a side note, I also see you are using the internal
display_as_typefunction:polar/server/polar/kit/pagination.py
Lines 127 to 141 in ae2c70a
Because
ListResourceis defined with a single type variable, I can suggest using the following instead:But, again, if this is done for JSON Schema generation purposes, it might be best to leave the model name unchanged and define a custom
GenerateJsonSchemaclass.Footnotes
Alternatively, we are thinking about designing a new API for core schema generation, that would allow providing a custom reference generation implementation for Pydantic models (but also other types). ↩