Skip to content

Conversation

@jbrockmendel
Copy link
Member

cc @jorisvandenbossche

Copy link
Member

@jorisvandenbossche jorisvandenbossche left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good!

# GH#61916
warnings.warn(
"For backward compatibility, 'str' dtypes are included by "
"select_dtypes when object dtypes are specified. "
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
"select_dtypes when object dtypes are specified. "
"select_dtypes when 'object' dtype is specified. "

Comment on lines 2375 to 2380
To select string columns include ``str``:

.. ipython:: python
df.select_dtypes(include=["object"])
df.select_dtypes(include=[str])
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would maybe add a note that this changed in pandas 3.0 and that for pandas<3, include="object" was used. Maybe with a link to https://pandas.pydata.org/docs/user_guide/migration-3-strings.html#hardcoded-use-of-object-dtype

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good idea, updated.

@mroeschke mroeschke added the Deprecate Functionality to remove in pandas label Oct 17, 2025
.. note::

This is a change in pandas 3.0. Previously strings were stored in ``object`` dtype columns, so would be selected with ``include=[object]``. See https://pandas.pydata.org/docs/user_guide/migration-3-strings.html#hardcoded-use-of-object-dtype.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
This is a change in pandas 3.0. Previously strings were stored in ``object`` dtype columns, so would be selected with ``include=[object]``. See https://pandas.pydata.org/docs/user_guide/migration-3-strings.html#hardcoded-use-of-object-dtype.
This is a change in pandas 3.0. Previously strings were stored in ``object`` dtype columns, so would be selected with ``include=[object]``. See :ref:`string_migration.object`.

Best to use internal references for something like this (but will do this in a follow-up PR expanding that section a bit)

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The migration guide actually already had the content, but so just updating the reference, and also added a link to the depr warning -> #62759

@jorisvandenbossche jorisvandenbossche merged commit 15ca85b into pandas-dev:main Oct 19, 2025
47 checks passed
@jorisvandenbossche
Copy link
Member

@jbrockmendel thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Deprecate Functionality to remove in pandas

Projects

None yet

Development

Successfully merging this pull request may close these issues.

String dtype: backwards compatibility of selecting "object" vs "str" columns in select_dtypes

3 participants