Skip to content

Commit 8f9039d

Browse files
authored
BUG: Fix ValueError in DataFrame/Series regex replace for all-NA values (#60691)
* BUG: DataFrame/Series regex replace fix for all NA values * Add entry to whatsnew
1 parent 0110487 commit 8f9039d

File tree

4 files changed

+17
-1
lines changed

4 files changed

+17
-1
lines changed

doc/source/whatsnew/v3.0.0.rst

+1
Original file line numberDiff line numberDiff line change
@@ -798,6 +798,7 @@ Other
798798
- Bug in :meth:`Series.dt` methods in :class:`ArrowDtype` that were returning incorrect values. (:issue:`57355`)
799799
- Bug in :meth:`Series.rank` that doesn't preserve missing values for nullable integers when ``na_option='keep'``. (:issue:`56976`)
800800
- Bug in :meth:`Series.replace` and :meth:`DataFrame.replace` inconsistently replacing matching instances when ``regex=True`` and missing values are present. (:issue:`56599`)
801+
- Bug in :meth:`Series.replace` and :meth:`DataFrame.replace` throwing ``ValueError`` when ``regex=True`` and all NA values. (:issue:`60688`)
801802
- Bug in :meth:`Series.to_string` when series contains complex floats with exponents (:issue:`60405`)
802803
- Bug in :meth:`read_csv` where chained fsspec TAR file and ``compression="infer"`` fails with ``tarfile.ReadError`` (:issue:`60028`)
803804
- Bug in Dataframe Interchange Protocol implementation was returning incorrect results for data buffers' associated dtype, for string and datetime columns (:issue:`54781`)

pandas/core/array_algos/replace.py

+2-1
Original file line numberDiff line numberDiff line change
@@ -89,7 +89,8 @@ def _check_comparison_types(
8989
op = np.vectorize(
9090
lambda x: bool(re.search(b, x))
9191
if isinstance(x, str) and isinstance(b, (str, Pattern))
92-
else False
92+
else False,
93+
otypes=[bool],
9394
)
9495

9596
# GH#32621 use mask to avoid comparing to NAs

pandas/tests/frame/methods/test_replace.py

+7
Original file line numberDiff line numberDiff line change
@@ -713,6 +713,13 @@ def test_replace_with_None_keeps_categorical(self):
713713
)
714714
tm.assert_frame_equal(result, expected)
715715

716+
def test_replace_all_NA(self):
717+
# GH#60688
718+
df = DataFrame({"ticker": ["#1234#"], "name": [None]})
719+
result = df.replace({col: {r"^#": "$"} for col in df.columns}, regex=True)
720+
expected = DataFrame({"ticker": ["$1234#"], "name": [None]})
721+
tm.assert_frame_equal(result, expected)
722+
716723
def test_replace_value_is_none(self, datetime_frame):
717724
orig_value = datetime_frame.iloc[0, 0]
718725
orig2 = datetime_frame.iloc[1, 0]

pandas/tests/series/methods/test_replace.py

+7
Original file line numberDiff line numberDiff line change
@@ -708,3 +708,10 @@ def test_replace_ea_float_with_bool(self):
708708
expected = ser.copy()
709709
result = ser.replace(0.0, True)
710710
tm.assert_series_equal(result, expected)
711+
712+
def test_replace_all_NA(self):
713+
# GH#60688
714+
df = pd.Series([pd.NA, pd.NA])
715+
result = df.replace({r"^#": "$"}, regex=True)
716+
expected = pd.Series([pd.NA, pd.NA])
717+
tm.assert_series_equal(result, expected)

0 commit comments

Comments
 (0)