Skip to content

Commit 231824f

Browse files
authored
doc: mention binary properties and ascii options (#308)
1 parent 3046486 commit 231824f

File tree

2 files changed

+16
-8
lines changed

2 files changed

+16
-8
lines changed

doc/pcre2compat.3

Lines changed: 10 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -54,9 +54,10 @@ interprets them.
5454
7. The Perl escape sequences \ep, \eP, and \eX are supported only if PCRE2 is
5555
built with Unicode support (the default). The properties that can be tested
5656
with \ep and \eP are limited to the general category properties such as Lu and
57-
Nd, script names such as Greek or Han, Bidi_Class, Bidi_Control, and the
58-
derived properties Any and LC (synonym L&). Both PCRE2 and Perl support the Cs
59-
(surrogate) property, but in PCRE2 its use is limited. See the
57+
Nd, the derived properties Any and LC (synonym L&), script names such as Greek
58+
or Han, Bidi_Class, Bidi_Control, and a few binary properties. Both PCRE2 and
59+
Perl support the Cs (surrogate) property, but in PCRE2 its use is limited. See
60+
the
6061
.\" HREF
6162
\fBpcre2pattern\fP
6263
.\"
@@ -204,6 +205,11 @@ documentation for details. Perl went with 5.10 from recursion to iteration
204205
keeping the intermediate matches on the heap, which is ~10% slower but does not
205206
fall into any stack-overflow limit. PCRE2 made a similar change at release
206207
10.30, and also has many build-time and run-time customizable limits.
208+
.P
209+
21. Unlike Perl, PCRE2 doesn't have character set modifiers and specially no way
210+
to set characters by context just like Perl's "/d". A regular expression using
211+
PCRE2_UTF and PCRE2_UCP will use similar rules to Perl's "/u"; something closer
212+
to "/a" could be selected by adding other PCRE2_EXTRA_ASCII* options on top.
207213
.
208214
.
209215
.SH AUTHOR
@@ -220,6 +226,6 @@ Cambridge, England.
220226
.rs
221227
.sp
222228
.nf
223-
Last updated: 19 September 2023
229+
Last updated: 12 October 2023
224230
Copyright (c) 1997-2023 University of Cambridge.
225231
.fi

doc/pcre2unicode.3

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -42,9 +42,11 @@ When PCRE2 is built with Unicode support, the escape sequences \ep{..},
4242
\eP{..}, and \eX can be used. This is not dependent on the PCRE2_UTF setting.
4343
The Unicode properties that can be tested are a subset of those that Perl
4444
supports. Currently they are limited to the general category properties such as
45-
Lu for an upper case letter or Nd for a decimal number, the Unicode script
46-
names such as Arabic or Han, Bidi_Class, Bidi_Control, and the derived
47-
properties Any and LC (synonym L&). Full lists are given in the
45+
Lu for an upper case letter or Nd for a decimal number, the derived properties
46+
Any and LC (synonym L&), the Unicode script names such as Arabic or Han,
47+
Bidi_Class, Bidi_Control, and a few binary properties.
48+
.P
49+
The full lists are given in the
4850
.\" HREF
4951
\fBpcre2pattern\fP
5052
.\"
@@ -478,6 +480,6 @@ Cambridge, England.
478480
.rs
479481
.sp
480482
.nf
481-
Last updated: 04 February 2023
483+
Last updated: 12 October 2023
482484
Copyright (c) 1997-2023 University of Cambridge.
483485
.fi

0 commit comments

Comments
 (0)