[re.grammar] - C++17 → C++20

tmp/tmpndlhgghp/{from.md → to.md} +37 -45

@@ -13,58 +13,57 @@ including the formatted string input functions. Instead they shall call
 the appropriate traits member function to achieve the required effect.
 The following productions within the ECMAScript grammar are modified as
 follows:
-``` cpp
 ClassAtom::
-  -
   ClassAtomNoDash
   ClassAtomExClass
   ClassAtomCollatingElement
   ClassAtomEquivalence
 IdentityEscape::
-  SourceCharacter but not c
 ```
 The following new productions are then added:
-``` cpp
 ClassAtomExClass::
-  [: ClassName :]
 ClassAtomCollatingElement::
-  [. ClassName .]
 ClassAtomEquivalence::
-  [= ClassName =]
 ClassName::
   ClassNameCharacter
   ClassNameCharacter ClassName
 ClassNameCharacter::
-  SourceCharacter but not one of "." "=" ":"
 ```
-The productions `ClassAtomExClass`, `ClassAtomCollatingElement` and
-`ClassAtomEquivalence` provide functionality equivalent to that of the
 same features in regular expressions in POSIX.
 The regular expression grammar may be modified by any
 `regex_constants::syntax_option_type` flags specified when constructing
 an object of type specialization of `basic_regex` according to the rules
-in Table [[tab:re:syntaxoption]].
-A `ClassName` production, when used in `ClassAtomExClass`, is not valid
-if `traits_inst.lookup_classname` returns zero for that name. The names
-recognized as valid `ClassName`s are determined by the type of the
-traits class, but at least the following names shall be recognized:
-`alnum`, `alpha`, `blank`, `cntrl`, `digit`, `graph`, `lower`, `print`,
-`punct`, `space`, `upper`, `xdigit`, `d`, `s`, `w`. In addition the
-following expressions shall be equivalent:
 ``` cpp
 \d and [[:digit:]]
 \D and [^[:digit:]]
@@ -76,22 +75,22 @@ following expressions shall be equivalent:
 \w and [_[:alnum:]]
 \W and [^_[:alnum:]]
 ```
-A `ClassName` production when used in a `ClassAtomCollatingElement`
-production is not valid if the value returned by
-`traits_inst.lookup_collatename` for that name is an empty string.
 The results from multiple calls to `traits_inst.lookup_classname` can be
-bitwise OR’ed together and subsequently passed to `traits_inst.isctype`.
-A `ClassName` production when used in a `ClassAtomEquivalence`
-production is not valid if the value returned by
-`traits_inst.lookup_collatename` for that name is an empty string or if
-the value returned by `traits_inst.transform_primary` for the result of
-the call to `traits_inst.lookup_collatename` is an empty string.
 When the sequence of characters being transformed to a finite state
 machine contains an invalid class name the translator shall throw an
 exception object of type `regex_error`.
@@ -106,13 +105,13 @@ Where the regular expression grammar requires the conversion of a
 sequence of characters to an integral value, this is accomplished by
 calling `traits_inst.value`.
 The behavior of the internal finite state machine representation when
 used to match a sequence of characters is as described in ECMA-262. The
-behavior is modified according to any match_flag_type flags (
-[[re.matchflag]]) specified when using the regular expression object in
-one of the regular expression algorithms ([[re.alg]]). The behavior is
 also localized by interaction with the traits class template parameter
 as follows:
 - During matching of a regular expression finite state machine against a
   sequence of characters, two characters `c` and `d` are compared using
@@ -125,24 +124,24 @@ as follows:
     `traits_inst.translate(c) == traits_inst.translate(d)`;
   - otherwise, the two characters are equal if `c == d`.
 - During matching of a regular expression finite state machine against a
   sequence of characters, comparison of a collating element range
   `c1-c2` against a character `c` is conducted as follows: if
-  `flags() & regex_constants::collate` is false then the character `c`
   is matched if `c1
   <= c && c <= c2`, otherwise `c` is matched in accordance with the
   following algorithm:
   ``` cpp
   string_type str1 = string_type(1,
     flags() & icase ?
-      traits_inst.translate_nocase(c1) : traits_inst.translate(c1);
   string_type str2 = string_type(1,
     flags() & icase ?
-      traits_inst.translate_nocase(c2) : traits_inst.translate(c2);
   string_type str = string_type(1,
     flags() & icase ?
-      traits_inst.translate_nocase(c) : traits_inst.translate(c);
   return traits_inst.transform(str1.begin(), str1.end())
         <= traits_inst.transform(str.begin(), str.end())
     && traits_inst.transform(str.begin(), str.end())
         <= traits_inst.transform(str2.begin(), str2.end());
   ```
@@ -156,10 +155,12 @@ as follows:
   sequence of characters, a character `c` is a member of a character
   class designated by an iterator range \[`first`, `last`) if
   `traits_inst.isctype(c, traits_inst.lookup_classname(first, last, flags() & icase))`
   is `true`.
 <!-- Link reference definitions -->
 [algorithms]: algorithms.md#algorithms
 [bidirectional.iterators]: iterators.md#bidirectional.iterators
 [bitmask.types]: library.md#bitmask.types
 [container.requirements.general]: containers.md#container.requirements.general
@@ -181,14 +182,12 @@ as follows:
 [re.grammar]: #re.grammar
 [re.iter]: #re.iter
 [re.matchflag]: #re.matchflag
 [re.regex]: #re.regex
 [re.regex.assign]: #re.regex.assign
-[re.regex.const]: #re.regex.const
 [re.regex.construct]: #re.regex.construct
 [re.regex.locale]: #re.regex.locale
-[re.regex.nmswap]: #re.regex.nmswap
 [re.regex.nonmemb]: #re.regex.nonmemb
 [re.regex.operations]: #re.regex.operations
 [re.regex.swap]: #re.regex.swap
 [re.regiter]: #re.regiter
 [re.regiter.cnstr]: #re.regiter.cnstr
@@ -206,27 +205,20 @@ as follows:
 [re.results.state]: #re.results.state
 [re.results.swap]: #re.results.swap
 [re.submatch]: #re.submatch
 [re.submatch.members]: #re.submatch.members
 [re.submatch.op]: #re.submatch.op
 [re.syn]: #re.syn
 [re.synopt]: #re.synopt
 [re.tokiter]: #re.tokiter
 [re.tokiter.cnstr]: #re.tokiter.cnstr
 [re.tokiter.comp]: #re.tokiter.comp
 [re.tokiter.deref]: #re.tokiter.deref
 [re.tokiter.incr]: #re.tokiter.incr
 [re.traits]: #re.traits
 [sequence.reqmts]: containers.md#sequence.reqmts
 [strings.general]: strings.md#strings.general
-[tab:re.lib.summary]: #tab:re.lib.summary
-[tab:re.traits.classnames]: #tab:re.traits.classnames
-[tab:re:RegexpTraits]: #tab:re:RegexpTraits
-[tab:re:alg:match]: #tab:re:alg:match
-[tab:re:alg:search]: #tab:re:alg:search
-[tab:re:errortype]: #tab:re:errortype
-[tab:re:matchflag]: #tab:re:matchflag
-[tab:re:results:assign]: #tab:re:results:assign
-[tab:re:syntaxoption]: #tab:re:syntaxoption
 [^1]: For example, if the parameter `icase` is `true` then `[[:lower:]]`
     is the same as `[[:alpha:]]`.

 the appropriate traits member function to achieve the required effect.
 The following productions within the ECMAScript grammar are modified as
 follows:
+``` bnf
 ClassAtom::
+ '-'
   ClassAtomNoDash
   ClassAtomExClass
   ClassAtomCollatingElement
   ClassAtomEquivalence
 IdentityEscape::
+  SourceCharacter but not 'c'
 ```
 The following new productions are then added:
+``` bnf
 ClassAtomExClass::
+ '[:' ClassName ':]'
 ClassAtomCollatingElement::
+ '[.' ClassName '.]'
 ClassAtomEquivalence::
+ '[=' ClassName '=]'
 ClassName::
   ClassNameCharacter
   ClassNameCharacter ClassName
 ClassNameCharacter::
+  SourceCharacter but not one of '.' or '=' or ':'
 ```
+The productions `ClassAtomExClass`, `ClassAtomCollatingElement` and
+`ClassAtomEquivalence` provide functionality equivalent to that of the
 same features in regular expressions in POSIX.
 The regular expression grammar may be modified by any
 `regex_constants::syntax_option_type` flags specified when constructing
 an object of type specialization of `basic_regex` according to the rules
+in [[re.synopt]].
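As an illustration of passing `syntax_option_type` flags at construction, here is a minimal sketch. The function name `matches_ignoring_case` and the pattern `hello` are hypothetical and chosen only for this example; the flags themselves are the standard `regex_constants::ECMAScript` and `regex_constants::icase`.

```cpp
#include <regex>
#include <string>

// Hypothetical helper: builds an ECMAScript regex with the icase flag set
// and reports whether the whole input matches it.
bool matches_ignoring_case(const std::string& text) {
    const std::regex re("hello",
                        std::regex_constants::ECMAScript |
                        std::regex_constants::icase);
    return std::regex_match(text, re);
}
```

Without `icase`, the same pattern would only match the lowercase spelling.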
+A `ClassName` production, when used in `ClassAtomExClass`, is not valid
+if `traits_inst.lookup_classname` returns zero for that name. The names
+recognized as valid `ClassName`s are determined by the type of the
+traits class, but at least the following names shall be recognized:
+`alnum`, `alpha`, `blank`, `cntrl`, `digit`, `graph`, `lower`, `print`,
+`punct`, `space`, `upper`, `xdigit`, `d`, `s`, `w`. In addition the
+following expressions shall be equivalent:
 ``` cpp
 \d and [[:digit:]]
 \D and [^[:digit:]]
 \w and [_[:alnum:]]
 \W and [^_[:alnum:]]
 ```
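The equivalence between the shorthand escapes and the bracketed class names can be checked with a small sketch like the following. The helper name `same_digit_match` is hypothetical; it only compares the results of two `std::regex` objects on the same input.

```cpp
#include <regex>
#include <string>

// Hypothetical helper: returns true when the \d shorthand and the
// [[:digit:]] class agree on whether the whole input matches.
bool same_digit_match(const std::string& text) {
    const std::regex shorthand(R"(\d+)");
    const std::regex posix_class("[[:digit:]]+");
    return std::regex_match(text, shorthand) ==
           std::regex_match(text, posix_class);
}
```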
+A `ClassName` production when used in a `ClassAtomCollatingElement`
+production is not valid if the value returned by
+`traits_inst.lookup_collatename` for that name is an empty string.
 The results from multiple calls to `traits_inst.lookup_classname` can be
+bitwise OR’ed together and subsequently passed to `traits_inst.isctype`.
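A minimal sketch of combining class masks, assuming the default `std::regex_traits<char>` stands in for `traits_inst`. The helper name `is_alpha_or_digit` is hypothetical.

```cpp
#include <regex>
#include <string>

// Hypothetical helper: ORs the masks for "alpha" and "digit" and asks the
// traits object whether c belongs to the combined class.
bool is_alpha_or_digit(char c) {
    const std::regex_traits<char> traits_inst;
    const std::string alpha = "alpha";
    const std::string digit = "digit";
    const std::regex_traits<char>::char_class_type mask =
        traits_inst.lookup_classname(alpha.begin(), alpha.end()) |
        traits_inst.lookup_classname(digit.begin(), digit.end());
    return traits_inst.isctype(c, mask);
}
```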
+A `ClassName` production when used in a `ClassAtomEquivalence`
+production is not valid if the value returned by
+`traits_inst.lookup_collatename` for that name is an empty string or if
+the value returned by `traits_inst.transform_primary` for the result of
+the call to `traits_inst.lookup_collatename` is an empty string.
 When the sequence of characters being transformed to a finite state
 machine contains an invalid class name the translator shall throw an
 exception object of type `regex_error`.
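The following sketch shows how a caller might observe this exception. The helper name `throws_regex_error` and the class name `no_such_class` in the usage note are made up for illustration; which `error_type` code the exception carries is not assumed here.

```cpp
#include <regex>

// Hypothetical helper: returns true if constructing a regex from the
// pattern throws std::regex_error, false if construction succeeds.
bool throws_regex_error(const char* pattern) {
    try {
        const std::regex re(pattern);
        (void)re;  // only the construction matters here
    } catch (const std::regex_error&) {
        return true;
    }
    return false;
}
```

For example, `throws_regex_error("[[:no_such_class:]]")` should report the failure, while `throws_regex_error("[[:alpha:]]")` should not.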
 sequence of characters to an integral value, this is accomplished by
 calling `traits_inst.value`.
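A small sketch of the conversion step, again assuming `std::regex_traits<char>` as the traits class. The wrapper name `digit_value` is hypothetical; `value` itself takes a character and a radix of 8, 10, or 16 and returns -1 when the character is not a digit in that radix.

```cpp
#include <regex>

// Hypothetical wrapper around regex_traits<char>::value.
// radix is expected to be 8, 10, or 16.
int digit_value(char ch, int radix) {
    const std::regex_traits<char> traits_inst;
    return traits_inst.value(ch, radix);
}
```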
 The behavior of the internal finite state machine representation when
 used to match a sequence of characters is as described in ECMA-262. The
+behavior is modified according to any `match_flag_type` flags
+[[re.matchflag]] specified when using the regular expression object in
+one of the regular expression algorithms [[re.alg]]. The behavior is
 also localized by interaction with the traits class template parameter
 as follows:
 - During matching of a regular expression finite state machine against a
   sequence of characters, two characters `c` and `d` are compared using
     `traits_inst.translate(c) == traits_inst.translate(d)`;
   - otherwise, the two characters are equal if `c == d`.
 - During matching of a regular expression finite state machine against a
   sequence of characters, comparison of a collating element range
   `c1-c2` against a character `c` is conducted as follows: if
+  `flags() & regex_constants::collate` is `false` then the character `c`
   is matched if `c1
   <= c && c <= c2`, otherwise `c` is matched in accordance with the
   following algorithm:
   ``` cpp
   string_type str1 = string_type(1,
     flags() & icase ?
+      traits_inst.translate_nocase(c1) : traits_inst.translate(c1));
   string_type str2 = string_type(1,
     flags() & icase ?
+      traits_inst.translate_nocase(c2) : traits_inst.translate(c2));
   string_type str = string_type(1,
     flags() & icase ?
+      traits_inst.translate_nocase(c) : traits_inst.translate(c));
   return traits_inst.transform(str1.begin(), str1.end())
         <= traits_inst.transform(str.begin(), str.end())
     && traits_inst.transform(str.begin(), str.end())
         <= traits_inst.transform(str2.begin(), str2.end());
   ```
   sequence of characters, a character `c` is a member of a character
   class designated by an iterator range \[`first`, `last`) if
   `traits_inst.isctype(c, traits_inst.lookup_classname(first, last, flags() & icase))`
   is `true`.
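The collating range comparison and the character class membership test from the list above can be sketched as free functions. This is an illustration only, assuming `std::regex_traits<char>` as the traits class; the names `in_collating_range` and `in_named_class` are hypothetical, and the `icase` parameter stands in for `flags() & icase`.

```cpp
#include <regex>
#include <string>

using traits_type = std::regex_traits<char>;
using string_type = traits_type::string_type;

// Hypothetical helper: tests c against the range c1-c2 by comparing
// collation sort keys, as in the algorithm for the collate case.
bool in_collating_range(const traits_type& traits_inst,
                        char c1, char c2, char c, bool icase) {
    // Build the sort key of a single (possibly case-folded) character.
    auto key = [&](char ch) {
        const string_type s(1, icase ? traits_inst.translate_nocase(ch)
                                     : traits_inst.translate(ch));
        return traits_inst.transform(s.begin(), s.end());
    };
    const string_type k = key(c);
    return key(c1) <= k && k <= key(c2);
}

// Hypothetical helper: tests whether c belongs to the named character class.
bool in_named_class(const traits_type& traits_inst,
                    char c, const std::string& name, bool icase) {
    return traits_inst.isctype(
        c, traits_inst.lookup_classname(name.begin(), name.end(), icase));
}
```

The results of `in_collating_range` depend on the locale imbued in the traits object, since `transform` delegates to that locale's collation rules.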
+ECMA-262 15.10
 <!-- Link reference definitions -->
 [algorithms]: algorithms.md#algorithms
 [bidirectional.iterators]: iterators.md#bidirectional.iterators
 [bitmask.types]: library.md#bitmask.types
 [container.requirements.general]: containers.md#container.requirements.general
 [re.grammar]: #re.grammar
 [re.iter]: #re.iter
 [re.matchflag]: #re.matchflag
 [re.regex]: #re.regex
 [re.regex.assign]: #re.regex.assign
 [re.regex.construct]: #re.regex.construct
 [re.regex.locale]: #re.regex.locale
 [re.regex.nonmemb]: #re.regex.nonmemb
 [re.regex.operations]: #re.regex.operations
 [re.regex.swap]: #re.regex.swap
 [re.regiter]: #re.regiter
 [re.regiter.cnstr]: #re.regiter.cnstr
 [re.results.state]: #re.results.state
 [re.results.swap]: #re.results.swap
 [re.submatch]: #re.submatch
 [re.submatch.members]: #re.submatch.members
 [re.submatch.op]: #re.submatch.op
+[re.summary]: #re.summary
 [re.syn]: #re.syn
 [re.synopt]: #re.synopt
 [re.tokiter]: #re.tokiter
 [re.tokiter.cnstr]: #re.tokiter.cnstr
 [re.tokiter.comp]: #re.tokiter.comp
 [re.tokiter.deref]: #re.tokiter.deref
 [re.tokiter.incr]: #re.tokiter.incr
 [re.traits]: #re.traits
+[re.traits.classnames]: #re.traits.classnames
 [sequence.reqmts]: containers.md#sequence.reqmts
 [strings.general]: strings.md#strings.general
 [^1]: For example, if the parameter `icase` is `true` then `[[:lower:]]`
     is the same as `[[:alpha:]]`.
