[lex.ext] - C++17 → C++20

tmp/tmpx2hovdoy/{from.md → to.md} +56 -46

@@ -1,11 +1,11 @@
 ### User-defined literals <a id="lex.ext">[[lex.ext]]</a>
 ``` bnf
 user-defined-literal:
     user-defined-integer-literal
-    user-defined-floating-literal
     user-defined-string-literal
     user-defined-character-literal
 ```
 ``` bnf
@@ -15,11 +15,11 @@ user-defined-integer-literal:
     hexadecimal-literal ud-suffix
     binary-literal ud-suffix
 ```
 ``` bnf
-user-defined-floating-literal:
     fractional-constant exponent-partₒₚₜ ud-suffix
     digit-sequence exponent-part ud-suffix
     hexadecimal-prefix hexadecimal-fractional-constant binary-exponent-part ud-suffix
     hexadecimal-prefix hexadecimal-digit-sequence binary-exponent-part ud-suffix
 ```
@@ -53,65 +53,65 @@ is a *user-defined-literal*, but `12LL` is an *integer-literal*.
 The syntactic non-terminal preceding the *ud-suffix* in a
 *user-defined-literal* is taken to be the longest sequence of characters
 that could match that non-terminal.
 A *user-defined-literal* is treated as a call to a literal operator or
-literal operator template ([[over.literal]]). To determine the form of
 this call for a given *user-defined-literal* *L* with *ud-suffix* *X*,
 the *literal-operator-id* whose literal suffix identifier is *X* is
 looked up in the context of *L* using the rules for unqualified name
-lookup ([[basic.lookup.unqual]]). Let *S* be the set of declarations
-found by this lookup. *S* shall not be empty.
 If *L* is a *user-defined-integer-literal*, let *n* be the literal
 without its *ud-suffix*. If *S* contains a literal operator with
 parameter type `unsigned long long`, the literal *L* is treated as a
 call of the form
 ``` cpp
 operator "" X(nULL)
 ```
-Otherwise, *S* shall contain a raw literal operator or a literal
-operator template ([[over.literal]]) but not both. If *S* contains a
-raw literal operator, the literal *L* is treated as a call of the form
 ``` cpp
 operator "" X("n{"})
 ```
-Otherwise (*S* contains a literal operator template), *L* is treated as
-a call of the form
 ``` cpp
 operator "" X<'c₁', 'c₂', ... 'cₖ'>()
 ```
 where *n* is the source character sequence c₁c₂...cₖ.
 [*Note 1*: The sequence c₁c₂...cₖ can only contain characters from the
 basic source character set. — *end note*]
-If *L* is a *user-defined-floating-literal*, let *f* be the literal
-without its *ud-suffix*. If *S* contains a literal operator with
 parameter type `long double`, the literal *L* is treated as a call of
 the form
 ``` cpp
 operator "" X(fL)
 ```
-Otherwise, *S* shall contain a raw literal operator or a literal
-operator template ([[over.literal]]) but not both. If *S* contains a
-raw literal operator, the *literal* *L* is treated as a call of the form
 ``` cpp
 operator "" X("f{"})
 ```
-Otherwise (*S* contains a literal operator template), *L* is treated as
-a call of the form
 ``` cpp
 operator "" X<'c₁', 'c₂', ... 'cₖ'>()
 ```
@@ -120,20 +120,28 @@ where *f* is the source character sequence c₁c₂...cₖ.
 [*Note 2*: The sequence c₁c₂...cₖ can only contain characters from the
 basic source character set. — *end note*]
 If *L* is a *user-defined-string-literal*, let *str* be the literal
 without its *ud-suffix* and let *len* be the number of code units in
-*str* (i.e., its length excluding the terminating null character). The
 literal *L* is treated as a call of the form
 ``` cpp
 operator "" X(str, len)
 ```
 If *L* is a *user-defined-character-literal*, let *ch* be the literal
-without its *ud-suffix*. *S* shall contain a literal operator (
-[[over.literal]]) whose only parameter has the type of *ch* and the
 literal *L* is treated as a call of the form
 ``` cpp
 operator "" X(ch)
 ```
@@ -152,16 +160,16 @@ int main() {
 }
 ```
 — *end example*]
-In translation phase 6 ([[lex.phases]]), adjacent string literals are
-concatenated and *user-defined-string-literal*s are considered string
-literals for that purpose. During concatenation, *ud-suffix*es are
-removed and ignored and the concatenation process occurs as described
-in  [[lex.string]]. At the end of phase 6, if a string literal is the
-result of a concatenation involving at least one
 *user-defined-string-literal*, all the participating
 *user-defined-string-literal*s shall have the same *ud-suffix* and that
 suffix is applied to the result of the concatenation.
 [*Example 3*:
@@ -179,51 +187,55 @@ int main() {
 [basic.fundamental]: basic.md#basic.fundamental
 [basic.link]: basic.md#basic.link
 [basic.lookup.unqual]: basic.md#basic.lookup.unqual
 [basic.stc]: basic.md#basic.stc
 [basic.types]: basic.md#basic.types
-[conv.mem]: conv.md#conv.mem
-[conv.ptr]: conv.md#conv.ptr
 [cpp]: cpp.md#cpp
 [cpp.concat]: cpp.md#cpp.concat
 [cpp.cond]: cpp.md#cpp.cond
 [cpp.include]: cpp.md#cpp.include
 [cpp.stringize]: cpp.md#cpp.stringize
 [dcl.attr.grammar]: dcl.md#dcl.attr.grammar
 [headers]: library.md#headers
 [lex]: #lex
 [lex.bool]: #lex.bool
 [lex.ccon]: #lex.ccon
 [lex.charset]: #lex.charset
 [lex.comment]: #lex.comment
 [lex.digraph]: #lex.digraph
 [lex.ext]: #lex.ext
 [lex.fcon]: #lex.fcon
 [lex.header]: #lex.header
 [lex.icon]: #lex.icon
 [lex.key]: #lex.key
 [lex.literal]: #lex.literal
 [lex.literal.kinds]: #lex.literal.kinds
 [lex.name]: #lex.name
 [lex.nullptr]: #lex.nullptr
 [lex.operators]: #lex.operators
 [lex.phases]: #lex.phases
 [lex.ppnumber]: #lex.ppnumber
 [lex.pptoken]: #lex.pptoken
 [lex.separate]: #lex.separate
 [lex.string]: #lex.string
 [lex.token]: #lex.token
 [over.literal]: over.md#over.literal
-[tab:alternative.representations]: #tab:alternative.representations
-[tab:alternative.tokens]: #tab:alternative.tokens
-[tab:charname.allowed]: #tab:charname.allowed
-[tab:charname.disallowed]: #tab:charname.disallowed
-[tab:escape.sequences]: #tab:escape.sequences
-[tab:identifiers.special]: #tab:identifiers.special
-[tab:keywords]: #tab:keywords
-[tab:lex.string.concat]: #tab:lex.string.concat
-[tab:lex.type.integer.literal]: #tab:lex.type.integer.literal
 [temp.explicit]: temp.md#temp.explicit
 [temp.names]: temp.md#temp.names
 [^1]: Implementations must behave as if these separate phases occur,
     although in practice different phases might be folded together.
@@ -244,21 +256,21 @@ int main() {
     (described in translation phase 1) is specified as
     *implementation-defined*, an implementation is required to document
     how the basic source characters are represented in source files.
 [^5]: A sequence of characters resembling a *universal-character-name*
-    in an *r-char-sequence* ([[lex.string]]) does not form a
     *universal-character-name*.
 [^6]:  These include “digraphs” and additional reserved words. The term
     “digraph” (token consisting of two characters) is not perfectly
-    descriptive, since one of the alternative preprocessing-tokens is
     `%:%:` and of course several primary tokens contain two characters.
     Nonetheless, those alternative tokens that aren’t lexical keywords
     are colloquially known as “digraphs”.
-[^7]: Thus the “stringized” values ([[cpp.stringize]]) of `[` and `<:`
     will be different, maintaining the source spelling, but the tokens
     can otherwise be freely interchanged.
 [^8]: Literals include strings and character and numeric literals.
@@ -275,15 +287,13 @@ int main() {
     long external identifier, but C++ does not place a translation limit
     on significant characters for external identifiers. In C++, upper-
     and lower-case letters are considered different for all identifiers,
     including external identifiers.
-[^11]: The term “literal” generally designates, in this International
- Standard, those tokens that are called “constants” in ISO C.
-[^12]: The digits `8` and `9` are not octal digits.
-[^13]: They are intended for character sets where a character does not
     fit into a single byte.
-[^14]: Using an escape sequence for a question mark is supported for
     compatibility with ISO C++14 and ISO C.

 ### User-defined literals <a id="lex.ext">[[lex.ext]]</a>
 ``` bnf
 user-defined-literal:
     user-defined-integer-literal
+    user-defined-floating-point-literal
     user-defined-string-literal
     user-defined-character-literal
 ```
 ``` bnf
     hexadecimal-literal ud-suffix
     binary-literal ud-suffix
 ```
 ``` bnf
+user-defined-floating-point-literal:
     fractional-constant exponent-partₒₚₜ ud-suffix
     digit-sequence exponent-part ud-suffix
     hexadecimal-prefix hexadecimal-fractional-constant binary-exponent-part ud-suffix
     hexadecimal-prefix hexadecimal-digit-sequence binary-exponent-part ud-suffix
 ```
 The syntactic non-terminal preceding the *ud-suffix* in a
 *user-defined-literal* is taken to be the longest sequence of characters
 that could match that non-terminal.
 A *user-defined-literal* is treated as a call to a literal operator or
+literal operator template [[over.literal]]. To determine the form of
 this call for a given *user-defined-literal* *L* with *ud-suffix* *X*,
 the *literal-operator-id* whose literal suffix identifier is *X* is
 looked up in the context of *L* using the rules for unqualified name
+lookup [[basic.lookup.unqual]]. Let *S* be the set of declarations found
+by this lookup. *S* shall not be empty.
 If *L* is a *user-defined-integer-literal*, let *n* be the literal
 without its *ud-suffix*. If *S* contains a literal operator with
 parameter type `unsigned long long`, the literal *L* is treated as a
 call of the form
 ``` cpp
 operator "" X(nULL)
 ```
+Otherwise, *S* shall contain a raw literal operator or a numeric literal
+operator template [[over.literal]] but not both. If *S* contains a raw
+literal operator, the literal *L* is treated as a call of the form
 ``` cpp
 operator "" X("n{"})
 ```
+Otherwise (*S* contains a numeric literal operator template), *L* is
+treated as a call of the form
 ``` cpp
 operator "" X<'c₁', 'c₂', ... 'cₖ'>()
 ```
 where *n* is the source character sequence c₁c₂...cₖ.
 [*Note 1*: The sequence c₁c₂...cₖ can only contain characters from the
 basic source character set. — *end note*]
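The three call forms for a *user-defined-integer-literal* can be sketched as follows. The suffixes `_cooked`, `_raw`, and `_tmpl` are hypothetical names chosen for illustration. Each suffix gets only one kind of operator, because a raw literal operator and a numeric literal operator template must not both be found for the same suffix.

```cpp
#include <cstddef>

// Cooked form: 42_cooked is treated as operator""_cooked(42ULL).
constexpr unsigned long long operator""_cooked(unsigned long long n) {
    return n;
}

// Raw form: 0x2A_raw is treated as operator""_raw("0x2A").
// This sketch returns the number of characters in the spelling.
constexpr std::size_t operator""_raw(const char* spelling) {
    std::size_t count = 0;
    while (spelling[count] != '\0') ++count;
    return count;
}

// Numeric literal operator template: 1234_tmpl is treated as
// operator""_tmpl<'1', '2', '3', '4'>().
template <char... Cs>
constexpr std::size_t operator""_tmpl() {
    return sizeof...(Cs);
}

static_assert(42_cooked == 42ULL, "cooked form receives the value");
static_assert(0x2A_raw == 4, "raw form receives the spelling 0x2A");
static_assert(1234_tmpl == 4, "one template argument per character");
```

The cooked form sees only the value, while the raw and template forms see the source spelling, including any base prefix.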
+If *L* is a *user-defined-floating-point-literal*, let *f* be the
+literal without its *ud-suffix*. If *S* contains a literal operator with
 parameter type `long double`, the literal *L* is treated as a call of
 the form
 ``` cpp
 operator "" X(fL)
 ```
+Otherwise, *S* shall contain a raw literal operator or a numeric literal
+operator template [[over.literal]] but not both. If *S* contains a raw
+literal operator, the *literal* *L* is treated as a call of the form
 ``` cpp
 operator "" X("f{"})
 ```
+Otherwise (*S* contains a numeric literal operator template), *L* is
+treated as a call of the form
 ``` cpp
 operator "" X<'c₁', 'c₂', ... 'cₖ'>()
 ```
 [*Note 2*: The sequence c₁c₂...cₖ can only contain characters from the
 basic source character set. — *end note*]
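A minimal sketch of the floating-point case, using the hypothetical suffixes `_km` and `_has_exp`. The first operator takes the cooked `long double` value. The second is a raw literal operator that inspects the spelling.

```cpp
// Cooked form: 1.5_km is treated as operator""_km(1.5L).
constexpr long double operator""_km(long double v) {
    return v * 1000.0L;
}

// Raw form: 1e3_has_exp is treated as operator""_has_exp("1e3").
// This sketch reports whether the spelling contains a decimal exponent.
constexpr bool operator""_has_exp(const char* spelling) {
    for (; *spelling != '\0'; ++spelling) {
        if (*spelling == 'e' || *spelling == 'E') return true;
    }
    return false;
}

static_assert(1.5_km == 1500.0L, "cooked form receives the value as long double");
static_assert(1e3_has_exp, "raw form receives the spelling 1e3");
static_assert(!2.5_has_exp, "the ud-suffix is not part of the spelling");
```

Because the form of the call depends on the kind of literal, `2_km` would be ill-formed here: an integer literal looks for an `unsigned long long` parameter and does not fall back to the `long double` operator.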
 If *L* is a *user-defined-string-literal*, let *str* be the literal
 without its *ud-suffix* and let *len* be the number of code units in
+*str* (i.e., its length excluding the terminating null character). If
+*S* contains a literal operator template with a non-type template
+parameter for which *str* is a well-formed *template-argument*, the
 literal *L* is treated as a call of the form
+``` cpp
+operator "" X<str>()
+```
+Otherwise, the literal *L* is treated as a call of the form
 ``` cpp
 operator "" X(str, len)
 ```
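Both string forms can be sketched as below. The suffixes `_len` and `_first` and the class `FixedString` are hypothetical, introduced only for illustration. The template form relies on class-type non-type template parameters, so the sketch guards it with a check on `__cplusplus` and compiles it only in C++20 mode or later.

```cpp
#include <cstddef>

// Non-template form: "abc"_len is treated as operator""_len("abc", 3).
constexpr std::size_t operator""_len(const char*, std::size_t len) {
    return len;
}

static_assert("abc"_len == 3, "len excludes the terminating null character");

#if __cplusplus >= 202002L
// Hypothetical structural class that can be initialized from a string
// literal, so that a string literal is a well-formed template-argument.
template <std::size_t N>
struct FixedString {
    char data[N]{};
    constexpr FixedString(const char (&s)[N]) {
        for (std::size_t i = 0; i < N; ++i) data[i] = s[i];
    }
};

// String literal operator template: "abc"_first is treated as
// operator""_first<"abc">().
template <FixedString S>
constexpr char operator""_first() {
    return S.data[0];
}

static_assert("abc"_first == 'a');
#endif
```

With the template form the string contents are available as a compile-time template argument, which the `(str, len)` form does not guarantee.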
 If *L* is a *user-defined-character-literal*, let *ch* be the literal
+without its *ud-suffix*. *S* shall contain a literal operator
+[[over.literal]] whose only parameter has the type of *ch* and the
 literal *L* is treated as a call of the form
 ``` cpp
 operator "" X(ch)
 ```
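For character literals the parameter type must match the type of *ch* exactly, which the following sketch illustrates with the hypothetical suffixes `_code` and `_is_wide`.

```cpp
// 'A'_code is treated as operator""_code('A'); the type of 'A' is char.
constexpr int operator""_code(char c) {
    return static_cast<int>(c);
}

// Two literal operators for one suffix: the one whose parameter type
// matches the type of the character literal is called.
constexpr bool operator""_is_wide(char) { return false; }
constexpr bool operator""_is_wide(wchar_t) { return true; }

static_assert('A'_code == static_cast<int>('A'), "char literal, char parameter");
static_assert(!'x'_is_wide, "ordinary character literal has type char");
static_assert(L'x'_is_wide, "wide character literal has type wchar_t");
```

With only the `char` overload declared, `L'x'_is_wide` would be ill-formed, since no literal operator would take a `wchar_t`.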
 }
 ```
 — *end example*]
+In translation phase 6 [[lex.phases]], adjacent *string-literal*s are
+concatenated and *user-defined-string-literal*s are considered
+*string-literal*s for that purpose. During concatenation, *ud-suffix*es
+are removed and ignored and the concatenation process occurs as
+described in  [[lex.string]]. At the end of phase 6, if a
+*string-literal* is the result of a concatenation involving at least one
 *user-defined-string-literal*, all the participating
 *user-defined-string-literal*s shall have the same *ud-suffix* and that
 suffix is applied to the result of the concatenation.
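The concatenation rule can be sketched with a hypothetical suffix `_size` whose operator returns the length it receives. The suffix is applied once, to the concatenated result.

```cpp
#include <cstddef>

// "abcd"_size is treated as operator""_size("abcd", 4).
constexpr std::size_t operator""_size(const char*, std::size_t len) {
    return len;
}

// Both pieces carry the same ud-suffix: treated as "abcd"_size.
static_assert("ab"_size "cd"_size == 4, "");

// Only one piece carries a ud-suffix: still treated as "abcd"_size.
static_assert("ab" "cd"_size == 4, "");
static_assert("ab"_size "cd" == 4, "");

// Mixing two different ud-suffixes in one concatenation,
// e.g. "ab"_size "cd"_other, would be ill-formed.
```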
 [*Example 3*:
 [basic.fundamental]: basic.md#basic.fundamental
 [basic.link]: basic.md#basic.link
 [basic.lookup.unqual]: basic.md#basic.lookup.unqual
 [basic.stc]: basic.md#basic.stc
 [basic.types]: basic.md#basic.types
+[conv.mem]: expr.md#conv.mem
+[conv.ptr]: expr.md#conv.ptr
 [cpp]: cpp.md#cpp
 [cpp.concat]: cpp.md#cpp.concat
 [cpp.cond]: cpp.md#cpp.cond
+[cpp.import]: cpp.md#cpp.import
 [cpp.include]: cpp.md#cpp.include
+[cpp.module]: cpp.md#cpp.module
 [cpp.stringize]: cpp.md#cpp.stringize
 [dcl.attr.grammar]: dcl.md#dcl.attr.grammar
 [headers]: library.md#headers
 [lex]: #lex
 [lex.bool]: #lex.bool
 [lex.ccon]: #lex.ccon
+[lex.ccon.esc]: #lex.ccon.esc
 [lex.charset]: #lex.charset
 [lex.comment]: #lex.comment
 [lex.digraph]: #lex.digraph
 [lex.ext]: #lex.ext
 [lex.fcon]: #lex.fcon
+[lex.fcon.type]: #lex.fcon.type
 [lex.header]: #lex.header
 [lex.icon]: #lex.icon
+[lex.icon.base]: #lex.icon.base
+[lex.icon.type]: #lex.icon.type
 [lex.key]: #lex.key
+[lex.key.digraph]: #lex.key.digraph
 [lex.literal]: #lex.literal
 [lex.literal.kinds]: #lex.literal.kinds
 [lex.name]: #lex.name
+[lex.name.allowed]: #lex.name.allowed
+[lex.name.disallowed]: #lex.name.disallowed
+[lex.name.special]: #lex.name.special
 [lex.nullptr]: #lex.nullptr
 [lex.operators]: #lex.operators
 [lex.phases]: #lex.phases
 [lex.ppnumber]: #lex.ppnumber
 [lex.pptoken]: #lex.pptoken
 [lex.separate]: #lex.separate
 [lex.string]: #lex.string
+[lex.string.concat]: #lex.string.concat
 [lex.token]: #lex.token
+[module.import]: module.md#module.import
+[module.unit]: module.md#module.unit
 [over.literal]: over.md#over.literal
 [temp.explicit]: temp.md#temp.explicit
 [temp.names]: temp.md#temp.names
 [^1]: Implementations must behave as if these separate phases occur,
     although in practice different phases might be folded together.
     (described in translation phase 1) is specified as
     *implementation-defined*, an implementation is required to document
     how the basic source characters are represented in source files.
 [^5]: A sequence of characters resembling a *universal-character-name*
+    in an *r-char-sequence* [[lex.string]] does not form a
     *universal-character-name*.
 [^6]:  These include “digraphs” and additional reserved words. The term
     “digraph” (token consisting of two characters) is not perfectly
+    descriptive, since one of the alternative *preprocessing-token*s is
     `%:%:` and of course several primary tokens contain two characters.
     Nonetheless, those alternative tokens that aren’t lexical keywords
     are colloquially known as “digraphs”.
+[^7]: Thus the “stringized” values [[cpp.stringize]] of `[` and `<:`
     will be different, maintaining the source spelling, but the tokens
     can otherwise be freely interchanged.
 [^8]: Literals include strings and character and numeric literals.
     long external identifier, but C++ does not place a translation limit
     on significant characters for external identifiers. In C++, upper-
     and lower-case letters are considered different for all identifiers,
     including external identifiers.
+[^11]: The term “literal” generally designates, in this document, those
+    tokens that are called “constants” in ISO C.
+[^12]: They are intended for character sets where a character does not
     fit into a single byte.
+[^13]: Using an escape sequence for a question mark is supported for
     compatibility with ISO C++14 and ISO C.
