[lex.ext] - C++20 → C++23

Files changed (1) hide show

tmp/tmp_ep8ln91/{from.md → to.md} +29 -49

tmp/tmp_ep8ln91/{from.md → to.md} RENAMED Viewed

@@ -55,14 +55,13 @@ The syntactic non-terminal preceding the *ud-suffix* in a
 that could match that non-terminal.
 A *user-defined-literal* is treated as a call to a literal operator or
 literal operator template [[over.literal]]. To determine the form of
 this call for a given *user-defined-literal* *L* with *ud-suffix* *X*,
-the *literal-operator-id* whose literal suffix identifier is *X* is
-looked up in the context of *L* using the rules for unqualified name
-lookup [[basic.lookup.unqual]]. Let *S* be the set of declarations found
-by this lookup. *S* shall not be empty.
 If *L* is a *user-defined-integer-literal*, let *n* be the literal
 without its *ud-suffix*. If *S* contains a literal operator with
 parameter type `unsigned long long`, the literal *L* is treated as a
 call of the form
@@ -74,11 +73,11 @@ operator "" X(nULL)
 Otherwise, *S* shall contain a raw literal operator or a numeric literal
 operator template [[over.literal]] but not both. If *S* contains a raw
 literal operator, the literal *L* is treated as a call of the form
 ``` cpp
-operator "" X("n{"})
 ```
 Otherwise (*S* contains a numeric literal operator template), *L* is
 treated as a call of the form
@@ -87,11 +86,11 @@ operator "" X<'c₁', 'c₂', ... 'cₖ'>()
 ```
 where *n* is the source character sequence c₁c₂...cₖ.
 [*Note 1*: The sequence c₁c₂...cₖ can only contain characters from the
-basic source character set. — *end note*]
 If *L* is a *user-defined-floating-point-literal*, let *f* be the
 literal without its *ud-suffix*. If *S* contains a literal operator with
 parameter type `long double`, the literal *L* is treated as a call of
 the form
@@ -103,11 +102,11 @@ operator "" X(fL)
 Otherwise, *S* shall contain a raw literal operator or a numeric literal
 operator template [[over.literal]] but not both. If *S* contains a raw
 literal operator, the *literal* *L* is treated as a call of the form
 ``` cpp
-operator "" X("f{"})
 ```
 Otherwise (*S* contains a numeric literal operator template), *L* is
 treated as a call of the form
@@ -116,11 +115,11 @@ operator "" X<'c₁', 'c₂', ... 'cₖ'>()
 ```
 where *f* is the source character sequence c₁c₂...cₖ.
 [*Note 2*: The sequence c₁c₂...cₖ can only contain characters from the
-basic source character set. — *end note*]
 If *L* is a *user-defined-string-literal*, let *str* be the literal
 without its *ud-suffix* and let *len* be the number of code units in
 *str* (i.e., its length excluding the terminating null character). If
 *S* contains a literal operator template with a non-type template
@@ -174,39 +173,43 @@ suffix is applied to the result of the concatenation.
 [*Example 3*:
 ``` cpp
 int main() {
-  L"A" "B" "C"_x;   // OK: same as L"ABC"_x
   "P"_x "Q" "R"_y;  // error: two different ud-suffix{es}
 }
 ```
 — *end example*]
 <!-- Link reference definitions -->
 [basic.fundamental]: basic.md#basic.fundamental
 [basic.link]: basic.md#basic.link
 [basic.lookup.unqual]: basic.md#basic.lookup.unqual
 [basic.stc]: basic.md#basic.stc
-[basic.types]: basic.md#basic.types
 [conv.mem]: expr.md#conv.mem
 [conv.ptr]: expr.md#conv.ptr
 [cpp]: cpp.md#cpp
-[cpp.concat]: cpp.md#cpp.concat
 [cpp.cond]: cpp.md#cpp.cond
 [cpp.import]: cpp.md#cpp.import
 [cpp.include]: cpp.md#cpp.include
 [cpp.module]: cpp.md#cpp.module
 [cpp.stringize]: cpp.md#cpp.stringize
 [dcl.attr.grammar]: dcl.md#dcl.attr.grammar
 [headers]: library.md#headers
 [lex]: #lex
 [lex.bool]: #lex.bool
 [lex.ccon]: #lex.ccon
 [lex.ccon.esc]: #lex.ccon.esc
 [lex.charset]: #lex.charset
 [lex.comment]: #lex.comment
 [lex.digraph]: #lex.digraph
 [lex.ext]: #lex.ext
 [lex.fcon]: #lex.fcon
 [lex.fcon.type]: #lex.fcon.type
@@ -217,83 +220,60 @@ int main() {
 [lex.key]: #lex.key
 [lex.key.digraph]: #lex.key.digraph
 [lex.literal]: #lex.literal
 [lex.literal.kinds]: #lex.literal.kinds
 [lex.name]: #lex.name
-[lex.name.allowed]: #lex.name.allowed
-[lex.name.disallowed]: #lex.name.disallowed
 [lex.name.special]: #lex.name.special
 [lex.nullptr]: #lex.nullptr
 [lex.operators]: #lex.operators
 [lex.phases]: #lex.phases
 [lex.ppnumber]: #lex.ppnumber
 [lex.pptoken]: #lex.pptoken
 [lex.separate]: #lex.separate
 [lex.string]: #lex.string
 [lex.string.concat]: #lex.string.concat
 [lex.token]: #lex.token
 [module.import]: module.md#module.import
 [module.unit]: module.md#module.unit
 [over.literal]: over.md#over.literal
 [temp.explicit]: temp.md#temp.explicit
 [temp.names]: temp.md#temp.names
-[^1]: Implementations must behave as if these separate phases occur,
- although in practice different phases might be folded together.
 [^2]: A partial preprocessing token would arise from a source file
     ending in the first portion of a multi-character token that requires
     a terminating sequence of characters, such as a *header-name* that
     is missing the closing `"` or `>`. A partial comment would arise
     from a source file ending with an unclosed `/*` comment.
-[^3]: An implementation need not convert all non-corresponding source
-    characters to the same execution character.
-[^4]: The glyphs for the members of the basic source character set are
-    intended to identify characters from the subset of ISO/IEC 10646
-    which corresponds to the ASCII character set. However, because the
-    mapping from source file characters to the source character set
-    (described in translation phase 1) is specified as
-    *implementation-defined*, an implementation is required to document
-    how the basic source characters are represented in source files.
-[^5]: A sequence of characters resembling a *universal-character-name*
-    in an *r-char-sequence* [[lex.string]] does not form a
-    *universal-character-name*.
-[^6]:  These include “digraphs” and additional reserved words. The term
     “digraph” (token consisting of two characters) is not perfectly
     descriptive, since one of the alternative *preprocessing-token*s is
     `%:%:` and of course several primary tokens contain two characters.
     Nonetheless, those alternative tokens that aren’t lexical keywords
     are colloquially known as “digraphs”.
-[^7]: Thus the “stringized” values [[cpp.stringize]] of `[` and `<:`
     will be different, maintaining the source spelling, but the tokens
     can otherwise be freely interchanged.
-[^8]: Literals include strings and character and numeric literals.
-[^9]: Thus, a sequence of characters that resembles an escape sequence
- might result in an error, be interpreted as the character
     corresponding to the escape sequence, or have a completely different
     meaning, depending on the implementation.
-[^10]: On systems in which linkers cannot accept extended characters, an
-    encoding of the *universal-character-name* may be used in forming
     valid external identifiers. For example, some otherwise unused
-    character or sequence of characters may be used to encode the `\u`
- in a *universal-character-name*. Extended characters may produce a
     long external identifier, but C++ does not place a translation limit
-    on significant characters for external identifiers. In C++, upper-
-    and lower-case letters are considered different for all identifiers,
-    including external identifiers.
-[^11]: The term “literal” generally designates, in this document, those
     tokens that are called “constants” in ISO C.
-[^12]: They are intended for character sets where a character does not
-    fit into a single byte.
-[^13]: Using an escape sequence for a question mark is supported for
-    compatibility with ISO C++14 and ISO C.

 that could match that non-terminal.
 A *user-defined-literal* is treated as a call to a literal operator or
 literal operator template [[over.literal]]. To determine the form of
 this call for a given *user-defined-literal* *L* with *ud-suffix* *X*,
+first let *S* be the set of declarations found by unqualified lookup for
+the *literal-operator-id* whose literal suffix identifier is *X*
+[[basic.lookup.unqual]]. *S* shall not be empty.
 If *L* is a *user-defined-integer-literal*, let *n* be the literal
 without its *ud-suffix*. If *S* contains a literal operator with
 parameter type `unsigned long long`, the literal *L* is treated as a
 call of the form
 Otherwise, *S* shall contain a raw literal operator or a numeric literal
 operator template [[over.literal]] but not both. If *S* contains a raw
 literal operator, the literal *L* is treated as a call of the form
 ``` cpp
+operator ""X("n")
 ```
 Otherwise (*S* contains a numeric literal operator template), *L* is
 treated as a call of the form
 ```
 where *n* is the source character sequence c₁c₂...cₖ.
 [*Note 1*: The sequence c₁c₂...cₖ can only contain characters from the
+basic character set. — *end note*]
 If *L* is a *user-defined-floating-point-literal*, let *f* be the
 literal without its *ud-suffix*. If *S* contains a literal operator with
 parameter type `long double`, the literal *L* is treated as a call of
 the form
 Otherwise, *S* shall contain a raw literal operator or a numeric literal
 operator template [[over.literal]] but not both. If *S* contains a raw
 literal operator, the *literal* *L* is treated as a call of the form
 ``` cpp
+operator ""X("f")
 ```
 Otherwise (*S* contains a numeric literal operator template), *L* is
 treated as a call of the form
 ```
 where *f* is the source character sequence c₁c₂...cₖ.
 [*Note 2*: The sequence c₁c₂...cₖ can only contain characters from the
+basic character set. — *end note*]
 If *L* is a *user-defined-string-literal*, let *str* be the literal
 without its *ud-suffix* and let *len* be the number of code units in
 *str* (i.e., its length excluding the terminating null character). If
 *S* contains a literal operator template with a non-type template
 [*Example 3*:
 ``` cpp
 int main() {
+  L"A" "B" "C"_x;   // OK, same as L"ABC"_x
   "P"_x "Q" "R"_y;  // error: two different ud-suffix{es}
 }
 ```
 — *end example*]
 <!-- Link reference definitions -->
+[basic.extended.fp]: basic.md#basic.extended.fp
 [basic.fundamental]: basic.md#basic.fundamental
 [basic.link]: basic.md#basic.link
 [basic.lookup.unqual]: basic.md#basic.lookup.unqual
 [basic.stc]: basic.md#basic.stc
+[character.seq]: library.md#character.seq
 [conv.mem]: expr.md#conv.mem
 [conv.ptr]: expr.md#conv.ptr
 [cpp]: cpp.md#cpp
 [cpp.cond]: cpp.md#cpp.cond
 [cpp.import]: cpp.md#cpp.import
 [cpp.include]: cpp.md#cpp.include
 [cpp.module]: cpp.md#cpp.module
 [cpp.stringize]: cpp.md#cpp.stringize
 [dcl.attr.grammar]: dcl.md#dcl.attr.grammar
+[expr.prim.literal]: expr.md#expr.prim.literal
 [headers]: library.md#headers
 [lex]: #lex
 [lex.bool]: #lex.bool
 [lex.ccon]: #lex.ccon
 [lex.ccon.esc]: #lex.ccon.esc
+[lex.ccon.literal]: #lex.ccon.literal
 [lex.charset]: #lex.charset
+[lex.charset.basic]: #lex.charset.basic
+[lex.charset.literal]: #lex.charset.literal
 [lex.comment]: #lex.comment
 [lex.digraph]: #lex.digraph
 [lex.ext]: #lex.ext
 [lex.fcon]: #lex.fcon
 [lex.fcon.type]: #lex.fcon.type
 [lex.key]: #lex.key
 [lex.key.digraph]: #lex.key.digraph
 [lex.literal]: #lex.literal
 [lex.literal.kinds]: #lex.literal.kinds
 [lex.name]: #lex.name
 [lex.name.special]: #lex.name.special
 [lex.nullptr]: #lex.nullptr
 [lex.operators]: #lex.operators
 [lex.phases]: #lex.phases
 [lex.ppnumber]: #lex.ppnumber
 [lex.pptoken]: #lex.pptoken
 [lex.separate]: #lex.separate
 [lex.string]: #lex.string
 [lex.string.concat]: #lex.string.concat
+[lex.string.literal]: #lex.string.literal
 [lex.token]: #lex.token
 [module.import]: module.md#module.import
 [module.unit]: module.md#module.unit
 [over.literal]: over.md#over.literal
+[support.types.layout]: support.md#support.types.layout
 [temp.explicit]: temp.md#temp.explicit
 [temp.names]: temp.md#temp.names
+[^1]: Implementations behave as if these separate phases occur, although
+    in practice different phases can be folded together.
 [^2]: A partial preprocessing token would arise from a source file
     ending in the first portion of a multi-character token that requires
     a terminating sequence of characters, such as a *header-name* that
     is missing the closing `"` or `>`. A partial comment would arise
     from a source file ending with an unclosed `/*` comment.
+[^3]:  These include “digraphs” and additional reserved words. The term
     “digraph” (token consisting of two characters) is not perfectly
     descriptive, since one of the alternative *preprocessing-token*s is
     `%:%:` and of course several primary tokens contain two characters.
     Nonetheless, those alternative tokens that aren’t lexical keywords
     are colloquially known as “digraphs”.
+[^4]: Thus the “stringized” values [[cpp.stringize]] of `[` and `<:`
     will be different, maintaining the source spelling, but the tokens
     can otherwise be freely interchanged.
+[^5]: Literals include strings and character and numeric literals.
+[^6]: Thus, a sequence of characters that resembles an escape sequence
+ can result in an error, be interpreted as the character
     corresponding to the escape sequence, or have a completely different
     meaning, depending on the implementation.
+[^7]: On systems in which linkers cannot accept extended characters, an
+    encoding of the \*universal-character-name\* can be used in forming
     valid external identifiers. For example, some otherwise unused
+    character or sequence of characters can be used to encode the `̆` in
+    a \*universal-character-name\*. Extended characters can produce a
     long external identifier, but C++ does not place a translation limit
+    on significant characters for external identifiers.
+[^8]: The term “literal” generally designates, in this document, those
     tokens that are called “constants” in ISO C.

Diff to HTML by rtfpessoa