[lex.ext] - C++14 → C++17


tmp/tmp1d86r7_u/{from.md → to.md} +43 -30


@@ -18,10 +18,12 @@ user-defined-integer-literal:
 ``` bnf
 user-defined-floating-literal:
     fractional-constant exponent-partₒₚₜ ud-suffix
     digit-sequence exponent-part ud-suffix
 ```
 ``` bnf
 user-defined-string-literal:
     string-literal ud-suffix
@@ -35,15 +37,24 @@ user-defined-character-literal:
 ``` bnf
 ud-suffix:
     identifier
 ```
-If a token matches both *user-defined-literal* and another literal kind,
-it is treated as the latter. `123_km` is a *user-defined-literal*, but
-`12LL` is an *integer-literal*. The syntactic non-terminal preceding the
-*ud-suffix* in a *user-defined-literal* is taken to be the longest
-sequence of characters that could match that non-terminal.
 A *user-defined-literal* is treated as a call to a literal operator or
 literal operator template ([[over.literal]]). To determine the form of
 this call for a given *user-defined-literal* *L* with *ud-suffix* *X*,
 the *literal-operator-id* whose literal suffix identifier is *X* is
@@ -73,13 +84,14 @@ a call of the form
 ``` cpp
 operator "" X<'c₁', 'c₂', ... 'cₖ'>()
 ```
-where *n* is the source character sequence c₁c₂...cₖ. The sequence
-c₁c₂...cₖ can only contain characters from the basic source character
-set.
 If *L* is a *user-defined-floating-literal*, let *f* be the literal
 without its *ud-suffix*. If *S* contains a literal operator with
 parameter type `long double`, the literal *L* is treated as a call of
 the form
@@ -101,32 +113,35 @@ a call of the form
 ``` cpp
 operator "" X<'c₁', 'c₂', ... 'cₖ'>()
 ```
-where *f* is the source character sequence c₁c₂...cₖ. The sequence
-c₁c₂...cₖ can only contain characters from the basic source character
-set.
 If *L* is a *user-defined-string-literal*, let *str* be the literal
 without its *ud-suffix* and let *len* be the number of code units in
 *str* (i.e., its length excluding the terminating null character). The
 literal *L* is treated as a call of the form
 ``` cpp
-operator "" X(str{}, len{})
 ```
 If *L* is a *user-defined-character-literal*, let *ch* be the literal
 without its *ud-suffix*. *S* shall contain a literal operator (
 [[over.literal]]) whose only parameter has the type of *ch* and the
 literal *L* is treated as a call of the form
 ``` cpp
-operator "" X(ch{})
 ```
 ``` cpp
 long double operator "" _w(long double);
 std::string operator "" _w(const char16_t*, std::size_t);
 unsigned operator "" _w(const char*);
 int main() {
@@ -135,48 +150,47 @@ int main() {
   12_w;       // calls operator "" _w("12")
   "two"_w;    // error: no applicable literal operator
 }
 ```
 In translation phase 6 ([[lex.phases]]), adjacent string literals are
 concatenated and *user-defined-string-literal*s are considered string
 literals for that purpose. During concatenation, *ud-suffix*es are
 removed and ignored and the concatenation process occurs as described
 in  [[lex.string]]. At the end of phase 6, if a string literal is the
 result of a concatenation involving at least one
 *user-defined-string-literal*, all the participating
 *user-defined-string-literal*s shall have the same *ud-suffix* and that
 suffix is applied to the result of the concatenation.
 ``` cpp
 int main() {
   L"A" "B" "C"_x; // OK: same as L"ABC"_x
   "P"_x "Q" "R"_y;// error: two different ud-suffixes
 }
 ```
-Some *identifier*s appearing as *ud-suffix*es are reserved for future
-standardization ([[usrlit.suffix]]). A program containing such a
-*ud-suffix* is ill-formed, no diagnostic required.
 <!-- Link reference definitions -->
 [basic.fundamental]: basic.md#basic.fundamental
 [basic.link]: basic.md#basic.link
 [basic.lookup.unqual]: basic.md#basic.lookup.unqual
 [basic.stc]: basic.md#basic.stc
 [basic.types]: basic.md#basic.types
-[charname.allowed]: charname.md#charname.allowed
-[charname.disallowed]: charname.md#charname.disallowed
 [conv.mem]: conv.md#conv.mem
 [conv.ptr]: conv.md#conv.ptr
 [cpp]: cpp.md#cpp
 [cpp.concat]: cpp.md#cpp.concat
 [cpp.cond]: cpp.md#cpp.cond
 [cpp.include]: cpp.md#cpp.include
 [cpp.stringize]: cpp.md#cpp.stringize
 [dcl.attr.grammar]: dcl.md#dcl.attr.grammar
-[global.names]: library.md#global.names
 [headers]: library.md#headers
 [lex]: #lex
 [lex.bool]: #lex.bool
 [lex.ccon]: #lex.ccon
 [lex.charset]: #lex.charset
@@ -196,23 +210,22 @@ standardization ([[usrlit.suffix]]). A program containing such a
 [lex.ppnumber]: #lex.ppnumber
 [lex.pptoken]: #lex.pptoken
 [lex.separate]: #lex.separate
 [lex.string]: #lex.string
 [lex.token]: #lex.token
-[lex.trigraph]: #lex.trigraph
 [over.literal]: over.md#over.literal
 [tab:alternative.representations]: #tab:alternative.representations
 [tab:alternative.tokens]: #tab:alternative.tokens
 [tab:escape.sequences]: #tab:escape.sequences
 [tab:identifiers.special]: #tab:identifiers.special
 [tab:keywords]: #tab:keywords
 [tab:lex.string.concat]: #tab:lex.string.concat
 [tab:lex.type.integer.literal]: #tab:lex.type.integer.literal
-[tab:trigraph.sequences]: #tab:trigraph.sequences
 [temp.explicit]: temp.md#temp.explicit
 [temp.names]: temp.md#temp.names
-[usrlit.suffix]: library.md#usrlit.suffix
 [^1]: Implementations must behave as if these separate phases occur,
     although in practice different phases might be folded together.
 [^2]: A partial preprocessing token would arise from a source file
@@ -227,16 +240,16 @@ standardization ([[usrlit.suffix]]). A program containing such a
 [^4]: The glyphs for the members of the basic source character set are
     intended to identify characters from the subset of ISO/IEC 10646
     which corresponds to the ASCII character set. However, because the
     mapping from source file characters to the source character set
     (described in translation phase 1) is specified as
-    implementation-defined, an implementation is required to document
     how the basic source characters are represented in source files.
-[^5]: A sequence of characters resembling a universal-character-name in
-    an *r-char-sequence* ([[lex.string]]) does not form a
-    universal-character-name.
 [^6]:  These include “digraphs” and additional reserved words. The term
     “digraph” (token consisting of two characters) is not perfectly
     descriptive, since one of the alternative preprocessing-tokens is
     `%:%:` and of course several primary tokens contain two characters.
@@ -253,14 +266,14 @@ standardization ([[usrlit.suffix]]). A program containing such a
     might result in an error, be interpreted as the character
     corresponding to the escape sequence, or have a completely different
     meaning, depending on the implementation.
 [^10]: On systems in which linkers cannot accept extended characters, an
-    encoding of the universal-character-name may be used in forming
     valid external identifiers. For example, some otherwise unused
     character or sequence of characters may be used to encode the `\u`
-    in a universal-character-name. Extended characters may produce a
     long external identifier, but C++ does not place a translation limit
     on significant characters for external identifiers. In C++, upper-
     and lower-case letters are considered different for all identifiers,
     including external identifiers.
@@ -270,7 +283,7 @@ standardization ([[usrlit.suffix]]). A program containing such a
 [^12]: The digits `8` and `9` are not octal digits.
 [^13]: They are intended for character sets where a character does not
     fit into a single byte.
-[^14]: Using an escape sequence for a question mark can avoid
- accidentally creating a trigraph.

 ``` bnf
 user-defined-floating-literal:
     fractional-constant exponent-partₒₚₜ ud-suffix
     digit-sequence exponent-part ud-suffix
+    hexadecimal-prefix hexadecimal-fractional-constant binary-exponent-part ud-suffix
+    hexadecimal-prefix hexadecimal-digit-sequence binary-exponent-part ud-suffix
 ```
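As an illustration of the two hexadecimal productions added above, the following sketch applies a suffix to hexadecimal floating literals. The suffix `_half` is a hypothetical name chosen for this example, and the snippet assumes a C++17 compiler.

```cpp
// Hypothetical literal operator: halves the value of the literal.
constexpr long double operator""_half(long double v) {
    return v / 2.0L;
}

// hexadecimal-fractional-constant form: 0x1.8p1 is 1.5 * 2^1 = 3.0
static_assert(0x1.8p1_half == 1.5L, "hex fractional constant with ud-suffix");

// hexadecimal-digit-sequence form: 0x3p0 is 3 * 2^0 = 3.0
static_assert(0x3p0_half == 1.5L, "hex digit sequence with ud-suffix");
```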
 ``` bnf
 user-defined-string-literal:
     string-literal ud-suffix
 ``` bnf
 ud-suffix:
     identifier
 ```
+If a token matches both *user-defined-literal* and another *literal*
+kind, it is treated as the latter.
+[*Example 1*:
+`123_km`
+is a *user-defined-literal*, but `12LL` is an *integer-literal*.
+— *end example*]
+The syntactic non-terminal preceding the *ud-suffix* in a
+*user-defined-literal* is taken to be the longest sequence of characters
+that could match that non-terminal.
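The two rules above can be checked with a small sketch. The suffix `_km` comes from the example in the text. Its definition here (kilometres to metres) is an assumption made for illustration.

```cpp
#include <type_traits>

// Hypothetical literal operator: converts kilometres to metres.
constexpr unsigned long long operator""_km(unsigned long long v) {
    return v * 1000ULL;
}

// 123_km matches only user-defined-literal, so the operator is called.
static_assert(123_km == 123000ULL, "123_km is a user-defined-literal");

// 12LL also matches integer-literal, so it is treated as an integer-literal.
static_assert(std::is_same<decltype(12LL), long long>::value,
              "12LL is an ordinary long long literal");

// Longest match: the literal part is 0x1e (30), not 0x1 followed by
// a ud-suffix e_km.
static_assert(0x1e_km == 30000ULL, "the integer-literal part is 0x1e");
```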
 A *user-defined-literal* is treated as a call to a literal operator or
 literal operator template ([[over.literal]]). To determine the form of
 this call for a given *user-defined-literal* *L* with *ud-suffix* *X*,
 the *literal-operator-id* whose literal suffix identifier is *X* is
 ``` cpp
 operator "" X<'c₁', 'c₂', ... 'cₖ'>()
 ```
+where *n* is the source character sequence c₁c₂...cₖ.
+[*Note 1*: The sequence c₁c₂...cₖ can only contain characters from the
+basic source character set. — *end note*]
 If *L* is a *user-defined-floating-literal*, let *f* be the literal
 without its *ud-suffix*. If *S* contains a literal operator with
 parameter type `long double`, the literal *L* is treated as a call of
 the form
 ``` cpp
 operator "" X<'c₁', 'c₂', ... 'cₖ'>()
 ```
+where *f* is the source character sequence c₁c₂...cₖ.
+[*Note 2*: The sequence c₁c₂...cₖ can only contain characters from the
+basic source character set. — *end note*]
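A minimal sketch of the literal operator template form follows. The suffix `_len` is a hypothetical name. It counts the source characters c₁c₂...cₖ that the implementation passes as template arguments, for both integer and floating literals.

```cpp
#include <cstddef>

// Hypothetical literal operator template: returns k, the number of
// source characters in the literal without its ud-suffix.
template <char... Cs>
constexpr std::size_t operator""_len() {
    return sizeof...(Cs);
}

// 123_len is treated as operator""_len<'1', '2', '3'>()
static_assert(123_len == 3, "three source characters");

// The prefix is part of the source character sequence: '0', 'x', '1', 'F'
static_assert(0x1F_len == 4, "four source characters");

// 1.5_len is treated as operator""_len<'1', '.', '5'>()
static_assert(1.5_len == 3, "three source characters");
```

Because no literal operator taking `unsigned long long` or `long double` is declared for `_len`, the template is the form selected in each case.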
 If *L* is a *user-defined-string-literal*, let *str* be the literal
 without its *ud-suffix* and let *len* be the number of code units in
 *str* (i.e., its length excluding the terminating null character). The
 literal *L* is treated as a call of the form
 ``` cpp
+operator "" X(str, len)
 ```
 If *L* is a *user-defined-character-literal*, let *ch* be the literal
 without its *ud-suffix*. *S* shall contain a literal operator (
 [[over.literal]]) whose only parameter has the type of *ch* and the
 literal *L* is treated as a call of the form
 ``` cpp
+operator "" X(ch)
 ```
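The string and character forms can be sketched in the same way. The suffixes `_n` and `_code` are hypothetical names introduced for this example.

```cpp
#include <cstddef>

// Hypothetical string literal operator: receives str and len, returns len.
constexpr std::size_t operator""_n(const char*, std::size_t len) {
    return len;
}

// Hypothetical character literal operator: its only parameter has the
// type of ch, here char.
constexpr int operator""_code(char ch) {
    return static_cast<int>(ch);
}

// "two"_n is treated as operator""_n("two", 3); the terminating null
// character is not counted.
static_assert("two"_n == 3, "len excludes the terminating null");

// 'A'_code is treated as operator""_code('A')
static_assert('A'_code == static_cast<int>('A'), "ch is passed unchanged");
```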
+[*Example 2*:
 ``` cpp
 long double operator "" _w(long double);
 std::string operator "" _w(const char16_t*, std::size_t);
 unsigned operator "" _w(const char*);
 int main() {
   12_w;       // calls operator "" _w("12")
   "two"_w;    // error: no applicable literal operator
 }
 ```
+— *end example*]
 In translation phase 6 ([[lex.phases]]), adjacent string literals are
 concatenated and *user-defined-string-literal*s are considered string
 literals for that purpose. During concatenation, *ud-suffix*es are
 removed and ignored and the concatenation process occurs as described
 in  [[lex.string]]. At the end of phase 6, if a string literal is the
 result of a concatenation involving at least one
 *user-defined-string-literal*, all the participating
 *user-defined-string-literal*s shall have the same *ud-suffix* and that
 suffix is applied to the result of the concatenation.
+[*Example 3*:
 ``` cpp
 int main() {
   L"A" "B" "C"_x; // OK: same as L"ABC"_x
   "P"_x "Q" "R"_y;// error: two different ud-suffixes
 }
 ```
+— *end example*]
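The concatenation rule can also be observed through the length passed to the literal operator. The suffix `_count` below is a hypothetical name used only for this sketch.

```cpp
#include <cstddef>

// Hypothetical string literal operator: returns the length it is given.
constexpr std::size_t operator""_count(const char*, std::size_t len) {
    return len;
}

// The suffix is applied to the result of the concatenation, "abcd".
static_assert("ab" "cd"_count == 4, "suffix applies to the whole string");

// Repeating the same ud-suffix on several participants is allowed.
static_assert("ab"_count "cd"_count == 4, "same ud-suffix on both parts");

// The suffixed literal does not have to come last.
static_assert("ab"_count "cd" "ef" == 6, "three literals, one suffix");
```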
 <!-- Link reference definitions -->
 [basic.fundamental]: basic.md#basic.fundamental
 [basic.link]: basic.md#basic.link
 [basic.lookup.unqual]: basic.md#basic.lookup.unqual
 [basic.stc]: basic.md#basic.stc
 [basic.types]: basic.md#basic.types
 [conv.mem]: conv.md#conv.mem
 [conv.ptr]: conv.md#conv.ptr
 [cpp]: cpp.md#cpp
 [cpp.concat]: cpp.md#cpp.concat
 [cpp.cond]: cpp.md#cpp.cond
 [cpp.include]: cpp.md#cpp.include
 [cpp.stringize]: cpp.md#cpp.stringize
 [dcl.attr.grammar]: dcl.md#dcl.attr.grammar
 [headers]: library.md#headers
 [lex]: #lex
 [lex.bool]: #lex.bool
 [lex.ccon]: #lex.ccon
 [lex.charset]: #lex.charset
 [lex.ppnumber]: #lex.ppnumber
 [lex.pptoken]: #lex.pptoken
 [lex.separate]: #lex.separate
 [lex.string]: #lex.string
 [lex.token]: #lex.token
 [over.literal]: over.md#over.literal
 [tab:alternative.representations]: #tab:alternative.representations
 [tab:alternative.tokens]: #tab:alternative.tokens
+[tab:charname.allowed]: #tab:charname.allowed
+[tab:charname.disallowed]: #tab:charname.disallowed
 [tab:escape.sequences]: #tab:escape.sequences
 [tab:identifiers.special]: #tab:identifiers.special
 [tab:keywords]: #tab:keywords
 [tab:lex.string.concat]: #tab:lex.string.concat
 [tab:lex.type.integer.literal]: #tab:lex.type.integer.literal
 [temp.explicit]: temp.md#temp.explicit
 [temp.names]: temp.md#temp.names
 [^1]: Implementations must behave as if these separate phases occur,
     although in practice different phases might be folded together.
 [^2]: A partial preprocessing token would arise from a source file
 [^4]: The glyphs for the members of the basic source character set are
     intended to identify characters from the subset of ISO/IEC 10646
     which corresponds to the ASCII character set. However, because the
     mapping from source file characters to the source character set
     (described in translation phase 1) is specified as
+ *implementation-defined*, an implementation is required to document
     how the basic source characters are represented in source files.
+[^5]: A sequence of characters resembling a *universal-character-name*
+ in an *r-char-sequence* ([[lex.string]]) does not form a
+ *universal-character-name*.
 [^6]:  These include “digraphs” and additional reserved words. The term
     “digraph” (token consisting of two characters) is not perfectly
     descriptive, since one of the alternative preprocessing-tokens is
     `%:%:` and of course several primary tokens contain two characters.
     might result in an error, be interpreted as the character
     corresponding to the escape sequence, or have a completely different
     meaning, depending on the implementation.
 [^10]: On systems in which linkers cannot accept extended characters, an
+    encoding of the *universal-character-name* may be used in forming
     valid external identifiers. For example, some otherwise unused
     character or sequence of characters may be used to encode the `\u`
+    in a *universal-character-name*. Extended characters may produce a
     long external identifier, but C++ does not place a translation limit
     on significant characters for external identifiers. In C++, upper-
     and lower-case letters are considered different for all identifiers,
     including external identifiers.
 [^12]: The digits `8` and `9` are not octal digits.
 [^13]: They are intended for character sets where a character does not
     fit into a single byte.
+[^14]: Using an escape sequence for a question mark is supported for
+ compatibility with ISO C++14 and ISO C.
