improved documentation

master
Charles Iliya Krempeaux 2023-08-16 05:31:48 -07:00
parent 788feadac5
commit e2764bd5a6
1 changed files with 6 additions and 6 deletions

@@ -115,9 +115,9 @@ Since, at least as of 2003, Unicode fits into 21 bits, and thus UTF-8 was design
 This is done as described in the following table:
-| # of bytes | # bits for code point | 1st code point | last code point  | byte 1   | byte 2   | byte 3   | byte 4   |
-|------------|-----------------------|----------------|------------------|----------|----------|----------|----------|
-| 1          | 7                     | U+000000       | U+00007F         | 0xxxxxxx |          |          |          |
-| 2          | 11                    | U+000080       | U+0007FF         | 110xxxxx | 10xxxxxx |          |          |
-| 3          | 16                    | U+000800       | U+00FFFF         | 1110xxxx | 10xxxxxx | 10xxxxxx |          |
-| 4          | 21                    | U+010000       | U+10FFFF         | 11110xxx | 10xxxxxx | 10xxxxxx | 10xxxxxx |
+| # of bytes | # bits for code point | 1st code point | last code point  | byte 1     | byte 2     | byte 3     | byte 4     |
+|------------|-----------------------|----------------|------------------|------------|------------|------------|------------|
+| 1          | 7                     | U+000000       | U+00007F         | `0xxxxxxx` |            |            |            |
+| 2          | 11                    | U+000080       | U+0007FF         | `110xxxxx` | `10xxxxxx` |            |            |
+| 3          | 16                    | U+000800       | U+00FFFF         | `1110xxxx` | `10xxxxxx` | `10xxxxxx` |            |
+| 4          | 21                    | U+010000       | U+10FFFF         | `11110xxx` | `10xxxxxx` | `10xxxxxx` | `10xxxxxx` |
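To make the bit layout in the table concrete, here is a minimal sketch in Go of an encoder that follows it row by row. The function name `encodeUTF8` is hypothetical and is not taken from this commit; Go is used purely for illustration. The sketch assumes the caller passes a Unicode scalar value, so it does not reject surrogates (U+D800 to U+DFFF) as a strict encoder would.

```go
package main

import "fmt"

// encodeUTF8 is a hypothetical helper (not part of the documented project).
// It packs a code point into 1 to 4 bytes using the bit layout from the table
// and returns nil for values above U+10FFFF.
// Assumption: surrogates (U+D800 to U+DFFF) are not filtered out here.
func encodeUTF8(cp uint32) []byte {
	switch {
	case cp <= 0x7F: // 7 bits: 0xxxxxxx
		return []byte{byte(cp)}
	case cp <= 0x7FF: // 11 bits: 110xxxxx 10xxxxxx
		return []byte{
			0xC0 | byte(cp>>6),
			0x80 | byte(cp&0x3F),
		}
	case cp <= 0xFFFF: // 16 bits: 1110xxxx 10xxxxxx 10xxxxxx
		return []byte{
			0xE0 | byte(cp>>12),
			0x80 | byte((cp>>6)&0x3F),
			0x80 | byte(cp&0x3F),
		}
	case cp <= 0x10FFFF: // 21 bits: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
		return []byte{
			0xF0 | byte(cp>>18),
			0x80 | byte((cp>>12)&0x3F),
			0x80 | byte((cp>>6)&0x3F),
			0x80 | byte(cp&0x3F),
		}
	}
	return nil // outside the 21-bit range covered by the table
}

func main() {
	// One sample code point per table row.
	for _, cp := range []uint32{0x41, 0xE9, 0x20AC, 0x1F600} {
		fmt.Printf("U+%06X -> % X\n", cp, encodeUTF8(cp))
	}
}
```

Each `case` corresponds to one row of the table: the leading byte carries the length marker plus the highest bits, and every continuation byte carries six further bits behind the `10` prefix.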
``` ```