improved documentation

master
Charles Iliya Krempeaux 2023-08-16 05:31:48 -07:00
parent 788feadac5
commit e2764bd5a6
1 changed files with 6 additions and 6 deletions

View File

@ -114,10 +114,10 @@ Since, at least as of 2003, Unicode fits into 21 bits, and thus UTF-8 was design
This is done as described in the following table:
| # of bytes | # bits for code point | 1st code point | last code point | byte 1 | byte 2 | byte 3 | byte 4 |
|------------|-----------------------|----------------|------------------|----------|----------|----------|----------|
| 1 | 7 | U+000000 | U+00007F | 0xxxxxxx | | | |
| 2 | 11 | U+000080 | U+0007FF | 110xxxxx | 10xxxxxx | | |
| 3 | 16 | U+000800 | U+00FFFF | 1110xxxx | 10xxxxxx | 10xxxxxx | |
| 4 | 21 | U+010000 | U+10FFFF | 11110xxx | 10xxxxxx | 10xxxxxx | 10xxxxxx |
| # of bytes | # bits for code point | 1st code point | last code point | byte 1 | byte 2 | byte 3 | byte 4 |
|------------|-----------------------|----------------|------------------|------------|------------|------------|------------|
| 1 | 7 | U+000000 | U+00007F | `0xxxxxxx` | | | |
| 2 | 11 | U+000080 | U+0007FF | `110xxxxx` | `10xxxxxx` | | |
| 3 | 16 | U+000800 | U+00FFFF | `1110xxxx` | `10xxxxxx` | `10xxxxxx` | |
| 4 | 21 | U+010000 | U+10FFFF | `11110xxx` | `10xxxxxx` | `10xxxxxx` | `10xxxxxx` |
```