improved documentation

parent 788feadac5
commit e2764bd5a6

12 README.md
@@ -114,10 +114,10 @@ Since, at least as of 2003, Unicode fits into 21 bits, and thus UTF-8 was design
 This is done as described in the following table:

-| # of bytes | # bits for code point | 1st code point | last code point | byte 1 | byte 2 | byte 3 | byte 4 |
-|------------|-----------------------|----------------|------------------|----------|----------|----------|----------|
-| 1 | 7 | U+000000 | U+00007F | 0xxxxxxx | | | |
-| 2 | 11 | U+000080 | U+0007FF | 110xxxxx | 10xxxxxx | | |
-| 3 | 16 | U+000800 | U+00FFFF | 1110xxxx | 10xxxxxx | 10xxxxxx | |
-| 4 | 21 | U+010000 | U+10FFFF | 11110xxx | 10xxxxxx | 10xxxxxx | 10xxxxxx |
+| # of bytes | # bits for code point | 1st code point | last code point | byte 1 | byte 2 | byte 3 | byte 4 |
+|------------|-----------------------|----------------|------------------|------------|------------|------------|------------|
+| 1 | 7 | U+000000 | U+00007F | `0xxxxxxx` | | | |
+| 2 | 11 | U+000080 | U+0007FF | `110xxxxx` | `10xxxxxx` | | |
+| 3 | 16 | U+000800 | U+00FFFF | `1110xxxx` | `10xxxxxx` | `10xxxxxx` | |
+| 4 | 21 | U+010000 | U+10FFFF | `11110xxx` | `10xxxxxx` | `10xxxxxx` | `10xxxxxx` |
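
As an illustration of the bit layout in the table, here is a minimal Python sketch that packs one code point into 1 to 4 bytes. The function name `encode_utf8` is hypothetical and not part of this repository. The sketch only follows the table's ranges and does not add further checks, such as rejecting surrogate code points.

```python
def encode_utf8(code_point: int) -> bytes:
    """Encode one code point using the byte patterns from the table (illustrative sketch)."""
    if code_point < 0 or code_point > 0x10FFFF:
        raise ValueError("code point outside U+000000..U+10FFFF")

    if code_point <= 0x7F:
        # 1 byte, 7 bits: 0xxxxxxx
        return bytes([code_point])

    if code_point <= 0x7FF:
        # 2 bytes, 11 bits: 110xxxxx 10xxxxxx
        return bytes([
            0b11000000 | (code_point >> 6),
            0b10000000 | (code_point & 0b111111),
        ])

    if code_point <= 0xFFFF:
        # 3 bytes, 16 bits: 1110xxxx 10xxxxxx 10xxxxxx
        return bytes([
            0b11100000 | (code_point >> 12),
            0b10000000 | ((code_point >> 6) & 0b111111),
            0b10000000 | (code_point & 0b111111),
        ])

    # 4 bytes, 21 bits: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return bytes([
        0b11110000 | (code_point >> 18),
        0b10000000 | ((code_point >> 12) & 0b111111),
        0b10000000 | ((code_point >> 6) & 0b111111),
        0b10000000 | (code_point & 0b111111),
    ])


# Example: U+20AC falls in the 3-byte row of the table.
print(encode_utf8(0x20AC).hex(" "))  # e2 82 ac
```

Each branch corresponds to one table row: the leading byte carries the length prefix plus the highest bits, and every continuation byte carries the `10` prefix plus the next 6 bits.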
```