diff --git a/doc/utf8cpp.html b/doc/utf8cpp.html index 3f04ce2..c0a89a2 100644 --- a/doc/utf8cpp.html +++ b/doc/utf8cpp.html @@ -441,7 +441,9 @@ assert (w == twochars); This function has two purposes: one is two iterate backwards through a UTF-8 encoded string. Note that it is usually a better idea to iterate forward instead, since utf8::next is faster. The second purpose is to find a beginning - of a UTF-8 sequence if we have a random position within a string. + of a UTF-8 sequence if we have a random position within a string. Note that in that + case utf8::prior may not detect an invalid UTF-8 sequence in some scenarios: + for instance if there are superfluous trail octets, it will just skip them.

it will typically point to the beginning of @@ -451,10 +453,12 @@ assert (w == twochars); beginning with that octet is decoded to a 32 bit representation and returned.

- In case pass_end is reached before a UTF-8 lead octet is hit, or if an + In case start is reached before a UTF-8 lead octet is hit, or if an invalid UTF-8 sequence is started by the lead octet, an invalid_utf8 exception is thrown.

+

In case start equals it, a not_enough_room + exception is thrown.

utf8::previous

@@ -512,7 +516,7 @@ assert (w == twochars); beginning with that octet is decoded to a 32 bit representation and returned.

- In case pass_end is reached before a UTF-8 lead octet is hit, or if an + In case pass_start is reached before a UTF-8 lead octet is hit, or if an invalid UTF-8 sequence is started by the lead octet, an invalid_utf8 exception is thrown