[Asterisk-Dev] IAX spec: Text formats and character sets
Kevin P. Fleming
kpfleming at digium.com
Fri Apr 29 17:13:29 MST 2005
Michael Giagnocavo wrote:
> Hmm, you're right. That's doesn't look bad at all.
>
> But... what about for comparisons and other Unicode operations? Do the
> libraries available support some UTF-8 version of strcmp, strchr,
> strcasecmp, etc.?
>
Some of them are easy (strcmp, for example). Most of them are harder,
because they either need to know character boundaries, or need case
mappings (strcasecmp, for example). Any function that searches for a
'char' in a string also won't work if the character being searched for
is a multi-byte one.
I think it's safe to document that the on-wire format is UTF-8, but that
the current implementations only support the single-byte subset of
UTF-8. Any implementation is free to be extended to fully support the
entire UTF-8 character space, providing suitable libraries can be found
(or written).
More information about the asterisk-dev
mailing list