[asterisk-dev] [Code Review] RFC compliant uri and display-name encode/decode

Nick Lewis Nick.Lewis at atltelecom.com
Mon Jan 25 03:27:34 CST 2010


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviewboard.asterisk.org/r/469/#review1405
-----------------------------------------------------------

Ship it!


In terms of getting existing uri decode working I think this is good to go

Ideas for further projects:
(1) Introduce parse_uri wherever rfc3261 abnf specifies SIP-URI / SIPS-URI
(2) Add a compliant-uri global config parameter and a SIP_COMPLIANT_DECODE to all places where rfc3261 requires uri decode but the code currently does not decode even when pedantic is set.
(3) Introduce a new function parse_name-addr for use wherever rfc3261 abnf specifies name-addr. The name-addr abnf appears many times in rfc3261 and in other sip extensions so it is worthwhile having a proven function for this. The function would include the parse_uri and get_calleridname functions and return the requested name-addr parts
(4) Amend the read_to_parts function (which currently does not work correctly in some cases) to use the proven uri and display-name functionality via parse_name-addr
(5) Consider getting all the parts of a header that are required in one go rather than in successive looks. At the moment a portion of a header is only got when it is required e.g. caller name, caller num, domain, tag, etc are required from the to-header for dialog matching, authentication, and dialplan and are got as necessary. This is inefficient because to correctly get any part it is effectively necessary each time to parse the whole header due to the mixture of escaping mechanisms that can appear in a header.

- Nick


On 2010-01-22 12:13:36, David Vossel wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviewboard.asterisk.org/r/469/
> -----------------------------------------------------------
> 
> (Updated 2010-01-22 12:13:36)
> 
> 
> Review request for Asterisk Developers.
> 
> 
> Summary
> -------
> 
> Parts of this patch were posted in separate reviews a few weeks ago.  During the discussion of those patches I took down the reviews as I felt the code was not complete.  This review is a combination of the two uri encode/decode patches, a complete rewrite of the get_calleridname() function, and the addition of two new unit tests.  These changes are in response to (issue #16299) and are a compilation of code written by both wdoekes and myself.
> 
> ------Changes------
> 
> 1.  URI Encoding
> 
> This patch changes ast_uri_encode()'s behavior when doreserved is enabled.  Previously when doreserved was enabled only a small set of reserved characters were encoded.  This set consisted primarily of the reserved characters defined in RFC3261 section 25.1, but contained other characters as well.  Rather than only escaping the reserved set, doreserved now escapes all characters not within the unreserved set as defined by RFC 3261 and RFC 2396.  Also, the 'doreserved' variable has been renamed to 'do_special_char' to avoid confusion.
> 
> When doreserved is not enabled, the previous logic of only encoding the characters <= 0x1F and > 0x7F remains, with one exception: the '%' character must now always be encoded, as it signifies a hex-escaped character during the decode process.
> 
> In RFC 3261 and RFC 2396 the unreserved character set consists of all alphanumeric characters plus the small number of characters in the mark set:
> mark        =  "-" / "_" / "." / "!" / "~" / "*" / "'" / "(" / ")"
> unreserved  =  alphanum / mark
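
The encoding rules described above can be sketched roughly as follows. This is only an illustration of the unreserved-set check, not the code from the patch: the names is_unreserved() and sketch_uri_encode(), the buffer-size argument and the truncate-on-overflow behaviour are all assumptions made for the example.

```c
#include <assert.h>
#include <ctype.h>
#include <string.h>

/* Hypothetical helper: non-zero if c is in the "unreserved" set
 * (alphanum / mark) quoted above. */
static int is_unreserved(unsigned char c)
{
	if (c == '\0' || c > 0x7F) {
		return 0;
	}
	return isalnum(c) || strchr("-_.!~*'()", c) != NULL;
}

/* Hypothetical stand-in for the encoder, not ast_uri_encode() itself.
 * With do_special_char set, every byte outside the unreserved set is
 * escaped. Without it, only '%', bytes <= 0x1F and bytes > 0x7F are
 * escaped. Output is truncated rather than overflowed. */
static char *sketch_uri_encode(const char *src, char *dst, size_t dstlen,
	int do_special_char)
{
	static const char hex[] = "0123456789ABCDEF";
	size_t out = 0;

	assert(src != NULL && dst != NULL && dstlen > 0);

	for (; *src; src++) {
		unsigned char c = (unsigned char) *src;
		int escape = c == '%' || c <= 0x1F || c > 0x7F
			|| (do_special_char && !is_unreserved(c));

		if (escape) {
			if (out + 3 >= dstlen) {
				break; /* no room for "%XX" plus the terminator */
			}
			dst[out++] = '%';
			dst[out++] = hex[c >> 4];
			dst[out++] = hex[c & 0x0F];
		} else {
			if (out + 1 >= dstlen) {
				break; /* no room for the byte plus the terminator */
			}
			dst[out++] = (char) c;
		}
	}
	dst[out] = '\0';
	return dst;
}
```

With do_special_char set, a space and an '@' are escaped while the mark characters pass through untouched; with it cleared, only '%' and the out-of-range bytes are escaped.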
> 
> 2. URI Decoding: Break up URI before decode.
> 
> In chan_sip.c ast_uri_decode is called on the entire URI instead of its individual parts after it is parsed.  This is not good as ast_uri_decode can introduce special characters back into the URI which can mess up parsing.  This patch resolves this by not decoding a URI until parsing is completely done.  There are many instances where we check to see if pedantic checking is enabled before we decode a URI.  In these cases a new macro, SIP_PEDANTIC_DECODE, is used on the individual parsed segments of the URI rather than constantly putting if (pedantic) { decode() } checks everywhere in the code.  In the areas where ast_uri_decode is not dependent upon pedantic checking this macro is not used, but decoding is still moved to each individual part of the URI.  The only behavior that should change from this patch is the time at which decoding occurs.
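
To illustrate why the order matters, here is a minimal sketch assuming a simplified "sip:user@host" URI. Everything in it is hypothetical: sketch_uri_decode() stands in for ast_uri_decode(), SKETCH_PEDANTIC_DECODE and the sketch_pedantic flag only imitate the idea behind SIP_PEDANTIC_DECODE, and the splitter is far simpler than parse_uri().

```c
#include <assert.h>
#include <ctype.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical flag standing in for the pedantic setting. */
static int sketch_pedantic = 1;

/* Hypothetical in-place percent-decoder. */
static void sketch_uri_decode(char *s)
{
	char *o = s;

	assert(s != NULL);
	while (*s) {
		if (s[0] == '%' && isxdigit((unsigned char) s[1])
			&& isxdigit((unsigned char) s[2])) {
			char hexbyte[3] = { s[1], s[2], '\0' };
			*o++ = (char) strtol(hexbyte, NULL, 16);
			s += 3;
		} else {
			*o++ = *s++;
		}
	}
	*o = '\0';
}

/* Decode one already-parsed segment, and only when pedantic is on. */
#define SKETCH_PEDANTIC_DECODE(str) do { \
	if (sketch_pedantic && (str) && *(str)) { \
		sketch_uri_decode(str); \
	} \
} while (0)

/* Split "sip:user@host" in place first, then decode each part.
 * Returns 0 on success, -1 if the URI does not have that shape. */
static int sketch_split_then_decode(char *uri, char **user, char **host)
{
	char *at;

	if (strncmp(uri, "sip:", 4) != 0) {
		return -1;
	}
	uri += 4;
	at = strchr(uri, '@');
	if (!at) {
		return -1;
	}
	*at = '\0';
	*user = uri;
	*host = at + 1;
	SKETCH_PEDANTIC_DECODE(*user);
	SKETCH_PEDANTIC_DECODE(*host);
	return 0;
}
```

Given "sip:al%40ice@example.com", splitting first yields the user "al@ice" and the host "example.com". Decoding the whole string first would produce "sip:al@ice@example.com", and a split on the first '@' would then cut the user part short.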
> 
> Since I had to look over every place URI parsing occurs to create this patch, I found several places where we use duplicate code for parsing.  To consolidate the code, those areas have been updated to use the parse_uri() function where possible.
> 
> 3. SIP display-name decoding according to RFC3261 section 25.
> 
> To properly decode the display-name portion of a FROM header, chan_sip's get_calleridname() function required a complete re-write.  More information about this change can be found in the comments at the beginning of this function.
> 
> 4. Unit Tests.
> 
> Unit tests for ast_uri_encode, ast_uri_decode, and get_calleridname() have been written.  This involved the addition of the test_utils.c file for testing the utils api.
> 
> 
> Diffs
> -----
> 
>   /trunk/channels/chan_sip.c 242402 
>   /trunk/include/asterisk/utils.h 242402 
>   /trunk/main/test.c 242402 
>   /trunk/main/test_utils.c PRE-CREATION 
>   /trunk/main/utils.c 242402 
> 
> Diff: https://reviewboard.asterisk.org/r/469/diff
> 
> 
> Testing
> -------
> 
> - new unit tests pass
> - verified SIP registrations, calls, and transfers work correctly within my test environment
> 
> 
> Thanks,
> 
> David
> 
>



