[Asterisk-code-review] res pjsip: Fix deadlock when sending out-of-dialog requests. (asterisk[master])
Joshua Colp
asteriskteam at digium.com
Thu Oct 8 13:12:23 CDT 2015
Joshua Colp has submitted this change and it was merged.
Change subject: res_pjsip: Fix deadlock when sending out-of-dialog requests.
......................................................................
res_pjsip: Fix deadlock when sending out-of-dialog requests.
The struct send_request_wrapper has a pjsip lock associated with it that
is created non-recursive. There is a code path for the struct
send_request_wrapper lock that will attempt to lock it recursively. The
reporter's deadlock showed that the thread calling endpt_send_request()
deadlocked itself right after the wrapper object got created.
Out-of-dialog requests such as MESSAGE, qualify OPTIONS, and unsolicited
MWI NOTIFY messages can hit this deadlock.
* Replaced the struct send_request_wrapper pjsip lock with the mutex lock
that can come with an ao2 object since all of Asterisk's mutexes are
recursive. Benefits include removal of code maintaining the pjsip
non-recursive lock since ao2 objects already know how to maintain their
own lock and the lock will show up in the CLI "core show locks" output.
ASTERISK-25435 #close
Reported by: Dmitriy Serov
Change-Id: I458e131dd1b9816f9e963f796c54136e9e84322d
---
M res/res_pjsip.c
1 file changed, 8 insertions(+), 26 deletions(-)
Approvals:
Anonymous Coward #1000019: Verified
Matt Jordan: Looks good to me, but someone else must approve
Joshua Colp: Looks good to me, approved
diff --git a/res/res_pjsip.c b/res/res_pjsip.c
index 3e93b6f..5fc5990 100644
--- a/res/res_pjsip.c
+++ b/res/res_pjsip.c
@@ -2958,8 +2958,6 @@
pj_timer_entry *timeout_timer;
/*! Original timeout. */
pj_int32_t timeout;
- /*! Timeout/cleanup lock. */
- pj_mutex_t *lock;
/*! The transmit data. */
pjsip_tx_data *tdata;
};
@@ -2985,7 +2983,7 @@
ast_debug(2, "%p: PJSIP tsx response received\n", req_wrapper);
}
- pj_mutex_lock(req_wrapper->lock);
+ ao2_lock(req_wrapper);
/* It's possible that our own timer was already processing while
* we were waiting on the lock so check the timer id. If it's
@@ -3026,7 +3024,7 @@
req_wrapper->cb_called = 1;
ast_debug(2, "%p: Callbacks executed\n", req_wrapper);
}
- pj_mutex_unlock(req_wrapper->lock);
+ ao2_unlock(req_wrapper);
ao2_ref(req_wrapper, -1);
}
@@ -3043,12 +3041,12 @@
ast_debug(2, "%p: Internal tsx timer expired after %d msec\n",
req_wrapper, req_wrapper->timeout);
- pj_mutex_lock(req_wrapper->lock);
+ ao2_lock(req_wrapper);
/* If the id is not TIMEOUT_TIMER2 then the timer was cancelled above
* while the lock was being held so just clean up.
*/
if (entry->id != TIMEOUT_TIMER2) {
- pj_mutex_unlock(req_wrapper->lock);
+ ao2_unlock(req_wrapper);
ast_debug(3, "%p: Timeout already handled\n", req_wrapper);
ao2_ref(req_wrapper, -1);
return;
@@ -3066,7 +3064,7 @@
ast_debug(2, "%p: Callbacks executed\n", req_wrapper);
}
- pj_mutex_unlock(req_wrapper->lock);
+ ao2_unlock(req_wrapper);
ao2_ref(req_wrapper, -1);
}
@@ -3074,7 +3072,6 @@
{
struct send_request_wrapper *req_wrapper = obj;
- pj_mutex_destroy(req_wrapper->lock);
pjsip_tx_data_dec_ref(req_wrapper->tdata);
ast_debug(2, "%p: wrapper destroyed\n", req_wrapper);
}
@@ -3087,8 +3084,7 @@
pjsip_endpoint *endpt = ast_sip_get_pjsip_endpoint();
/* Create wrapper to detect if the callback was actually called on an error. */
- req_wrapper = ao2_alloc_options(sizeof(*req_wrapper), send_request_wrapper_destructor,
- AO2_ALLOC_OPT_LOCK_NOLOCK);
+ req_wrapper = ao2_alloc(sizeof(*req_wrapper), send_request_wrapper_destructor);
if (!req_wrapper) {
pjsip_tx_data_dec_ref(tdata);
return PJ_ENOMEM;
@@ -3100,25 +3096,11 @@
req_wrapper->callback = cb;
req_wrapper->timeout = timeout;
req_wrapper->timeout_timer = NULL;
- req_wrapper->lock = NULL;
req_wrapper->tdata = tdata;
/* Add a reference to tdata. The wrapper destructor cleans it up. */
pjsip_tx_data_add_ref(tdata);
- ret_val = pj_mutex_create_simple(tdata->pool, "tsx_timeout", &req_wrapper->lock);
- if (ret_val != PJ_SUCCESS) {
- char errmsg[PJ_ERR_MSG_SIZE];
- pj_strerror(ret_val, errmsg, sizeof(errmsg));
- ast_log(LOG_ERROR, "Error %d '%s' sending %.*s request to endpoint %s\n",
- (int) ret_val, errmsg, (int) pj_strlen(&tdata->msg->line.req.method.name),
- pj_strbuf(&tdata->msg->line.req.method.name),
- endpoint ? ast_sorcery_object_get_id(endpoint) : "<unknown>");
- pjsip_tx_data_dec_ref(tdata);
- ao2_ref(req_wrapper, -1);
- return PJ_ENOMEM;
- }
-
- pj_mutex_lock(req_wrapper->lock);
+ ao2_lock(req_wrapper);
if (timeout > 0) {
pj_time_val timeout_timer_val = { timeout / 1000, timeout % 1000 };
@@ -3180,7 +3162,7 @@
ao2_ref(req_wrapper, -1);
}
}
- pj_mutex_unlock(req_wrapper->lock);
+ ao2_unlock(req_wrapper);
ao2_ref(req_wrapper, -1);
return ret_val;
}
--
To view, visit https://gerrit.asterisk.org/1413
To unsubscribe, visit https://gerrit.asterisk.org/settings
Gerrit-MessageType: merged
Gerrit-Change-Id: I458e131dd1b9816f9e963f796c54136e9e84322d
Gerrit-PatchSet: 1
Gerrit-Project: asterisk
Gerrit-Branch: master
Gerrit-Owner: Richard Mudgett <rmudgett at digium.com>
Gerrit-Reviewer: Anonymous Coward #1000019
Gerrit-Reviewer: Joshua Colp <jcolp at digium.com>
Gerrit-Reviewer: Matt Jordan <mjordan at digium.com>
More information about the asterisk-code-review
mailing list