[asterisk-bugs] [JIRA] (ASTERISK-29046) pbx: Deadlock when doing a reload, while simultaneously doing an ExtensionState on a pattern match hint that ends up adding an extension
Joshua C. Colp (JIRA)
noreply at issues.asterisk.org
Thu Aug 27 04:50:43 CDT 2020
[ https://issues.asterisk.org/jira/browse/ASTERISK-29046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=251803#comment-251803 ]
Joshua C. Colp edited comment on ASTERISK-29046 at 8/27/20 4:49 AM:
--------------------------------------------------------------------
This deadlock is between two threads due to differing locking order:
{noformat}
Thread 114 (Thread 0x7f9400df4700 (LWP 2958)): -> Has hints locked (in ast_add_hint), but wants contexts
#0 0x00007f9454e8e54d in __lll_lock_wait () from /lib64/libpthread.so.0
#1 0x00007f9454e89eb6 in _L_lock_941 () from /lib64/libpthread.so.0
#2 0x00007f9454e89daf in pthread_mutex_lock () from /lib64/libpthread.so.0
#3 0x000000000053728e in __ast_pthread_mutex_lock ()
#4 0x000000000057fde6 in ast_rdlock_contexts ()
#5 0x0000000000570b28 in ast_hint_extension ()
#6 0x00000000005738e7 in ast_get_hint ()
#7 0x000000000054fac9 in manager_state_cb ()
#8 0x00000000005715cc in execute_state_callback ()
#9 0x0000000000573597 in ast_add_hint ()
#10 0x000000000057cb8b in add_priority ()
#11 0x000000000057d338 in ast_add_extension2_lockopt ()
#12 0x000000000057cbf9 in ast_add_extension2 ()
#13 0x00007f94504bf02f in pbx_load_config () from /usr/lib64/asterisk/modules/pbx_config.so
#14 0x00007f94504bfed6 in pbx_load_module () from /usr/lib64/asterisk/modules/pbx_config.so
#15 0x00007f94504c00ce in reload () from /usr/lib64/asterisk/modules/pbx_config.so
#16 0x0000000000534ea4 in ast_module_reload ()
#17 0x00000000004ce4e0 in handle_core_reload ()
#18 0x00000000004d6703 in ast_cli_command_full ()
#19 0x00000000004d6846 in ast_cli_command_multiple_full ()
#20 0x0000000000453ca7 in netconsole ()
#21 0x00000000005fd81c in dummy_start ()
#22 0x00007f9454e87ea5 in start_thread () from /lib64/libpthread.so.0
#23 0x00007f94541638dd in clone () from /lib64/libc.so.6
Thread 112 (Thread 0x7f940173b700 (LWP 923)): -> Has contexts read locked (in ast_add_extension), but wants hints
#0 0x00007f9454e8e54d in __lll_lock_wait () from /lib64/libpthread.so.0
#1 0x00007f9454e89eb6 in _L_lock_941 () from /lib64/libpthread.so.0
#2 0x00007f9454e89daf in pthread_mutex_lock () from /lib64/libpthread.so.0
#3 0x000000000053728e in __ast_pthread_mutex_lock ()
#4 0x000000000045b293 in __ao2_lock ()
#5 0x000000000057328c in ast_add_hint ()
#6 0x000000000057d625 in ast_add_extension2_lockopt ()
#7 0x000000000057cbf9 in ast_add_extension2 ()
#8 0x000000000057bf15 in ast_add_extension ()
#9 0x000000000057107e in internal_extension_state_extended ()
#10 0x00000000005710ee in ast_extension_state ()
#11 0x000000000054c081 in action_extensionstate ()
#12 0x000000000054df61 in process_message ()
#13 0x000000000054e889 in do_message ()
#14 0x000000000054eccb in session_do ()
#15 0x00000000005ea1e5 in handle_tcptls_connection ()
#16 0x00000000005fd81c in dummy_start ()
#17 0x00007f9454e87ea5 in start_thread () from /lib64/libpthread.so.0
#18 0x00007f94541638dd in clone () from /lib64/libc.so.6
{noformat}
was (Author: jcolp):
This deadlock is between two threads:
{noformat}
Thread 114 (Thread 0x7f9400df4700 (LWP 2958)): -> Has hints locked (in ast_add_hint), but wants contexts
#0 0x00007f9454e8e54d in __lll_lock_wait () from /lib64/libpthread.so.0
#1 0x00007f9454e89eb6 in _L_lock_941 () from /lib64/libpthread.so.0
#2 0x00007f9454e89daf in pthread_mutex_lock () from /lib64/libpthread.so.0
#3 0x000000000053728e in __ast_pthread_mutex_lock ()
#4 0x000000000057fde6 in ast_rdlock_contexts ()
#5 0x0000000000570b28 in ast_hint_extension ()
#6 0x00000000005738e7 in ast_get_hint ()
#7 0x000000000054fac9 in manager_state_cb ()
#8 0x00000000005715cc in execute_state_callback ()
#9 0x0000000000573597 in ast_add_hint ()
#10 0x000000000057cb8b in add_priority ()
#11 0x000000000057d338 in ast_add_extension2_lockopt ()
#12 0x000000000057cbf9 in ast_add_extension2 ()
#13 0x00007f94504bf02f in pbx_load_config () from /usr/lib64/asterisk/modules/pbx_config.so
#14 0x00007f94504bfed6 in pbx_load_module () from /usr/lib64/asterisk/modules/pbx_config.so
#15 0x00007f94504c00ce in reload () from /usr/lib64/asterisk/modules/pbx_config.so
#16 0x0000000000534ea4 in ast_module_reload ()
#17 0x00000000004ce4e0 in handle_core_reload ()
#18 0x00000000004d6703 in ast_cli_command_full ()
#19 0x00000000004d6846 in ast_cli_command_multiple_full ()
#20 0x0000000000453ca7 in netconsole ()
#21 0x00000000005fd81c in dummy_start ()
#22 0x00007f9454e87ea5 in start_thread () from /lib64/libpthread.so.0
#23 0x00007f94541638dd in clone () from /lib64/libc.so.6
Thread 112 (Thread 0x7f940173b700 (LWP 923)): -> Has contexts read locked (in ast_add_extension), but wants hints
#0 0x00007f9454e8e54d in __lll_lock_wait () from /lib64/libpthread.so.0
#1 0x00007f9454e89eb6 in _L_lock_941 () from /lib64/libpthread.so.0
#2 0x00007f9454e89daf in pthread_mutex_lock () from /lib64/libpthread.so.0
#3 0x000000000053728e in __ast_pthread_mutex_lock ()
#4 0x000000000045b293 in __ao2_lock ()
#5 0x000000000057328c in ast_add_hint ()
#6 0x000000000057d625 in ast_add_extension2_lockopt ()
#7 0x000000000057cbf9 in ast_add_extension2 ()
#8 0x000000000057bf15 in ast_add_extension ()
#9 0x000000000057107e in internal_extension_state_extended ()
#10 0x00000000005710ee in ast_extension_state ()
#11 0x000000000054c081 in action_extensionstate ()
#12 0x000000000054df61 in process_message ()
#13 0x000000000054e889 in do_message ()
#14 0x000000000054eccb in session_do ()
#15 0x00000000005ea1e5 in handle_tcptls_connection ()
#16 0x00000000005fd81c in dummy_start ()
#17 0x00007f9454e87ea5 in start_thread () from /lib64/libpthread.so.0
#18 0x00007f94541638dd in clone () from /lib64/libc.so.6
{noformat}
> pbx: Deadlock when doing a reload, while simultaneously doing an ExtensionState on a pattern match hint that ends up adding an extension
> ----------------------------------------------------------------------------------------------------------------------------------------
>
> Key: ASTERISK-29046
> URL: https://issues.asterisk.org/jira/browse/ASTERISK-29046
> Project: Asterisk
> Issue Type: Bug
> Security Level: None
> Components: PBX/General
> Affects Versions: 13.22.0
> Environment: FreePBX V15 with asterisk 13.22.0, Running on Virtual machine.
> 32 GB RAM, 8 Core CPU.Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz
> Reporter: Ramarajan
> Attachments: core-asterisk-running-2020-08-27T00-22-17-0400-brief.txt, core-asterisk-running-2020-08-27T00-22-17-0400-full.txt, core-asterisk-running-2020-08-27T00-22-17-0400-locks.txt, core-asterisk-running-2020-08-27T00-22-17-0400-thread1.txt
>
>
> Asterisk hangs when doing the core reload (apply config in FreePBX).
> This issue occurs very frequently when we do multiple reloads at a time.
> During the problematic time, asterisk not accepting any calls, registrations, and not even responding to any SIP messages.
> The only solution to get it up and running again is to kill the asterisk process and start it over again.
> I have attached the core-dump files for your reference.
--
This message was sent by Atlassian JIRA
(v6.2#6252)
More information about the asterisk-bugs
mailing list