<div dir="ltr"><div dir="ltr">On Mon, Mar 22, 2021 at 5:54 PM Dan Cropp <<a href="mailto:dan@amtelco.com">dan@amtelco.com</a>> wrote:<br></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div lang="EN-US" style="overflow-wrap: break-word;">
<div class="gmail-m_-4091100702629046762WordSection1">
<p class="MsoNormal">Thank you Ben.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Looking at the TTS, would that language property be language and country? Example en-US, en-GB, etc.<u></u><u></u></p>
<p class="MsoNormal">Will we use SSML to specify a specific voice for the language? Examplt, Amazon Polly en-US language supports 4 female and 4 male voices. Or might this be an additional parameter (similar to the language)? </p></div></div></blockquote><div><br></div><div>These are arbitrary example values. The values in app_config aren't defined within the protocol, they are opaque:</div><div><br></div><div><p style="margin:10px 0px 0px;padding:0px;color:rgb(51,51,51);font-family:"Helvetica Neue",Helvetica,Arial,sans-serif;font-size:14px">The app_config section contains arbitrary configuration options and are not defined by this protocol. They will be able to be set by the user, and then consumed by the external application.</p><br class="gmail-Apple-interchange-newline"></div><div>Depending on the external applications and what develops we could probably standardize some.</div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div lang="EN-US" style="overflow-wrap: break-word;"><div class="gmail-m_-4091100702629046762WordSection1"><p class="MsoNormal">
<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Will there be a mechanism to stop the TTS on one stream when speech to text detects someone speaking? Many people will interrupt automated phone systems. Example, the system answers the call and plays something, a person familiar with
the system will start speaking and they expect the TTS/prompts to stop playing.</p></div></div></blockquote><div><br></div><div>This is a good point. It's probably something we should add to the protocol, so it can communicate back 1. If it can do it and 2. When it occurs. We can use the same thing func_talkdetect uses as a fallback if the external application doesn't support it. The core speech stuff itself already supports handling when they speak and stopping playback.</div></div><div><br></div>-- <br><div dir="ltr" class="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div style="font-family:tahoma,sans-serif"><div><font color="#073763">Joshua C. Colp</font></div><div><font color="#073763">Asterisk Technical Lead</font></div><div><font color="#073763">Sangoma Technologies</font></div><div><font color="#073763">Check us out at <a href="http://www.sangoma.com/" target="_blank">www.sangoma.com</a> and <a href="http://www.asterisk.org/" target="_blank">www.asterisk.org</a></font></div></div></div></div></div></div></div></div></div></div></div></div>