[asterisk-bugs] [Asterisk 0005208]: [patch] SayAlpha does not read special characteres such as ", " and "/"
noreply at bugs.digium.com
noreply at bugs.digium.com
Fri Aug 24 11:26:02 CDT 2007
A NOTE has been added to this issue.
======================================================================
http://bugs.digium.com/view.php?id=5208
======================================================================
Reported By: leoburd
Assigned To: murf
======================================================================
Project: Asterisk
Issue ID: 5208
Category: Core/General
Reproducibility: always
Severity: minor
Priority: normal
Status: assigned
Asterisk Version: SVN
SVN Branch (only for SVN checkouts, not tarball releases): trunk
SVN Revision (number only!):
Disclaimer on File?: No
Request Review:
======================================================================
Date Submitted: 09-13-2005 06:41 CDT
Last Modified: 08-24-2007 11:26 CDT
======================================================================
Summary: [patch] SayAlpha does not read special characteres
such as "," and "/"
Description:
SayAlpha does not read common special characteres that are associated with
email addresses and URLs. It would be great if it could read characteres
such as ":" ";" "_" "/" ","
Thanks,
Leo
======================================================================
Relationships ID Summary
----------------------------------------------------------------------
related to 0010083 SayAlpha() and SayPhonetic() fail when ...
======================================================================
----------------------------------------------------------------------
murf - 08-24-07 11:26
----------------------------------------------------------------------
Thought I'd post my research on this issue; Kevin Fleming (who would OK
digium spending on this) doesn't think there is demand enough for the
following chars, to justify the expense. So, we need to narrow down the
list to just what would be really be minimally necessary....
--------------
Here is what I've gathered together for translation so far:
We need to have the following words recorded, so that SayAlpha will be
able to handle all the characters in the ASCII set.
----------
Already available:
----------
We already have these chars available in the digits/ and letters/
directories:
char (ascii code) "phrase(dir)"
(20) "space(letters)"
!(21) "exclamation point(letters)"
#(23) "pound(digits)"
$(24) "dollar(letters)"
*(2A) "star(digits)"
+(2B) "plus(letters)"
-(2D) "minus(digits)" "dash(letters)"
.(2E) "dot(letters)"
/(2F) "slash(letters)"
=(3D) "equals(letters)"
@(40) "at(digits+letters)"
Since the above are already recorded in all 3 languages, we do not need
to repeat this work! I'm showing them here to calm the worries of anyone
who thinks we are missing something!
----------
Need to add:
----------
ENGLISH
"(22) "double quote"
#(23) "hash" (this one is optional for (es,fr), as
"pound"
is already recorded in all 3 languages.)
%(25) "percent"
&(26) "ampersand"
'(27) "single quote"
((28) "left parenthesis"
)(29) "right parenthesis"
*(2A) "Asterisk"
,(2C) "comma"
:(3A) "colon"
;(3B) "semicolon"
<(3C) "less-than"
>(3E) "greater-than"
?(3F) "question mark"
[(5B) "left square brace"
\(5C) "back-slash"
](5D) "right square brace"
^(5E) "caret"
_(5F) "underscore"
`(60) "back-quote"
{(7B) "left curly brace"
|(7C) "pipe"
}(7D) "right curly brace"
~(7E) "tilde"
(7F) "DEL"
"control" (a prefix for var. chars to give "Control A" for
ascii (01), etc.)
"capital" (a prefix before a letter, to differentiate A from a.)
"small" (a prefix before a letter, to differentiate a from A.)
Also, for es, fr, if there are regionally popular names for any of these
character names, as, for example, "hash", and "pound", for US/GB, you
may wish to add these multiple translations, and we can offer the
variants.
Also, if we can develop a short list of prefixes/suffixes for the
popular 8859-1 characters used in (es,fr), (for example "umlaut", if the
language were German), we can pronounce these characters also in (es,
fr), beyond the ascii 1-127 set...
The following characters will allow the rest of the 8859-1 characters to
be pronounced in English; some variations may be necessary for Spanish
and French! To hold down the cost, perhaps we can eliminate the
characters that would not be used in French or Spanish!
"inverted" (as in: inverted exclamation point, inverted question
mark)
"sign" (as in: cent sign, not sign)
"cent" (as in: cent sign)
"currency" (as in: currency sign)
"yen" (as in: yen sign)
"broken bar"
"section" (as in section sign)
"diaeresis"
"copyright" (as in copyright sign)
"feminine" (as in feminine ordinal indicator)
"masculine" (as in masculine ordinal indicator)
"ordinal indicator"
"left double-angle"
"right double-angle"
"not" (as in: not sign)
"soft hyphen"
"registered" (as in: registered sign)
"macron"
"degree" (as in: degree sign)
"plus-minus" (as in: plus-minus sign)
"superscript" (as in: superscript two, superscript three)
"acute accent"
"micro" (as in: micro sign, or the greek letter mu)
"pilcrow" (as in: pilcrow sign)
"middle" (as in: middle dot)
"cedilla"
"quarter"
"half"
"three-quarters"
"with" (as in: Capital A with grave)
"grave" (as in: Capital A with grave)
"acute" (as in: Capital A with acute)
"circumflex" (as in: Capital A with circumflex)
"ring above" (as in: Capital A with ring above)
"Latin Letter AE" (as in: Capital Latin Letter AE)
"Latin Letter Eth" (as in: Capital Latin Letter Eth)
"multiplication" (as in: multiplication sign)
"division" (as in: division sign)
"stroke" (as in: SMALL LETTER O WITH STROKE)
"thorn" (as in: Latin Letter Thorn)
"sharp" (as in: LATIN SMALL LETTER SHARP S)
(for reference, I provide a table of the characters in 8859-1 below)
Oct Dec Hex Char Description
--------------------------------------------------------------------
240 160 A0 NO-BREAK SPACE
241 161 A1 ¡ INVERTED EXCLAMATION MARK
242 162 A2 ¢ CENT SIGN
243 163 A3 £ POUND SIGN
244 164 A4 ¤ CURRENCY SIGN
245 165 A5 ¥ YEN SIGN
246 166 A6 ¦ BROKEN BAR
247 167 A7 § SECTION SIGN
250 168 A8 ¨ DIAERESIS
251 169 A9 © COPYRIGHT SIGN
252 170 AA ª FEMININE ORDINAL INDICATOR
253 171 AB « LEFT-POINTING DOUBLE ANGLE QUOTATION
MARK
254 172 AC ¬ NOT SIGN
255 173 AD SOFT HYPHEN
256 174 AE ® REGISTERED SIGN
257 175 AF ¯ MACRON
260 176 B0 ° DEGREE SIGN
261 177 B1 ± PLUS-MINUS SIGN
262 178 B2 ² SUPERSCRIPT TWO
263 179 B3 ³ SUPERSCRIPT THREE
264 180 B4 ´ ACUTE ACCENT
265 181 B5 µ MICRO SIGN
266 182 B6 ¶ PILCROW SIGN
267 183 B7 · MIDDLE DOT
270 184 B8 ¸ CEDILLA
271 185 B9 ¹ SUPERSCRIPT ONE
272 186 BA º MASCULINE ORDINAL INDICATOR
273 187 BB » RIGHT-POINTING DOUBLE ANGLE QUOTATION
MARK
274 188 BC ¼ VULGAR FRACTION ONE QUARTER
275 189 BD ½ VULGAR FRACTION ONE HALF
276 190 BE ¾ VULGAR FRACTION THREE QUARTERS
277 191 BF ¿ INVERTED QUESTION MARK
300 192 C0 À LATIN CAPITAL LETTER A WITH GRAVE
301 193 C1 Á LATIN CAPITAL LETTER A WITH ACUTE
302 194 C2 Â LATIN CAPITAL LETTER A WITH CIRCUMFLEX
303 195 C3 Ã LATIN CAPITAL LETTER A WITH TILDE
304 196 C4 Ä LATIN CAPITAL LETTER A WITH DIAERESIS
305 197 C5 Å LATIN CAPITAL LETTER A WITH RING ABOVE
306 198 C6 Æ LATIN CAPITAL LETTER AE
307 199 C7 Ç LATIN CAPITAL LETTER C WITH CEDILLA
310 200 C8 È LATIN CAPITAL LETTER E WITH GRAVE
311 201 C9 É LATIN CAPITAL LETTER E WITH ACUTE
312 202 CA Ê LATIN CAPITAL LETTER E WITH CIRCUMFLEX
313 203 CB Ë LATIN CAPITAL LETTER E WITH DIAERESIS
314 204 CC Ì LATIN CAPITAL LETTER I WITH GRAVE
315 205 CD Í LATIN CAPITAL LETTER I WITH ACUTE
316 206 CE Î LATIN CAPITAL LETTER I WITH CIRCUMFLEX
317 207 CF Ï LATIN CAPITAL LETTER I WITH DIAERESIS
320 208 D0 Ð LATIN CAPITAL LETTER ETH
321 209 D1 Ñ LATIN CAPITAL LETTER N WITH TILDE
322 210 D2 Ò LATIN CAPITAL LETTER O WITH GRAVE
323 211 D3 Ó LATIN CAPITAL LETTER O WITH ACUTE
324 212 D4 Ô LATIN CAPITAL LETTER O WITH CIRCUMFLEX
325 213 D5 Õ LATIN CAPITAL LETTER O WITH TILDE
326 214 D6 Ö LATIN CAPITAL LETTER O WITH DIAERESIS
327 215 D7 × MULTIPLICATION SIGN
330 216 D8 Ø LATIN CAPITAL LETTER O WITH STROKE
331 217 D9 Ù LATIN CAPITAL LETTER U WITH GRAVE
332 218 DA Ú LATIN CAPITAL LETTER U WITH ACUTE
333 219 DB Û LATIN CAPITAL LETTER U WITH CIRCUMFLEX
334 220 DC Ü LATIN CAPITAL LETTER U WITH DIAERESIS
335 221 DD Ý LATIN CAPITAL LETTER Y WITH ACUTE
336 222 DE Þ LATIN CAPITAL LETTER THORN
337 223 DF ß LATIN SMALL LETTER SHARP S
340 224 E0 à LATIN SMALL LETTER A WITH GRAVE
341 225 E1 á LATIN SMALL LETTER A WITH ACUTE
342 226 E2 â LATIN SMALL LETTER A WITH CIRCUMFLEX
343 227 E3 ã LATIN SMALL LETTER A WITH TILDE
344 228 E4 ä LATIN SMALL LETTER A WITH DIAERESIS
345 229 E5 å LATIN SMALL LETTER A WITH RING ABOVE
346 230 E6 æ LATIN SMALL LETTER AE
347 231 E7 ç LATIN SMALL LETTER C WITH CEDILLA
350 232 E8 è LATIN SMALL LETTER E WITH GRAVE
351 233 E9 é LATIN SMALL LETTER E WITH ACUTE
352 234 EA ê LATIN SMALL LETTER E WITH CIRCUMFLEX
353 235 EB ë LATIN SMALL LETTER E WITH DIAERESIS
354 236 EC ì LATIN SMALL LETTER I WITH GRAVE
355 237 ED í LATIN SMALL LETTER I WITH ACUTE
356 238 EE î LATIN SMALL LETTER I WITH CIRCUMFLEX
357 239 EF ï LATIN SMALL LETTER I WITH DIAERESIS
360 240 F0 ð LATIN SMALL LETTER ETH
361 241 F1 ñ LATIN SMALL LETTER N WITH TILDE
362 242 F2 ò LATIN SMALL LETTER O WITH GRAVE
363 243 F3 ó LATIN SMALL LETTER O WITH ACUTE
364 244 F4 ô LATIN SMALL LETTER O WITH CIRCUMFLEX
365 245 F5 õ LATIN SMALL LETTER O WITH TILDE
366 246 F6 ö LATIN SMALL LETTER O WITH DIAERESIS
367 247 F7 ÷ DIVISION SIGN
370 248 F8 ø LATIN SMALL LETTER O WITH STROKE
371 249 F9 ù LATIN SMALL LETTER U WITH GRAVE
372 250 FA ú LATIN SMALL LETTER U WITH ACUTE
373 251 FB û LATIN SMALL LETTER U WITH CIRCUMFLEX
374 252 FC ü LATIN SMALL LETTER U WITH DIAERESIS
375 253 FD ý LATIN SMALL LETTER Y WITH ACUTE
376 254 FE þ LATIN SMALL LETTER THORN
377 255 FF ÿ LATIN SMALL LETTER Y WITH DIAERESIS
In the above table, I plan to shorten "Latin Small Letter N with Tilde"
to "N with Tilde" in English, and supply a couple of options that would
yield "small N with Tilde" or "capital N with tilde", and so on.
If/when we move to unicode, we can/will use the same pronunciations,
just provide a new encoding to pronounce them...
Issue History
Date Modified Username Field Change
======================================================================
08-24-07 11:26 murf Note Added: 0069373
======================================================================
More information about the asterisk-bugs
mailing list