[asterisk-bugs] [Asterisk 0005208]: [patch] SayAlpha does not read special characteres such as ", " and "/"

noreply at bugs.digium.com noreply at bugs.digium.com
Fri Aug 24 11:26:02 CDT 2007


A NOTE has been added to this issue. 
====================================================================== 
http://bugs.digium.com/view.php?id=5208 
====================================================================== 
Reported By:                leoburd
Assigned To:                murf
====================================================================== 
Project:                    Asterisk
Issue ID:                   5208
Category:                   Core/General
Reproducibility:            always
Severity:                   minor
Priority:                   normal
Status:                     assigned
Asterisk Version:           SVN 
SVN Branch (only for SVN checkouts, not tarball releases): trunk 
SVN Revision (number only!):  
Disclaimer on File?:        No 
Request Review:              
====================================================================== 
Date Submitted:             09-13-2005 06:41 CDT
Last Modified:              08-24-2007 11:26 CDT
====================================================================== 
Summary:                    [patch] SayAlpha does not read special characteres
such as "," and "/"
Description: 
SayAlpha does not read common special characteres that are associated with
email addresses and URLs.  It would be great if it could read characteres
such as ":" ";" "_" "/" "," 

Thanks,

Leo

======================================================================
Relationships       ID      Summary
----------------------------------------------------------------------
related to          0010083 SayAlpha() and SayPhonetic() fail when ...
====================================================================== 

---------------------------------------------------------------------- 
 murf - 08-24-07 11:26  
---------------------------------------------------------------------- 
Thought I'd post my research on this issue; Kevin Fleming (who would OK
digium spending on this) doesn't think there is demand enough for the
following chars, to justify the expense. So, we need to narrow down the
list to just what would be really be minimally necessary....


--------------

Here is what I've gathered together for translation so far:


We need to have the following words recorded, so that SayAlpha will be
able to handle all the characters in the ASCII set.

----------
Already available:
----------
We already have these chars available in the digits/ and letters/
directories:

char (ascii code) "phrase(dir)"

 (20) "space(letters)"
!(21) "exclamation point(letters)"
#(23) "pound(digits)"
$(24) "dollar(letters)"
*(2A) "star(digits)"
+(2B) "plus(letters)"
-(2D) "minus(digits)" "dash(letters)"
.(2E) "dot(letters)"
/(2F) "slash(letters)"
=(3D) "equals(letters)"
@(40) "at(digits+letters)"

Since the above are already recorded in all 3 languages, we do not need
to repeat this work! I'm showing them here to calm the worries of anyone
who thinks we are missing something!



----------
Need to add:
----------

ENGLISH

"(22) "double quote"
#(23) "hash"               (this one is optional for (es,fr), as
"pound" 
                            is already recorded in all 3 languages.)
%(25) "percent"
&(26) "ampersand"
'(27) "single quote"
((28) "left parenthesis"
)(29) "right parenthesis"
*(2A) "Asterisk"
,(2C) "comma"
:(3A) "colon"
;(3B) "semicolon"
<(3C) "less-than"
>(3E) "greater-than"
?(3F) "question mark"
[(5B) "left square brace"
\(5C) "back-slash"
](5D) "right square brace"
^(5E) "caret"
_(5F) "underscore"
`(60) "back-quote"
{(7B) "left curly brace"
|(7C) "pipe"
}(7D) "right curly brace"
~(7E) "tilde"
 (7F) "DEL"
      "control"  (a prefix for var. chars to give "Control A" for 
                  ascii (01), etc.)
      "capital"  (a prefix before a letter, to differentiate A from a.)
      "small"    (a prefix before a letter, to differentiate a from A.)

Also, for es, fr, if there are regionally popular names for any of these
character names, as, for example, "hash", and "pound", for US/GB, you
may wish to add these multiple translations, and we can offer the
variants.

Also, if we can develop a short list of prefixes/suffixes for the
popular 8859-1 characters used in (es,fr), (for example "umlaut", if the
language were German), we can pronounce these characters also in (es,
fr), beyond the ascii 1-127 set...

The following characters will allow the rest of the 8859-1 characters to
be pronounced in English; some variations may be necessary for Spanish
and French! To hold down the cost, perhaps we can eliminate the
characters that would not be used in French or Spanish!


      "inverted"  (as in: inverted exclamation point, inverted question
mark)
      "sign"      (as in: cent sign, not sign)
      "cent"      (as in: cent sign)
      "currency"  (as in: currency sign)
      "yen"        (as in: yen sign)
      "broken bar"  
      "section"    (as in section sign)
      "diaeresis"
      "copyright"  (as in copyright sign)
      "feminine"    (as in feminine ordinal indicator)
      "masculine"   (as in masculine ordinal indicator)
      "ordinal indicator"
      "left double-angle"
      "right double-angle"
      "not"          (as in: not sign)
      "soft hyphen"
      "registered"   (as in: registered sign)
      "macron"
      "degree"        (as in: degree sign)
      "plus-minus"    (as in: plus-minus sign)
      "superscript"   (as in: superscript two, superscript three)
      "acute accent"
      "micro"         (as in: micro sign, or the greek letter mu)
      "pilcrow"       (as in: pilcrow sign)
      "middle"        (as in: middle dot)
      "cedilla"
      "quarter"
      "half"
      "three-quarters"
      "with"          (as in: Capital A with grave)
      "grave"         (as in: Capital A with grave)
      "acute"         (as in: Capital A with acute)
      "circumflex"    (as in: Capital A with circumflex)
      "ring above"    (as in: Capital A with ring above)
      "Latin Letter AE"  (as in: Capital Latin Letter AE)
      "Latin Letter Eth" (as in: Capital Latin Letter Eth)
      "multiplication" (as in: multiplication sign)
      "division"       (as in: division sign)
      "stroke"         (as in: SMALL LETTER O WITH STROKE)
      "thorn"          (as in: Latin Letter Thorn)
      "sharp"          (as in: LATIN SMALL LETTER SHARP S)

   (for reference, I provide a table of the characters in 8859-1 below)


       Oct   Dec   Hex   Char   Description

--------------------------------------------------------------------
       240   160   A0           NO-BREAK SPACE
       241   161   A1     ¡     INVERTED EXCLAMATION MARK
       242   162   A2     ¢     CENT SIGN
       243   163   A3     £     POUND SIGN
       244   164   A4     ¤     CURRENCY SIGN
       245   165   A5     ¥     YEN SIGN
       246   166   A6     ¦     BROKEN BAR
       247   167   A7     §     SECTION SIGN
       250   168   A8     ¨     DIAERESIS
       251   169   A9     ©     COPYRIGHT SIGN
       252   170   AA     ª     FEMININE ORDINAL INDICATOR
       253   171   AB     «     LEFT-POINTING DOUBLE ANGLE QUOTATION
MARK
       254   172   AC     ¬     NOT SIGN
       255   173   AD     ­     SOFT HYPHEN
       256   174   AE     ®     REGISTERED SIGN
       257   175   AF     ¯     MACRON
       260   176   B0     °     DEGREE SIGN
       261   177   B1     ±     PLUS-MINUS SIGN
       262   178   B2     ²     SUPERSCRIPT TWO
       263   179   B3     ³     SUPERSCRIPT THREE
       264   180   B4     ´     ACUTE ACCENT
       265   181   B5     µ     MICRO SIGN
       266   182   B6     ¶     PILCROW SIGN
       267   183   B7     ·     MIDDLE DOT
       270   184   B8     ¸     CEDILLA
       271   185   B9     ¹     SUPERSCRIPT ONE
       272   186   BA     º     MASCULINE ORDINAL INDICATOR
       273   187   BB     »     RIGHT-POINTING DOUBLE ANGLE QUOTATION
MARK
       274   188   BC     ¼     VULGAR FRACTION ONE QUARTER
       275   189   BD     ½     VULGAR FRACTION ONE HALF
       276   190   BE     ¾     VULGAR FRACTION THREE QUARTERS
       277   191   BF     ¿     INVERTED QUESTION MARK
       300   192   C0     À     LATIN CAPITAL LETTER A WITH GRAVE
       301   193   C1     Á     LATIN CAPITAL LETTER A WITH ACUTE
       302   194   C2     Â     LATIN CAPITAL LETTER A WITH CIRCUMFLEX
       303   195   C3     Ã     LATIN CAPITAL LETTER A WITH TILDE
       304   196   C4     Ä     LATIN CAPITAL LETTER A WITH DIAERESIS
       305   197   C5     Å     LATIN CAPITAL LETTER A WITH RING ABOVE
       306   198   C6     Æ     LATIN CAPITAL LETTER AE
       307   199   C7     Ç     LATIN CAPITAL LETTER C WITH CEDILLA
       310   200   C8     È     LATIN CAPITAL LETTER E WITH GRAVE
       311   201   C9     É     LATIN CAPITAL LETTER E WITH ACUTE
       312   202   CA     Ê     LATIN CAPITAL LETTER E WITH CIRCUMFLEX
       313   203   CB     Ë     LATIN CAPITAL LETTER E WITH DIAERESIS
       314   204   CC     Ì     LATIN CAPITAL LETTER I WITH GRAVE
       315   205   CD     Í     LATIN CAPITAL LETTER I WITH ACUTE
       316   206   CE     Î     LATIN CAPITAL LETTER I WITH CIRCUMFLEX
       317   207   CF     Ï     LATIN CAPITAL LETTER I WITH DIAERESIS
       320   208   D0     Ð     LATIN CAPITAL LETTER ETH
       321   209   D1     Ñ     LATIN CAPITAL LETTER N WITH TILDE
       322   210   D2     Ò     LATIN CAPITAL LETTER O WITH GRAVE
       323   211   D3     Ó     LATIN CAPITAL LETTER O WITH ACUTE
       324   212   D4     Ô     LATIN CAPITAL LETTER O WITH CIRCUMFLEX
       325   213   D5     Õ     LATIN CAPITAL LETTER O WITH TILDE
       326   214   D6     Ö     LATIN CAPITAL LETTER O WITH DIAERESIS
       327   215   D7     ×     MULTIPLICATION SIGN
       330   216   D8     Ø     LATIN CAPITAL LETTER O WITH STROKE
       331   217   D9     Ù     LATIN CAPITAL LETTER U WITH GRAVE
       332   218   DA     Ú     LATIN CAPITAL LETTER U WITH ACUTE
       333   219   DB     Û     LATIN CAPITAL LETTER U WITH CIRCUMFLEX
       334   220   DC     Ü     LATIN CAPITAL LETTER U WITH DIAERESIS
       335   221   DD     Ý     LATIN CAPITAL LETTER Y WITH ACUTE
       336   222   DE     Þ     LATIN CAPITAL LETTER THORN
       337   223   DF     ß     LATIN SMALL LETTER SHARP S
       340   224   E0     à     LATIN SMALL LETTER A WITH GRAVE
       341   225   E1     á     LATIN SMALL LETTER A WITH ACUTE
       342   226   E2     â     LATIN SMALL LETTER A WITH CIRCUMFLEX
       343   227   E3     ã     LATIN SMALL LETTER A WITH TILDE
       344   228   E4     ä     LATIN SMALL LETTER A WITH DIAERESIS
       345   229   E5     å     LATIN SMALL LETTER A WITH RING ABOVE
       346   230   E6     æ     LATIN SMALL LETTER AE
       347   231   E7     ç     LATIN SMALL LETTER C WITH CEDILLA
       350   232   E8     è     LATIN SMALL LETTER E WITH GRAVE
       351   233   E9     é     LATIN SMALL LETTER E WITH ACUTE
       352   234   EA     ê     LATIN SMALL LETTER E WITH CIRCUMFLEX
       353   235   EB     ë     LATIN SMALL LETTER E WITH DIAERESIS
       354   236   EC     ì     LATIN SMALL LETTER I WITH GRAVE
       355   237   ED     í     LATIN SMALL LETTER I WITH ACUTE
       356   238   EE     î     LATIN SMALL LETTER I WITH CIRCUMFLEX
       357   239   EF     ï     LATIN SMALL LETTER I WITH DIAERESIS
       360   240   F0     ð     LATIN SMALL LETTER ETH
       361   241   F1     ñ     LATIN SMALL LETTER N WITH TILDE
       362   242   F2     ò     LATIN SMALL LETTER O WITH GRAVE
       363   243   F3     ó     LATIN SMALL LETTER O WITH ACUTE
       364   244   F4     ô     LATIN SMALL LETTER O WITH CIRCUMFLEX
       365   245   F5     õ     LATIN SMALL LETTER O WITH TILDE
       366   246   F6     ö     LATIN SMALL LETTER O WITH DIAERESIS
       367   247   F7     ÷     DIVISION SIGN
       370   248   F8     ø     LATIN SMALL LETTER O WITH STROKE
       371   249   F9     ù     LATIN SMALL LETTER U WITH GRAVE
       372   250   FA     ú     LATIN SMALL LETTER U WITH ACUTE
       373   251   FB     û     LATIN SMALL LETTER U WITH CIRCUMFLEX
       374   252   FC     ü     LATIN SMALL LETTER U WITH DIAERESIS
       375   253   FD     ý     LATIN SMALL LETTER Y WITH ACUTE
       376   254   FE     þ     LATIN SMALL LETTER THORN
       377   255   FF     ÿ     LATIN SMALL LETTER Y WITH DIAERESIS

In the above table, I plan to shorten "Latin Small Letter N with Tilde"
to "N with Tilde" in English, and supply a couple of options that would
yield "small N with Tilde" or "capital N with tilde", and so on.

If/when we move to unicode, we can/will use the same pronunciations,
just provide a new encoding to pronounce them... 

Issue History 
Date Modified   Username       Field                    Change               
====================================================================== 
08-24-07 11:26  murf           Note Added: 0069373                          
======================================================================




More information about the asterisk-bugs mailing list