[mod_python] Decoding HTML escape characters in HTTP Requests

Behnam Esfahbod ZWNJ behnam at zwnj.org
Sat Jan 3 10:41:12 EST 2009


Hi list,

When browsers need to send Unicode characters (i.e. U+06FA, EXTENDED
ARABIC-INDIC DIGIT ONE)  in a non-Unicode (i.e. Western ISO-8859-1)
encoded HTTP request, they escape Unicode characters in HTML escape
formats.  For example above, the string "۱" will be sent to the
server.

I'm using mod_pythons's Publisher handler, and in these cases, i get
the escaped string, not the original Unicode text.  Is it a bug in
mod_python, or a non-standard feature of common browsers/app-servers,
or both?

Best,
-Behnam

Hint: U+06FA, EXTENDED ARABIC-INDIC DIGIT ONE = ۱


-- 
    '     بهنام اسفهبد
    '     Behnam Esfahbod
   '
  *  ..   http://behnam.esfahbod.info
 *  `  *
  * o *   http://zwnj.org



More information about the Mod_python mailing list