Changeset 390

Show
Ignore:
Timestamp:
Wed Aug 23 11:38:15 2006
Author:
effbot
Message:

fixed handling of non-ascii entities

Files:

Legend:

Unmodified
Added
Removed
Modified
  • stuff/sandbox/elementlib/ElementSoup.py

    r389 r390  
    20 20 def unescape(string):  
    21 21     # work around oddities in BeautifulSoup's entity handling  
    22       def unescape_entity(m, defs=htmlentitydefs.entitydefs):  
      22     def unescape_entity(m, defs=htmlentitydefs.name2codepoint):  
    22 22         try:  
    23               return defs[m.group(1)]  
      23             return unichr(defs[m.group(1)])  
    23 23         except KeyError:  
    24 24             return m.group(0) # use as is