UnicodeDecodeError when parsing http://www.projekt6.de/?feed=podcast
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
gPodder | Fix Released | Medium | |
feedparser (Ubuntu) | Fix Released | Medium | Luca Falavigna |
Bug Description
Trying to parse the RSS feed http:// (the feed named in the title) raises a UnicodeDecodeError:
thp@macbook:~$ python
Python 2.5.2 (r252:60911, Apr 21 2008, 11:12:42)
[GCC 4.2.3 (Ubuntu 4.2.3-2ubuntu7)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import feedparser
>>> f = feedparser.parse('http://
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/var/lib/
feedparser.
File "/var/lib/
sgmllib.
File "/usr/lib/
self.goahead(0)
File "/usr/lib/
k = self.parse_
File "/usr/lib/
self.
File "/usr/lib/
self.
File "/var/lib/
method()
File "/var/lib/
value = self.popContent
File "/var/lib/
value = self.pop(tag)
File "/var/lib/
output = _resolveRelativ
File "/var/lib/
p.feed(
File "/var/lib/
sgmllib.
File "/usr/lib/
self.goahead(0)
File "/usr/lib/
k = self.parse_
File "/usr/lib/
self.
File "/usr/lib/
self.
File "/var/lib/
_BaseHTMLPr
File "/var/lib/
value = unicode(value, self.encoding)
UnicodeDecodeError: 'utf8' codec can't decode bytes in position 3-8: unsupported Unicode code range
I've created a patch against the most recent feedparser.py in Ubuntu 8.04, which will fix this problem by replacing invalid characters instead of failing completely.
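The idea of replacing invalid characters instead of failing can be sketched roughly as follows. This is not the attached patch, only an illustration of the technique in Python 3; the helper name `decode_leniently` is hypothetical. In Python 2, as used in the session above, the analogous call would be `unicode(value, encoding, 'replace')`.

```python
def decode_leniently(raw, encoding="utf-8"):
    """Decode bytes, falling back to replacement characters on bad input.

    Hypothetical helper for illustration; not part of feedparser.
    """
    try:
        # Strict decoding first, so valid input is returned unchanged.
        return raw.decode(encoding)
    except UnicodeDecodeError:
        # Undecodable byte sequences become U+FFFD instead of raising.
        return raw.decode(encoding, "replace")


if __name__ == "__main__":
    broken = b"Podcast \xff\xfe title"  # contains bytes that are not valid UTF-8
    print(decode_leniently(broken))
```

With this approach a single malformed element costs a few replacement characters in the output rather than the whole feed.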
Changed in gpodder: status: Unknown → In Progress
Changed in gpodder: status: In Progress → Fix Released
Changed in gpodder: importance: Unknown → Medium
Same bug reported in other bug trackers:
Upstream: http://code.google.com/p/feedparser/issues/detail?id=128
gPodder: http://bugs.gpodder.org/show_bug.cgi?id=157