calibre

Merge lp:~tomek3d/calibre/kalibrator into lp:calibre

kalibrator
Merge into trunk

Proposed by Tomasz Długosz on 2013-03-06

Status:	Merged
Merged at revision:	14541
Proposed branch:	lp:~tomek3d/calibre/kalibrator
Merge into:	lp:calibre
Diff against target:	2089 lines (+1239/-180) 56 files modified recipes/adventure_zone_pl.recipe (+2/-2) recipes/archeowiesci.recipe (+2/-1) recipes/astro_news_pl.recipe (+1/-1) recipes/astroflesz.recipe (+1/-0) recipes/astronomia_pl.recipe (+1/-1) recipes/bash_org_pl.recipe (+2/-2) recipes/benchmark_pl.recipe (+11/-9) recipes/biweekly.recipe (+51/-0) recipes/cd_action.recipe (+2/-2) recipes/computerworld_pl.recipe (+6/-10) recipes/conowego_pl.recipe (+9/-1) recipes/czas_gentlemanow.recipe (+4/-2) recipes/dobreprogamy.recipe (+1/-1) recipes/dwutygodnik.recipe (+51/-0) recipes/dzieje_pl.recipe (+2/-2) recipes/dziennik_baltycki.recipe (+34/-0) recipes/dziennik_lodzki.recipe (+35/-0) recipes/dziennik_wschodni.recipe (+78/-0) recipes/dziennik_zachodni.recipe (+34/-0) recipes/echo_dnia.recipe (+74/-0) recipes/eioba.recipe (+1/-0) recipes/elektroda_pl.recipe (+1/-1) recipes/emuzica_pl.recipe (+2/-1) recipes/film_web.recipe (+7/-7) recipes/focus_pl.recipe (+1/-1) recipes/fotoblogia_pl.recipe (+2/-1) recipes/gazeta_krakowska.recipe (+34/-0) recipes/gazeta_lubuska.recipe (+64/-0) recipes/gazeta_pomorska.recipe (+84/-95) recipes/gazeta_wroclawska.recipe (+34/-0) recipes/gazeta_wspolczesna.recipe (+63/-0) recipes/gazeta_wyborcza.recipe (+1/-1) recipes/gcn.recipe (+83/-0) recipes/glos_wielkopolski.recipe (+34/-0) recipes/gram_pl.recipe (+4/-5) recipes/gry_online_pl.recipe (+54/-9) recipes/jazzpress.recipe (+50/-0) recipes/konflikty_zbrojne.recipe (+1/-1) recipes/kosmonauta_pl.recipe (+2/-3) recipes/kurier_lubelski.recipe (+34/-0) recipes/kurier_poranny.recipe (+78/-0) recipes/kurier_szczecinski.recipe (+27/-0) recipes/lomza.recipe (+1/-1) recipes/mlody_technik_pl.recipe (+9/-2) recipes/niebezpiecznik.recipe (+3/-3) recipes/nowa_fantastyka.recipe (+1/-1) recipes/nto.recipe (+63/-0) recipes/pc_foster.recipe (+7/-7) recipes/polska_times.recipe (+3/-1) recipes/spiders_web_pl.recipe (+1/-1) recipes/tablety_pl.recipe (+1/-1) recipes/tanuki.recipe (+2/-1) recipes/trojmiasto_pl.recipe (+37/-0) recipes/tvn24.recipe (+2/-2) recipes/ubuntu_pl.recipe (+1/-1) recipes/zycie_warszawy.recipe (+46/-0)
To merge this branch:	bzr merge lp:~tomek3d/calibre/kalibrator
Related bugs:	Link a bug report

Reviewer	Review Type	Date Requested	Status
Kovid Goyal		2013-03-06	Pending
Review via email: mp+152041@code.launchpad.net

Description of the change

I have two new commiters in my git repository. Here is a bunch of recipes submitted by them.

Revision history for this message

Kovid Goyal (kovid) wrote on 2013-03-07:

There are a few recipes in there that call ebook-convert to get an opf from an epub. This will break if ebook-convert is not in the PATH (which happens for example on OS X). Instead they should use the main() function from conversion/cli.py

I have merged replacing the ebook-convert call with a simple unzip + search for opf as there is no need to do a full conversion just to unpack the epub.

Preview Diff

[H/L] Next/Prev Comment, [J/K] Next/Prev File, [N/P] Next/Prev Hunk

Subscribers

People subscribed via source and target branches

to all changes:

Ali Baba

Kovid Goyal

Pankaj

Timothy Legge

Tomasz Długosz

gstoychev

 === modified file 'recipes/adventure_zone_pl.recipe'
 --- recipes/adventure_zone_pl.recipe	2013-02-16 15:44:46 +0000
 +++ recipes/adventure_zone_pl.recipe	2013-03-06 19:41:20 +0000
@@ -3,7 +3,7 @@
  class Adventure_zone(BasicNewsRecipe):
      title          = u'Adventure Zone'
      __author__        = 'fenuks'
--    description   = u'Adventure zone - adventure games from A to Z'
++    description   = u'Czytaj więcej o przygodzie - codzienne nowinki. Szukaj u nas solucji i poradników, czytaj recenzje i zapowiedzi. Także galeria, pliki oraz forum dla wszystkich fanów gier przygodowych.'
      category       = 'games'
      language       = 'pl'
      no_stylesheets = True
@@ -78,4 +78,4 @@
                  a['href']=self.index + a['href']
          return soup
--
 \ No newline at end of file
++
 === modified file 'recipes/archeowiesci.recipe'
 --- recipes/archeowiesci.recipe	2013-01-25 12:49:36 +0000
 +++ recipes/archeowiesci.recipe	2013-03-06 19:41:20 +0000
@@ -5,6 +5,7 @@
      __author__        = 'fenuks'
      category       = 'archeology'
      language       = 'pl'
++    description = u'Z pasją o przeszłości'
      cover_url='http://archeowiesci.pl/wp-content/uploads/2011/05/Archeowiesci2-115x115.jpg'
      oldest_article = 7
      needs_subscription='optional'
@@ -29,4 +30,4 @@
              br['log']   = self.username
              br['pwd'] = self.password
              br.submit()
--        return br
 \ No newline at end of file
++        return br
 === modified file 'recipes/astro_news_pl.recipe'
 --- recipes/astro_news_pl.recipe	2011-12-17 16:02:28 +0000
 +++ recipes/astro_news_pl.recipe	2013-03-06 19:41:20 +0000
@@ -2,7 +2,7 @@
  class AstroNEWS(BasicNewsRecipe):
      title          = u'AstroNEWS'
      __author__        = 'fenuks'
--    description   = 'AstroNEWS- astronomy every day'
++    description   = u'AstroNEWS regularnie dostarcza wiadomości o wydarzeniach związanych z astronomią i astronautyką. Informujemy o aktualnych odkryciach i wydarzeniach naukowych, zapowiadamy ciekawe zjawiska astronomiczne. Serwis jest częścią portalu astronomicznego AstroNET prowadzonego przez miłośników astronomii i zawodowych astronomów.'
      category       = 'astronomy, science'
      language       = 'pl'
      oldest_article = 8
 === modified file 'recipes/astroflesz.recipe'
 --- recipes/astroflesz.recipe	2012-12-14 12:01:34 +0000
 +++ recipes/astroflesz.recipe	2013-03-06 19:41:20 +0000
@@ -13,6 +13,7 @@
      max_articles_per_feed = 100
      no_stylesheets = True
      use_embedded_content = False
++    remove_attributes = ['style']
      keep_only_tags = [dict(id="k2Container")]
      remove_tags_after = dict(name='div', attrs={'class':'itemLinks'})
      remove_tags = [dict(name='div', attrs={'class':['itemLinks', 'itemToolbar', 'itemRatingBlock']})]
 === modified file 'recipes/astronomia_pl.recipe'
 --- recipes/astronomia_pl.recipe	2012-02-20 04:34:40 +0000
 +++ recipes/astronomia_pl.recipe	2013-03-06 19:41:20 +0000
@@ -3,7 +3,7 @@
  class Astronomia_pl(BasicNewsRecipe):
      title          = u'Astronomia.pl'
      __author__        = 'fenuks'
--    description   = 'Astronomia - polish astronomy site'
++    description   = u'Astronomia.pl jest edukacyjnym portalem skierowanym do uczniów, studentów i miłośników astronomii. Przedstawiamy gwiazdy, planety, galaktyki, czarne dziury i wiele innych tajemnic Wszechświata.'
      masthead_url      = 'http://www.astronomia.pl/grafika/logo.gif'
      cover_url      = 'http://www.astronomia.pl/grafika/logo.gif'
      category       = 'astronomy, science'
 === modified file 'recipes/bash_org_pl.recipe'
 --- recipes/bash_org_pl.recipe	2013-02-16 15:44:46 +0000
 +++ recipes/bash_org_pl.recipe	2013-03-06 19:41:20 +0000
@@ -3,7 +3,7 @@
  class Bash_org_pl(BasicNewsRecipe):
      title          = u'Bash.org.pl'
      __author__        = 'fenuks'
--    description   = 'Bash.org.pl - funny quotations from IRC discussions'
++    description   = 'Bash.org.pl - zabawne cytaty z IRC'
      category       = 'funny quotations, humour'
      language       = 'pl'
      cover_url = u'http://userlogos.org/files/logos/dzikiosiol/none_0.png'
@@ -51,4 +51,4 @@
           feeds          = []
           feeds.append((u"Najnowsze", self.latest_articles()))
           feeds.append((u"Losowe", self.random_articles()))
--         return feeds
 \ No newline at end of file
++         return feeds
 === modified file 'recipes/benchmark_pl.recipe'
 --- recipes/benchmark_pl.recipe	2012-10-17 14:12:08 +0000
 +++ recipes/benchmark_pl.recipe	2013-03-06 19:41:20 +0000
@@ -3,14 +3,15 @@
  class BenchmarkPl(BasicNewsRecipe):
      title          = u'Benchmark.pl'
      __author__        = 'fenuks'
--    description   = u'benchmark.pl -IT site'
++    description   = u'benchmark.pl, recenzje i testy sprzętu, aktualności, rankingi, sterowniki, porady, opinie'
      masthead_url = 'http://www.benchmark.pl/i/logo-footer.png'
--    cover_url      = 'http://www.ieaddons.pl/benchmark/logo_benchmark_new.gif'
++    cover_url      = 'http://www.benchmark.pl/i/logo-dark.png'
      category       = 'IT'
      language       = 'pl'
      oldest_article = 8
      max_articles_per_feed = 100
--    no_stylesheets=True
++    no_stylesheets = True
++    remove_attributes = ['style']
      preprocess_regexps = [(re.compile(ur'<h3><span style="font-size: small;">&nbsp;Zobacz poprzednie <a href="http://www.benchmark.pl/news/zestawienie/grupa_id/135">Opinie dnia:</a></span>.*</body>', re.DOTALL|re.IGNORECASE), lambda match: '</body>'), (re.compile(ur'Więcej o .*?</ul>', re.DOTALL|re.IGNORECASE), lambda match: '')]
      keep_only_tags=[dict(name='div', attrs={'class':['m_zwykly', 'gallery']}), dict(id='article')]
      remove_tags_after=dict(name='div', attrs={'class':'body'})
@@ -21,17 +22,18 @@
      def append_page(self, soup, appendtag):
--        nexturl = soup.find('span', attrs={'class':'next'})
--        while nexturl is not None:
--            nexturl= self.INDEX + nexturl.parent['href']
--            soup2 = self.index_to_soup(nexturl)
--            nexturl=soup2.find('span', attrs={'class':'next'})
++        nexturl = soup.find(attrs={'class':'next'})
++        while nexturl:
++            soup2 = self.index_to_soup(nexturl['href'])
++            nexturl = soup2.find(attrs={'class':'next'})
              pagetext = soup2.find(name='div', attrs={'class':'body'})
              appendtag.find('div', attrs={'class':'k_ster'}).extract()
              pos = len(appendtag.contents)
              appendtag.insert(pos, pagetext)
--        if appendtag.find('div', attrs={'class':'k_ster'}) is not None:
++        if appendtag.find('div', attrs={'class':'k_ster'}):
              appendtag.find('div', attrs={'class':'k_ster'}).extract()
++        for r in appendtag.findAll(attrs={'class':'changePage'}):
++            r.extract()
      def image_article(self, soup, appendtag):
 === added file 'recipes/biweekly.recipe'
 --- recipes/biweekly.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/biweekly.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,51 @@
++#!/usr/bin/env python
++# -*- coding: utf-8 -*-
++
++__license__ = 'GPL v3'
++__copyright__ = u'Łukasz Grąbczewski 2011'
++__version__ = '2.0'
++
++import re, os
++from calibre.ptempfile import PersistentTemporaryFile
++from calibre.ebooks.conversion.cli import main
++
++class biweekly(BasicNewsRecipe):
++	__author__ = u'Łukasz Grąbczewski'
++	title = 'Biweekly'
++	language = 'en_EN'
++	publisher = 'National Audiovisual Institute'
++	publication_type = 'magazine'
++	description = u'link with culture [English edition of Polish magazine]: literature, theatre, film, art, music, views, talks'
++
++	conversion_options = {
++		'authors' : 'Biweekly.pl'
++		,'publisher' : publisher
++		,'language' : language
++		,'comments' : description
++		,'no_default_epub_cover' : True
++		,'preserve_cover_aspect_ratio': True
++	}
++
++	def build_index(self):
++		browser = self.get_browser()
++		rc = browser.open('http://www.biweekly.pl/')
++
++		# find the link
++		epublink = browser.find_link(text_regex=re.compile('ePUB VERSION'))
++
++		# download ebook
++		self.report_progress(0,_('Downloading ePUB'))
++		response = browser.follow_link(epublink)
++		book_file = PersistentTemporaryFile(suffix='.epub')
++		book_file.write(response.read())
++		book_file.close()
++
++		# convert
++		self.report_progress(0.2,_('Converting to OEB'))
++		oebdir = self.output_dir + '/INPUT/'
++		main(['ebook-convert', book_file.name, oebdir])
++
++		# feed calibre
++		index = os.path.join(oebdir, 'content.opf')
++
++		return index
 === modified file 'recipes/cd_action.recipe'
 --- recipes/cd_action.recipe	2012-04-18 04:03:44 +0000
 +++ recipes/cd_action.recipe	2013-03-06 19:41:20 +0000
@@ -3,7 +3,7 @@
  class CD_Action(BasicNewsRecipe):
      title          = u'CD-Action'
      __author__        = 'fenuks'
--    description   = 'cdaction.pl - polish games magazine site'
++    description   = 'Strona CD-Action (CDA), największego w Polsce pisma dla graczy.Pełne wersje gier, newsy, recenzje, zapowiedzi, konkursy, forum, opinie, galerie screenów,trailery, filmiki, patche, teksty. Gry komputerowe (PC) oraz na konsole (PS3, XBOX 360).'
      category       = 'games'
      language       = 'pl'
      index='http://www.cdaction.pl'
@@ -24,4 +24,4 @@
          for a in soup('a'):
              if a.has_key('href') and 'http://' not in a['href'] and 'https://' not in a['href']:
                  a['href']=self.index + a['href']
--        return soup
 \ No newline at end of file
++        return soup
 === modified file 'recipes/computerworld_pl.recipe'
 --- recipes/computerworld_pl.recipe	2012-02-20 04:34:40 +0000
 +++ recipes/computerworld_pl.recipe	2013-03-06 19:41:20 +0000
@@ -7,17 +7,13 @@
      description   = u'Serwis o IT w przemyśle, finansach, handlu, administracji oraz rynku IT i telekomunikacyjnym - wiadomości, opinie, analizy, porady prawne'
      category       = 'IT'
      language       = 'pl'
--    masthead_url= 'http://g1.computerworld.pl/cw/beta_gfx/cw2.gif'
--    no_stylesheets=True
++    masthead_url = 'http://g1.computerworld.pl/cw/beta_gfx/cw2.gif'
++    cover_url = 'http://g1.computerworld.pl/cw/beta_gfx/cw2.gif'
++    no_stylesheets = True
      oldest_article = 7
      max_articles_per_feed = 100
--    keep_only_tags=[dict(attrs={'class':['tyt_news', 'prawo', 'autor', 'tresc']})]
--    remove_tags_after=dict(name='div', attrs={'class':'rMobi'})
--    remove_tags=[dict(name='div', attrs={'class':['nnav', 'rMobi']}), dict(name='table', attrs={'class':'ramka_slx'})]
++    keep_only_tags = [dict(attrs={'class':['tyt_news', 'prawo', 'autor', 'tresc']})]
++    remove_tags_after = dict(name='div', attrs={'class':'rMobi'})
++    remove_tags = [dict(name='div', attrs={'class':['nnav', 'rMobi']}), dict(name='table', attrs={'class':'ramka_slx'})]
      feeds          = [(u'Wiadomo\u015bci', u'http://rssout.idg.pl/cw/news_iso.xml')]
--    def get_cover_url(self):
--        soup = self.index_to_soup('http://www.computerworld.pl/')
--        cover=soup.find(name='img', attrs={'class':'prawo'})
--        self.cover_url=cover['src']
--        return getattr(self, 'cover_url', self.cover_url)
 === modified file 'recipes/conowego_pl.recipe'
 --- recipes/conowego_pl.recipe	2013-01-20 08:18:34 +0000
 +++ recipes/conowego_pl.recipe	2013-03-06 19:41:20 +0000
@@ -4,11 +4,12 @@
      title          = u'conowego.pl'
      __author__        = 'fenuks'
      description   = u'Nowy wortal technologiczny oraz gazeta internetowa. Testy najnowszych produktów, fachowe porady i recenzje. U nas znajdziesz wszystko o elektronice użytkowej !'
--    cover_url      = 'http://www.conowego.pl/fileadmin/templates/main/images/logo_top.png'
++    #cover_url      = 'http://www.conowego.pl/fileadmin/templates/main/images/logo_top.png'
      category       = 'IT, news'
      language       = 'pl'
      oldest_article = 7
      max_articles_per_feed = 100
++    INDEX = 'http://www.conowego.pl/'
      no_stylesheets = True
      remove_empty_feeds = True
      use_embedded_content = False
@@ -36,3 +37,10 @@
              for r in appendtag.findAll(attrs={'class':['pages', 'paginationWrap']}):
                  r.extract()
++
++    def get_cover_url(self):
++        soup = self.index_to_soup('http://www.conowego.pl/magazyn/')
++        tag = soup.find(attrs={'class':'ms_left'})
++        if tag:
++            self.cover_url = self.INDEX + tag.find('img')['src']
++        return getattr(self, 'cover_url', self.cover_url)
 === modified file 'recipes/czas_gentlemanow.recipe'
 --- recipes/czas_gentlemanow.recipe	2012-12-14 12:01:34 +0000
 +++ recipes/czas_gentlemanow.recipe	2013-03-06 19:41:20 +0000
@@ -1,4 +1,5 @@
  # vim:fileencoding=UTF-8:ts=4:sw=4:sta:et:sts=4:fdm=marker:ai
++import re
  from calibre.web.feeds.news import BasicNewsRecipe
  class CzasGentlemanow(BasicNewsRecipe):
@@ -13,8 +14,9 @@
      max_articles_per_feed = 100
      no_stylesheets = True
      remove_empty_feeds = True
++    preprocess_regexps     = [(re.compile(u'<h3>Może Cię też zainteresować:</h3>'), lambda m: '')]
      use_embedded_content = False
      keep_only_tags = [dict(name='div', attrs={'class':'content'})]
--    remove_tags = [dict(attrs={'class':'meta_comments'})]
--    remove_tags_after = dict(name='div', attrs={'class':'fblikebutton_button'})
++    remove_tags = [dict(attrs={'class':'meta_comments'}), dict(id=['comments', 'related_posts_thumbnails'])]
++    remove_tags_after = dict(id='comments')
      feeds          = [(u'M\u0119ski \u015awiat', u'http://czasgentlemanow.pl/category/meski-swiat/feed/'), (u'Styl', u'http://czasgentlemanow.pl/category/styl/feed/'), (u'Vademecum Gentlemana', u'http://czasgentlemanow.pl/category/vademecum/feed/'), (u'Dom i rodzina', u'http://czasgentlemanow.pl/category/dom-i-rodzina/feed/'), (u'Honor', u'http://czasgentlemanow.pl/category/honor/feed/'), (u'Gad\u017cety Gentlemana', u'http://czasgentlemanow.pl/category/gadzety-gentlemana/feed/')]
 === modified file 'recipes/dobreprogamy.recipe'
 --- recipes/dobreprogamy.recipe	2012-11-10 10:51:21 +0000
 +++ recipes/dobreprogamy.recipe	2013-03-06 19:41:20 +0000
@@ -18,7 +18,7 @@
      max_articles_per_feed = 100
      preprocess_regexps = [(re.compile(ur'<div id="\S+360pmp4">Twoja przeglądarka nie obsługuje Flasha i HTML5 lub wyłączono obsługę JavaScript...</div>'), lambda match: '') ]
      keep_only_tags=[dict(attrs={'class':['news', 'entry single']})]
--    remove_tags = [dict(attrs={'class':['newsOptions', 'noPrint', 'komentarze', 'tags  font-heading-master']}), dict(id='komentarze')]
++    remove_tags = [dict(attrs={'class':['newsOptions', 'noPrint', 'komentarze', 'tags  font-heading-master']}), dict(id='komentarze'), dict(name='iframe')]
      #remove_tags = [dict(name='div', attrs={'class':['komentarze', 'block', 'portalInfo', 'menuBar', 'topBar']})]
      feeds = [(u'Aktualności', 'http://feeds.feedburner.com/dobreprogramy/Aktualnosci'),
                   ('Blogi', 'http://feeds.feedburner.com/dobreprogramy/BlogCzytelnikow')]
 === added file 'recipes/dwutygodnik.recipe'
 --- recipes/dwutygodnik.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/dwutygodnik.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,51 @@
++#!/usr/bin/env python
++# -*- coding: utf-8 -*-
++
++__license__ = 'GPL v3'
++__copyright__ = u'Łukasz Grąbczewski 2011'
++__version__ = '2.0'
++
++import re, os
++from calibre.ptempfile import PersistentTemporaryFile
++from calibre.ebooks.conversion.cli import main
++
++class dwutygodnik(BasicNewsRecipe):
++	__author__ = u'Łukasz Grąbczewski'
++	title = 'Dwutygodnik'
++	language = 'pl_PL'
++	publisher = 'Narodowy Instytut Audiowizualny'
++	publication_type = 'magazine'
++	description = u'Strona Kultury: literatura, teatr, film, sztuka, muzyka, felietony, rozmowy'
++
++	conversion_options = {
++		'authors' : 'Dwutygodnik.com'
++		,'publisher' : publisher
++		,'language' : language
++		,'comments' : description
++		,'no_default_epub_cover' : True
++		,'preserve_cover_aspect_ratio': True
++	}
++
++	def build_index(self):
++		browser = self.get_browser()
++		rc = browser.open('http://www.dwutygodnik.com/')
++
++		# find the link
++		epublink = browser.find_link(text_regex=re.compile('Wersja ePub'))
++
++		# download ebook
++		self.report_progress(0,_('Downloading ePUB'))
++		response = browser.follow_link(epublink)
++		book_file = PersistentTemporaryFile(suffix='.epub')
++		book_file.write(response.read())
++		book_file.close()
++
++		# convert
++		self.report_progress(0.2,_('Converting to OEB'))
++		oebdir = self.output_dir + '/INPUT/'
++		main(['ebook-convert', book_file.name, oebdir])
++
++		# feed calibre
++		index = os.path.join(oebdir, 'content.opf')
++
++		return index
 === modified file 'recipes/dzieje_pl.recipe'
 --- recipes/dzieje_pl.recipe	2012-12-14 09:32:01 +0000
 +++ recipes/dzieje_pl.recipe	2013-03-06 19:41:20 +0000
@@ -3,7 +3,7 @@
  class Dzieje(BasicNewsRecipe):
      title          = u'dzieje.pl'
      __author__        = 'fenuks'
--    description   = 'Dzieje - history of Poland'
++    description   = 'Dzieje.pl - najlepszy portal informacyjno-edukacyjny dotyczący historii Polski XX wieku. Archiwalne fotografie, filmy, katalog postaci, quizy i konkursy.'
      cover_url      = 'http://www.dzieje.pl/sites/default/files/dzieje_logo.png'
      category       = 'history'
      language       = 'pl'
@@ -67,4 +67,4 @@
              if a.has_key('href') and 'http://' not in a['href'] and 'https://' not in a['href']:
                  a['href']=self.index + a['href']
          self.append_page(soup, soup.body)
--        return soup
 \ No newline at end of file
++        return soup
 === added file 'recipes/dziennik_baltycki.recipe'
 --- recipes/dziennik_baltycki.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/dziennik_baltycki.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,34 @@
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class DziennikBaltycki(BasicNewsRecipe):
++    title          = u'Dziennik Ba\u0142tycki'
++    __author__        = 'fenuks'
++    description   = u'Gazeta Regionalna Dziennik Bałtycki. Najnowsze Wiadomości Trójmiasto i Wiadomości Pomorskie. Czytaj!'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    masthead_url = 'http://s.polskatimes.pl/g/logo_naglowek/dziennikbaltycki.png?24'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds= True
++    no_stylesheets = True
++    use_embedded_content = False
++    ignore_duplicate_articles = {'title', 'url'}
++    #preprocess_regexps = [(re.compile(ur'<b>Czytaj także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur',<b>Czytaj też:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>Zobacz także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<center><h4><a.*?</a></h4></center>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TEŻ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ WIĘCEJ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TAKŻE:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>\* CZYTAJ KONIECZNIE:.*', re.DOTALL), lambda match: '</body>'), (re.compile(ur'<b>Nasze serwisy:</b>.*', re.DOTALL), lambda match: '</body>') ]
++    remove_tags_after= dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})
++    remove_tags=[dict(id='mat-podobne'), dict(name='a', attrs={'class':'czytajDalej'}), dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})]
++
++    feeds          = [(u'Wiadomo\u015bci', u'http://www.dziennikbaltycki.pl/rss/dziennikbaltycki_wiadomosci.xml?201302'), (u'Sport', u'http://dziennikbaltycki.feedsportal.com/c/32980/f/533756/index.rss?201302'), (u'Rejsy', u'http://www.dziennikbaltycki.pl/rss/dziennikbaltycki_rejsy.xml?201302'), (u'Biznes na Pomorzu', u'http://www.dziennikbaltycki.pl/rss/dziennikbaltycki_biznesnapomorzu.xml?201302'), (u'GOM', u'http://www.dziennikbaltycki.pl/rss/dziennikbaltycki_gom.xml?201302'), (u'Opinie', u'http://www.dziennikbaltycki.pl/rss/dziennikbaltycki_opinie.xml?201302'), (u'Pitawal Pomorski', u'http://www.dziennikbaltycki.pl/rss/dziennikbaltycki_pitawalpomorski.xml?201302')]
++
++    def print_version(self, url):
++        return url.replace('artykul', 'drukuj')
++
++    def skip_ad_pages(self, soup):
++        if 'Advertisement' in soup.title:
++            nexturl=soup.find('a')['href']
++            return self.index_to_soup(nexturl, raw=True)
++
++    def get_cover_url(self):
++        soup = self.index_to_soup('http://www.prasa24.pl/gazeta/dziennik-baltycki/')
++        self.cover_url=soup.find(id='pojemnik').img['src']
++        return getattr(self, 'cover_url', self.cover_url)
 \ No newline at end of file
 === added file 'recipes/dziennik_lodzki.recipe'
 --- recipes/dziennik_lodzki.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/dziennik_lodzki.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,35 @@
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class DziennikLodzki(BasicNewsRecipe):
++    title          = u'Dziennik \u0141\xf3dzki'
++    __author__        = 'fenuks'
++    description   = u'Gazeta Regionalna Dziennik Łódzki. Najnowsze Wiadomości Łódź. Czytaj Wiadomości Łódzkie!'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    masthead_url = 'http://s.polskatimes.pl/g/logo_naglowek/dzienniklodzki.png?24'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds = True
++    no_stylesheets = True
++    use_embedded_content = False
++    ignore_duplicate_articles = {'title', 'url'}
++    #preprocess_regexps = [(re.compile(ur'<b>Czytaj także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur',<b>Czytaj też:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>Zobacz także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<center><h4><a.*?</a></h4></center>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TEŻ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ WIĘCEJ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TAKŻE:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>\* CZYTAJ KONIECZNIE:.*', re.DOTALL), lambda match: '</body>'), (re.compile(ur'<b>Nasze serwisy:</b>.*', re.DOTALL), lambda match: '</body>') ]
++    remove_tags_after= dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})
++    remove_tags=[dict(id='mat-podobne'), dict(name='a', attrs={'class':'czytajDalej'}), dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})]
++
++    feeds          = [(u'Na sygnale', u'http://www.dzienniklodzki.pl/rss/dzienniklodzki_nasygnale.xml?201302'), (u'\u0141\xf3d\u017a', u'http://www.dzienniklodzki.pl/rss/dzienniklodzki_lodz.xml?201302'), (u'Opinie', u'http://www.dzienniklodzki.pl/rss/dzienniklodzki_opinie.xml?201302'), (u'Pieni\u0105dze', u'http://dzienniklodzki.feedsportal.com/c/32980/f/533763/index.rss?201302'), (u'Kultura', u'http://dzienniklodzki.feedsportal.com/c/32980/f/533762/index.rss?201302'), (u'Sport', u'http://dzienniklodzki.feedsportal.com/c/32980/f/533761/index.rss?201302'), (u'Akcje', u'http://www.dzienniklodzki.pl/rss/dzienniklodzki_akcje.xml?201302'), (u'M\xf3j Reporter', u'http://www.dzienniklodzki.pl/rss/dzienniklodzki_mojreporter.xml?201302'), (u'Studni\xf3wki', u'http://www.dzienniklodzki.pl/rss/dzienniklodzki_studniowki.xml?201302'), (u'Kraj', u'http://www.dzienniklodzki.pl/rss/dzienniklodzki_kraj.xml?201302'), (u'Zdrowie', u'http://www.dzienniklodzki.pl/rss/dzienniklodzki_zdrowie.xml?201302')]
++
++
++    def print_version(self, url):
++        return url.replace('artykul', 'drukuj')
++
++    def skip_ad_pages(self, soup):
++        if 'Advertisement' in soup.title:
++            nexturl=soup.find('a')['href']
++            return self.index_to_soup(nexturl, raw=True)
++
++    def get_cover_url(self):
++        soup = self.index_to_soup('http://www.prasa24.pl/gazeta/dziennik-lodzki/')
++        self.cover_url=soup.find(id='pojemnik').img['src']
++        return getattr(self, 'cover_url', self.cover_url)
 === added file 'recipes/dziennik_wschodni.recipe'
 --- recipes/dziennik_wschodni.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/dziennik_wschodni.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,78 @@
++import re
++from calibre.web.feeds.news import BasicNewsRecipe
++class DziennikWschodni(BasicNewsRecipe):
++    title          = u'Dziennik Wschodni'
++    __author__        = 'fenuks'
++    description   = u'Dziennik Wschodni - portal regionalny województwa lubelskiego.'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    extra_css = 'ul {list-style: none; padding:0; margin:0;}'
++    INDEX = 'http://www.dziennikwschodni.pl'
++    masthead_url = INDEX + '/images/top_logo.png'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds = True
++    no_stylesheets = True
++    ignore_duplicate_articles = {'title', 'url'}
++
++    preprocess_regexps = [(re.compile(ur'Czytaj:.*?</a>', re.DOTALL), lambda match: ''), (re.compile(ur'Przeczytaj także:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''),
++                          (re.compile(ur'Przeczytaj również:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''), (re.compile(ur'Zobacz też:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: '')]
++
++    keep_only_tags = [dict(id=['article', 'cover', 'photostory'])]
++    remove_tags = [dict(id=['articleTags', 'articleMeta', 'boxReadIt', 'articleGalleries', 'articleConnections',
++                  'ForumArticleComments', 'articleRecommend', 'jedynkiLinks', 'articleGalleryConnections',
++                  'photostoryConnections', 'articleEpaper', 'articlePoll', 'articleAlarm', 'articleByline']),
++                  dict(attrs={'class':'articleFunctions'})]
++
++
++    feeds          = [(u'Wszystkie', u'http://www.dziennikwschodni.pl/rss.xml'),
++                      (u'Lublin', u'http://www.dziennikwschodni.pl/lublin.xml'),
++                      (u'Zamość', u'http://www.dziennikwschodni.pl/zamosc.xml'),
++                      (u'Biała Podlaska', u'http://www.dziennikwschodni.pl/biala_podlaska.xml'),
++                      (u'Chełm', u'http://www.dziennikwschodni.pl/chelm.xml'),
++                      (u'Kraśnik', u'http://www.dziennikwschodni.pl/krasnik.xml'),
++                      (u'Puławy', u'http://www.dziennikwschodni.pl/pulawy.xml'),
++                      (u'Świdnik', u'http://www.dziennikwschodni.pl/swidnik.xml'),
++                      (u'Łęczna', u'http://www.dziennikwschodni.pl/leczna.xml'),
++                      (u'Lubartów', u'http://www.dziennikwschodni.pl/lubartow.xml'),
++                      (u'Sport', u'http://www.dziennikwschodni.pl/sport.xml'),
++                      (u'Praca', u'http://www.dziennikwschodni.pl/praca.xml'),
++                      (u'Dom', u'http://www.dziennikwschodni.pl/dom.xml'),
++                      (u'Moto', u'http://www.dziennikwschodni.pl/moto.xml'),
++                      (u'Zdrowie', u'http://www.dziennikwschodni.pl/zdrowie.xml'),
++                      ]
++
++    def get_cover_url(self):
++        soup = self.index_to_soup(self.INDEX + '/apps/pbcs.dll/section?Category=JEDYNKI')
++        nexturl = self.INDEX + soup.find(id='covers').find('a')['href']
++        soup = self.index_to_soup(nexturl)
++        self.cover_url = self.INDEX + soup.find(id='cover').find(name='img')['src']
++        return getattr(self, 'cover_url', self.cover_url)
++
++    def append_page(self, soup, appendtag):
++        tag = soup.find('span', attrs={'class':'photoNavigationPages'})
++        if tag:
++            number = int(tag.string.rpartition('/')[-1].replace('&nbsp;', ''))
++            baseurl = self.INDEX + soup.find(attrs={'class':'photoNavigationNext'})['href'][:-1]
++
++            for r in appendtag.findAll(attrs={'class':'photoNavigation'}):
++                r.extract()
++            for nr in range(2, number+1):
++                soup2 = self.index_to_soup(baseurl + str(nr))
++                pagetext = soup2.find(id='photoContainer')
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoMeta'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoStoryText'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++
++    def preprocess_html(self, soup):
++        self.append_page(soup, soup.body)
++        return soup
 === added file 'recipes/dziennik_zachodni.recipe'
 --- recipes/dziennik_zachodni.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/dziennik_zachodni.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,34 @@
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class DziennikZachodni(BasicNewsRecipe):
++    title          = u'Dziennik Zachodni'
++    __author__        = 'fenuks'
++    description   = u'Gazeta Regionalna Dziennik Zachodni. Najnowsze Wiadomości Śląskie. Wiadomości Śląsk. Czytaj!'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    masthead_url = 'http://s.polskatimes.pl/g/logo_naglowek/dziennikzachodni.png?24'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds= True
++    no_stylesheets = True
++    use_embedded_content = False
++    ignore_duplicate_articles = {'title', 'url'}
++    #preprocess_regexps = [(re.compile(ur'<b>Czytaj także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur',<b>Czytaj też:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>Zobacz także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<center><h4><a.*?</a></h4></center>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TEŻ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ WIĘCEJ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TAKŻE:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>\* CZYTAJ KONIECZNIE:.*', re.DOTALL), lambda match: '</body>'), (re.compile(ur'<b>Nasze serwisy:</b>.*', re.DOTALL), lambda match: '</body>') ]
++    remove_tags_after= dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})
++    remove_tags=[dict(id='mat-podobne'), dict(name='a', attrs={'class':'czytajDalej'}), dict(attrs={'src':'http://nm.dz.com.pl/dz.png'}), dict(attrs={'href':'http://www.dziennikzachodni.pl/piano'})]
++
++    feeds          = [(u'Wszystkie', u'http://dziennikzachodni.feedsportal.com/c/32980/f/533764/index.rss?201302'), (u'Wiadomo\u015bci', u'http://dziennikzachodni.feedsportal.com/c/32980/f/533765/index.rss?201302'), (u'Regiony', u'http://www.dziennikzachodni.pl/rss/dziennikzachodni_regiony.xml?201302'), (u'Opinie', u'http://www.dziennikzachodni.pl/rss/dziennikzachodni_regiony.xml?201302'), (u'Blogi', u'http://www.dziennikzachodni.pl/rss/dziennikzachodni_blogi.xml?201302'), (u'Serwisy', u'http://www.dziennikzachodni.pl/rss/dziennikzachodni_serwisy.xml?201302'), (u'Sport', u'http://dziennikzachodni.feedsportal.com/c/32980/f/533766/index.rss?201302'), (u'M\xf3j Reporter', u'http://www.dziennikzachodni.pl/rss/dziennikzachodni_mojreporter.xml?201302'), (u'Na narty', u'http://www.dziennikzachodni.pl/rss/dziennikzachodni_nanarty.xml?201302'), (u'Drogi', u'http://www.dziennikzachodni.pl/rss/dziennikzachodni_drogi.xml?201302'), (u'Pieni\u0105dze', u'http://dziennikzachodni.feedsportal.com/c/32980/f/533768/index.rss?201302')]
++
++    def print_version(self, url):
++        return url.replace('artykul', 'drukuj')
++
++    def skip_ad_pages(self, soup):
++        if 'Advertisement' in soup.title:
++            nexturl=soup.find('a')['href']
++            return self.index_to_soup(nexturl, raw=True)
++
++    def get_cover_url(self):
++        soup = self.index_to_soup('http://www.prasa24.pl/gazeta/dziennik-zachodni/')
++        self.cover_url=soup.find(id='pojemnik').img['src']
++        return getattr(self, 'cover_url', self.cover_url)
 === added file 'recipes/echo_dnia.recipe'
 --- recipes/echo_dnia.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/echo_dnia.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,74 @@
++import re
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class EchoDnia(BasicNewsRecipe):
++    title          = u'Echo Dnia'
++    __author__        = 'fenuks'
++    description   = u'Echo Dnia - portal regionalny świętokrzyskiego radomskiego i podkarpackiego. Najnowsze wiadomości z Twojego regionu, galerie, video, mp3.'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    extra_css = 'ul {list-style: none; padding:0; margin:0;}'
++    INDEX = 'http://www.echodnia.eu'
++    masthead_url = INDEX + '/images/top_logo.png'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds = True
++    no_stylesheets = True
++    ignore_duplicate_articles = {'title', 'url'}
++
++    preprocess_regexps = [(re.compile(ur'Czytaj:.*?</a>', re.DOTALL), lambda match: ''), (re.compile(ur'Przeczytaj także:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''),
++                          (re.compile(ur'Przeczytaj również:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''), (re.compile(ur'Zobacz też:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: '')]
++
++    keep_only_tags = [dict(id=['article', 'cover', 'photostory'])]
++    remove_tags = [dict(id=['articleTags', 'articleMeta', 'boxReadIt', 'articleGalleries', 'articleConnections',
++                  'ForumArticleComments', 'articleRecommend', 'jedynkiLinks', 'articleGalleryConnections',
++                  'photostoryConnections', 'articleEpaper', 'articlePoll', 'articleAlarm', 'articleByline']),
++                  dict(attrs={'class':'articleFunctions'})]
++
++    feeds          = [(u'Wszystkie', u'http://www.echodnia.eu/rss.xml'),
++                      (u'Świętokrzyskie', u'http://www.echodnia.eu/swietokrzyskie.xml'),
++                      (u'Radomskie', u'http://www.echodnia.eu/radomskie.xml'),
++                      (u'Podkarpackie', u'http://www.echodnia.eu/podkarpackie.xml'),
++                      (u'Sport \u015bwi\u0119tokrzyski', u'http://www.echodnia.eu/sport_swi.xml'),
++                      (u'Sport radomski', u'http://www.echodnia.eu/sport_rad.xml'),
++                      (u'Sport podkarpacki', u'http://www.echodnia.eu/sport_pod.xml'),
++                      (u'Pi\u0142ka no\u017cna', u'http://www.echodnia.eu/pilka.xml'),
++                      (u'Praca', u'http://www.echodnia.eu/praca.xml'),
++                      (u'Dom', u'http://www.echodnia.eu/dom.xml'),
++                      (u'Auto', u'http://www.echodnia.eu/auto.xml'),
++                      (u'Zdrowie', u'http://www.echodnia.eu/zdrowie.xml')]
++
++    def get_cover_url(self):
++        soup = self.index_to_soup(self.INDEX + '/apps/pbcs.dll/section?Category=JEDYNKI')
++        nexturl = self.INDEX + soup.find(id='covers').find('a')['href']
++        soup = self.index_to_soup(nexturl)
++        self.cover_url = self.INDEX + soup.find(id='cover').find(name='img')['src']
++        return getattr(self, 'cover_url', self.cover_url)
++
++    def append_page(self, soup, appendtag):
++        tag = soup.find('span', attrs={'class':'photoNavigationPages'})
++        if tag:
++            number = int(tag.string.rpartition('/')[-1].replace('&nbsp;', ''))
++            baseurl = self.INDEX + soup.find(attrs={'class':'photoNavigationNext'})['href'][:-1]
++
++            for r in appendtag.findAll(attrs={'class':'photoNavigation'}):
++                r.extract()
++            for nr in range(2, number+1):
++                soup2 = self.index_to_soup(baseurl + str(nr))
++                pagetext = soup2.find(id='photoContainer')
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoMeta'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoStoryText'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++
++    def preprocess_html(self, soup):
++        self.append_page(soup, soup.body)
++        return soup
 === modified file 'recipes/eioba.recipe'
 --- recipes/eioba.recipe	2012-04-18 04:03:44 +0000
 +++ recipes/eioba.recipe	2013-03-06 19:41:20 +0000
@@ -4,6 +4,7 @@
  class eioba(BasicNewsRecipe):
      title          = u'eioba'
      __author__        = 'fenuks'
++    description = u'eioba.pl - daj się przeczytać!'
      cover_url      = 'http://www.eioba.org/lay/logo_pl_v3.png'
      language       = 'pl'
      oldest_article = 7
 === modified file 'recipes/elektroda_pl.recipe'
 --- recipes/elektroda_pl.recipe	2012-06-07 06:45:56 +0000
 +++ recipes/elektroda_pl.recipe	2013-03-06 19:41:20 +0000
@@ -5,7 +5,7 @@
      title          = u'Elektroda'
      oldest_article = 8
      __author__        = 'fenuks'
--    description   = 'Elektroda.pl'
++    description   = 'Międzynarodowy portal elektroniczny udostępniający bogate zasoby z dziedziny elektroniki oraz forum dyskusyjne.'
      cover_url      = 'http://demotywatory.elektroda.pl/Thunderpic/logo.gif'
      category       = 'electronics'
      language       = 'pl'
 === modified file 'recipes/emuzica_pl.recipe'
 --- recipes/emuzica_pl.recipe	2012-04-18 04:03:44 +0000
 +++ recipes/emuzica_pl.recipe	2013-03-06 19:41:20 +0000
@@ -12,6 +12,7 @@
      no_stylesheets = True
      oldest_article = 7
      max_articles_per_feed = 100
++    remove_attributes = ['style']
      keep_only_tags=[dict(name='div', attrs={'id':'news_container'}), dict(name='h3'), dict(name='div', attrs={'class':'review_text'})]
      remove_tags=[dict(name='span', attrs={'id':'date'})]
      feeds          = [(u'Aktualno\u015bci', u'http://www.emuzyka.pl/rss.php?f=1'), (u'Recenzje', u'http://www.emuzyka.pl/rss.php?f=2')]
@@ -20,4 +21,4 @@
          for a in soup('a'):
              if a.has_key('href') and 'http://' not in a['href'] and 'https://' not in a['href']:
                  a['href']=self.index + a['href']
--        return soup
 \ No newline at end of file
++        return soup
 === modified file 'recipes/film_web.recipe'
 --- recipes/film_web.recipe	2012-12-14 09:32:01 +0000
 +++ recipes/film_web.recipe	2013-03-06 19:41:20 +0000
@@ -4,21 +4,21 @@
  class FilmWebPl(BasicNewsRecipe):
      title          = u'FilmWeb'
      __author__        = 'fenuks'
--    description   = 'FilmWeb - biggest polish movie site'
--    cover_url      = 'http://userlogos.org/files/logos/crudus/filmweb.png'
++    description   = 'Filmweb.pl - Filmy takie jak Ty Filmweb to największy i najczęściej odwiedzany polski serwis filmowy. Największa baza filmów, seriali i aktorów, repertuar kin i tv, ...'
++    cover_url      = 'http://gfx.filmweb.pl/n/logo-filmweb-bevel.jpg'
      category       = 'movies'
      language       = 'pl'
--    index='http://www.filmweb.pl'
++    index = 'http://www.filmweb.pl'
      oldest_article = 8
      max_articles_per_feed = 100
--    no_stylesheets= True
--    remove_empty_feeds=True
++    no_stylesheets = True
++    remove_empty_feeds = True
      ignore_duplicate_articles = {'title', 'url'}
      preprocess_regexps = [(re.compile(u'\(kliknij\,\ aby powiększyć\)', re.IGNORECASE), lambda m: ''), ]#(re.compile(ur' | ', re.IGNORECASE), lambda m: '')]
      extra_css      = '.hdrBig {font-size:22px;} ul {list-style-type:none; padding: 0; margin: 0;}'
--    remove_tags= [dict(name='div', attrs={'class':['recommendOthers']}), dict(name='ul', attrs={'class':'fontSizeSet'}), dict(attrs={'class':'userSurname anno'})]
++    remove_tags = [dict(name='div', attrs={'class':['recommendOthers']}), dict(name='ul', attrs={'class':'fontSizeSet'}), dict(attrs={'class':'userSurname anno'})]
      remove_attributes = ['style',]
--    keep_only_tags= [dict(name='h1', attrs={'class':['hdrBig', 'hdrEntity']}), dict(name='div', attrs={'class':['newsInfo', 'newsInfoSmall', 'reviewContent description']})]
++    keep_only_tags = [dict(name='h1', attrs={'class':['hdrBig', 'hdrEntity']}), dict(name='div', attrs={'class':['newsInfo', 'newsInfoSmall', 'reviewContent description']})]
      feeds          = [(u'News / Filmy w produkcji', 'http://www.filmweb.pl/feed/news/category/filminproduction'),
                           (u'News / Festiwale, nagrody i przeglądy', u'http://www.filmweb.pl/feed/news/category/festival'),
                           (u'News / Seriale', u'http://www.filmweb.pl/feed/news/category/serials'),
 === modified file 'recipes/focus_pl.recipe'
 --- recipes/focus_pl.recipe	2012-10-25 21:47:19 +0000
 +++ recipes/focus_pl.recipe	2013-03-06 19:41:20 +0000
@@ -13,7 +13,7 @@
      title = u'Focus'
      publisher = u'Gruner + Jahr Polska'
      category = u'News'
--    description = u'Newspaper'
++    description = u'Focus.pl - pierwszy w Polsce portal społecznościowy dla miłośników nauki. Tematyka: nauka, historia, cywilizacja, technika, przyroda, sport, gadżety'
      category = 'magazine'
      cover_url = ''
      remove_empty_feeds = True
 === modified file 'recipes/fotoblogia_pl.recipe'
 --- recipes/fotoblogia_pl.recipe	2012-04-18 04:14:36 +0000
 +++ recipes/fotoblogia_pl.recipe	2013-03-06 19:41:20 +0000
@@ -3,6 +3,7 @@
  class Fotoblogia_pl(BasicNewsRecipe):
      title          = u'Fotoblogia.pl'
      __author__        = 'fenuks'
++    description = u'Jeden z największych polskich blogów o fotografii.'
      category       = 'photography'
      language       = 'pl'
      masthead_url = 'http://img.interia.pl/komputery/nimg/u/0/fotoblogia21.jpg'
@@ -11,6 +12,6 @@
      max_articles_per_feed = 100
      no_stylesheets = True
      use_embedded_content = False
--    keep_only_tags=[dict(name='div', attrs={'class':'post-view post-standard'})]
++    keep_only_tags=[dict(name='div', attrs={'class':['post-view post-standard', 'photo-container']})]
      remove_tags=[dict(attrs={'class':['external fotoblogia', 'categories', 'tags']})]
      feeds          = [(u'Wszystko', u'http://fotoblogia.pl/feed/rss2')]
 === added file 'recipes/gazeta_krakowska.recipe'
 --- recipes/gazeta_krakowska.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/gazeta_krakowska.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,34 @@
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class GazetaKrakowska(BasicNewsRecipe):
++    title          = u'Gazeta Krakowska'
++    __author__        = 'fenuks'
++    description   = u'Gazeta Regionalna Gazeta Krakowska. Najnowsze Wiadomości Kraków. Informacje Kraków. Czytaj!'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    masthead_url = 'http://s.polskatimes.pl/g/logo_naglowek/gazetakrakowska.png?24'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds = True
++    no_stylesheets = True
++    use_embedded_content = False
++    ignore_duplicate_articles = {'title', 'url'}
++    #preprocess_regexps = [(re.compile(ur'<b>Czytaj także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur',<b>Czytaj też:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>Zobacz także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<center><h4><a.*?</a></h4></center>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TEŻ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ WIĘCEJ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TAKŻE:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>\* CZYTAJ KONIECZNIE:.*', re.DOTALL), lambda match: '</body>'), (re.compile(ur'<b>Nasze serwisy:</b>.*', re.DOTALL), lambda match: '</body>') ]
++    remove_tags_after= dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})
++    remove_tags=[dict(id='mat-podobne'), dict(name='a', attrs={'class':'czytajDalej'}), dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})]
++
++    feeds          = [(u'Fakty24', u'http://gazetakrakowska.feedsportal.com/c/32980/f/533770/index.rss?201302'), (u'Krak\xf3w', u'http://www.gazetakrakowska.pl/rss/gazetakrakowska_krakow.xml?201302'), (u'Tarn\xf3w', u'http://www.gazetakrakowska.pl/rss/gazetakrakowska_tarnow.xml?201302'), (u'Nowy S\u0105cz', u'http://www.gazetakrakowska.pl/rss/gazetakrakowska_nsacz.xml?201302'), (u'Ma\u0142. Zach.', u'http://www.gazetakrakowska.pl/rss/gazetakrakowska_malzach.xml?201302'), (u'Podhale', u'http://www.gazetakrakowska.pl/rss/gazetakrakowska_podhale.xml?201302'), (u'Sport', u'http://gazetakrakowska.feedsportal.com/c/32980/f/533771/index.rss?201302'), (u'Kultura', u'http://gazetakrakowska.feedsportal.com/c/32980/f/533772/index.rss?201302'), (u'Opinie', u'http://www.gazetakrakowska.pl/rss/gazetakrakowska_opinie.xml?201302'), (u'Magazyn', u'http://www.gazetakrakowska.pl/rss/gazetakrakowska_magazyn.xml?201302')]
++
++    def print_version(self, url):
++        return url.replace('artykul', 'drukuj')
++
++    def skip_ad_pages(self, soup):
++        if 'Advertisement' in soup.title:
++            nexturl=soup.find('a')['href']
++            return self.index_to_soup(nexturl, raw=True)
++
++    def get_cover_url(self):
++        soup = self.index_to_soup('http://www.prasa24.pl/gazeta/gazeta-krakowska/')
++        self.cover_url=soup.find(id='pojemnik').img['src']
++        return getattr(self, 'cover_url', self.cover_url)
 === added file 'recipes/gazeta_lubuska.recipe'
 --- recipes/gazeta_lubuska.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/gazeta_lubuska.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,64 @@
++import re
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class GazetaLubuska(BasicNewsRecipe):
++    title          = u'Gazeta Lubuska'
++    __author__        = 'fenuks'
++    description   = u'Gazeta Lubuska - portal regionalny województwa lubuskiego.'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    extra_css = 'ul {list-style: none; padding:0; margin:0;}'
++    INDEX = 'http://www.gazetalubuska.pl'
++    masthead_url = INDEX + '/images/top_logo.png'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds = True
++    no_stylesheets = True
++    ignore_duplicate_articles = {'title', 'url'}
++
++    preprocess_regexps = [(re.compile(ur'Czytaj:.*?</a>', re.DOTALL), lambda match: ''), (re.compile(ur'Przeczytaj także:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''),
++                          (re.compile(ur'Przeczytaj również:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''), (re.compile(ur'Zobacz też:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: '')]
++
++    keep_only_tags = [dict(id=['article', 'cover', 'photostory'])]
++    remove_tags = [dict(id=['articleTags', 'articleMeta', 'boxReadIt', 'articleGalleries', 'articleConnections',
++                  'ForumArticleComments', 'articleRecommend', 'jedynkiLinks', 'articleGalleryConnections',
++                  'photostoryConnections', 'articleEpaper', 'articlePoll', 'articleAlarm', 'articleByline']),
++                  dict(attrs={'class':'articleFunctions'})]
++
++    feeds          = [(u'Wszystkie', u'http://www.gazetalubuska.pl/rss.xml'), (u'Dreznenko', u'http://www.gazetalubuska.pl/drezdenko.xml'), (u'G\u0142og\xf3w', u'http://www.gazetalubuska.pl/glogow.xml'), (u'Gorz\xf3w Wielkopolski', u'http://www.gazetalubuska.pl/gorzow-wielkopolski.xml'), (u'Gubin', u'http://www.gazetalubuska.pl/gubin.xml'), (u'Kostrzyn', u'http://www.gazetalubuska.pl/kostrzyn.xml'), (u'Krosno Odrza\u0144skie', u'http://www.gazetalubuska.pl/krosno-odrzanskie.xml'), (u'Lubsko', u'http://www.gazetalubuska.pl/lubsko.xml'), (u'Mi\u0119dzych\xf3d', u'http://www.gazetalubuska.pl/miedzychod.xml'), (u'Mi\u0119dzyrzecz', u'http://www.gazetalubuska.pl/miedzyrzecz.xml'), (u'Nowa S\xf3l', u'http://www.gazetalubuska.pl/nowa-sol.xml'), (u'S\u0142ubice', u'http://www.gazetalubuska.pl/slubice.xml'), (u'Strzelce Kraje\u0144skie', u'http://www.gazetalubuska.pl/strzelce-krajenskie.xml'), (u'Sulech\xf3w', u'http://www.gazetalubuska.pl/sulechow.xml'), (u'Sul\u0119cin', u'http://www.gazetalubuska.pl/sulecin.xml'), (u'\u015awi\u0119bodzin', u'http://www.gazetalubuska.pl/swiebodzin.xml'), (u'Wolsztyn', u'http://www.gazetalubuska.pl/wolsztyn.xml'), (u'Wschowa', u'http://www.gazetalubuska.pl/wschowa.xml'), (u'Zielona G\xf3ra', u'http://www.gazetalubuska.pl/zielona-gora.xml'), (u'\u017baga\u0144', u'http://www.gazetalubuska.pl/zagan.xml'), (u'\u017bary', u'http://www.gazetalubuska.pl/zary.xml'), (u'Sport', u'http://www.gazetalubuska.pl/sport.xml'), (u'Auto', u'http://www.gazetalubuska.pl/auto.xml'), (u'Dom', u'http://www.gazetalubuska.pl/dom.xml'), (u'Praca', u'http://www.gazetalubuska.pl/praca.xml'), (u'Zdrowie', u'http://www.gazetalubuska.pl/zdrowie.xml')]
++
++
++    def get_cover_url(self):
++        soup = self.index_to_soup(self.INDEX + '/apps/pbcs.dll/section?Category=JEDYNKI')
++        nexturl = self.INDEX + soup.find(id='covers').find('a')['href']
++        soup = self.index_to_soup(nexturl)
++        self.cover_url = self.INDEX + soup.find(id='cover').find(name='img')['src']
++        return getattr(self, 'cover_url', self.cover_url)
++
++    def append_page(self, soup, appendtag):
++        tag = soup.find('span', attrs={'class':'photoNavigationPages'})
++        if tag:
++            number = int(tag.string.rpartition('/')[-1].replace('&nbsp;', ''))
++            baseurl = self.INDEX + soup.find(attrs={'class':'photoNavigationNext'})['href'][:-1]
++
++            for r in appendtag.findAll(attrs={'class':'photoNavigation'}):
++                r.extract()
++            for nr in range(2, number+1):
++                soup2 = self.index_to_soup(baseurl + str(nr))
++                pagetext = soup2.find(id='photoContainer')
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoMeta'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoStoryText'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++
++    def preprocess_html(self, soup):
++        self.append_page(soup, soup.body)
++        return soup
 === modified file 'recipes/gazeta_pomorska.recipe'
 --- recipes/gazeta_pomorska.recipe	2013-03-04 23:06:55 +0000
 +++ recipes/gazeta_pomorska.recipe	2013-03-06 19:41:20 +0000
@@ -1,102 +1,91 @@
--#!/usr/bin/env  python
--
--# # Przed uzyciem przeczytaj komentarz w sekcji "feeds"
--
--__license__   = 'GPL v3'
--__copyright__ = u'2010, Richard z forum.eksiazki.org'
--'''pomorska.pl'''
--
  import re
  from calibre.web.feeds.news import BasicNewsRecipe
  class GazetaPomorska(BasicNewsRecipe):
      title          = u'Gazeta Pomorska'
--    publisher      = u'Gazeta Pomorska'
--    description    = u'Kujawy i Pomorze - wiadomo\u015bci'
++    __author__        = 'Richard z forum.eksiazki.org, fenuks'
++    description   = u'Gazeta Pomorska - portal regionalny'
++    category       = 'newspaper'
      language       = 'pl'
--    __author__     = u'Richard z forum.eksiazki.org'
--                      # # (dziekuje t3d z forum.eksiazki.org za testy)
--    oldest_article = 2
--    max_articles_per_feed = 20
++    encoding = 'iso-8859-2'
++    extra_css = 'ul {list-style: none; padding:0; margin:0;}'
++    INDEX = 'http://www.pomorska.pl'
++    masthead_url = INDEX + '/images/top_logo.png'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds = True
      no_stylesheets = True
--    remove_javascript = True
--    preprocess_regexps     = [
--         (re.compile(r'<a href="http://maps.google[^>]*>[^<]*</a>\.*', re.DOTALL|re.IGNORECASE), lambda m: ''),
--         (re.compile(r'[<Bb >]*Poznaj opinie[^<]*[</Bb >]*[^<]*<a href[^>]*>[^<]*</a>\.*', re.DOTALL|re.IGNORECASE), lambda m: ''),
--         (re.compile(r'[<Bb >]*Przeczytaj[^<]*[</Bb >]*[^<]*<a href[^>]*>[^<]*</a>\.*', re.DOTALL|re.IGNORECASE), lambda m: ''),
--         (re.compile(r'[<Bb >]*Wi.cej informacji[^<]*[</Bb >]*[^<]*<a href[^>]*>[^<]*</a>\.*', re.DOTALL|re.IGNORECASE), lambda m: ''),
--         (re.compile(r'<a href[^>]*>[<Bb >]*Wideo[^<]*[</Bb >]*[^<]*</a>\.*', re.DOTALL|re.IGNORECASE), lambda m: ''),
--         (re.compile(r'<a href[^>]*>[<Bb >]*KLIKNIJ TUTAJ[^<]*[</Bb >]*[^<]*</a>\.*', re.DOTALL|re.IGNORECASE), lambda m: '')
--      ]
--
--    feeds = [
--# # Tutaj jest wymieniona lista kategorii jakie mozemy otrzymywac z Gazety
--# # Pomorskiej, po jednej kategorii w wierszu. Jesli na poczatku danego wiersza
--# # znajduje sie jeden znak "#", oznacza to ze kategoria jest zakomentowana
--# # i nie bedziemy jej otrzymywac. Jesli chcemy ja otrzymywac nalezy usunac
--# # znak # z jej wiersza.
--# # Jesli subskrybujemy wiecej niz jedna kategorie, na koncu wiersza z kazda
--# # kategoria musi sie znajdowac niezakomentowany przecinek, z wyjatkiem
--# # ostatniego wiersza - ma byc bez przecinka na koncu.
--# # Rekomendowane opcje wyboru kategorii:
--# # 1. PomorskaRSS - wiadomosci kazdego typu, lub
--# # 2. Region + wybrane miasta, lub
--# # 3. Wiadomosci tematyczne.
--# # Lista kategorii:
--
--             # # PomorskaRSS - wiadomosci kazdego typu, zakomentuj znakiem "#"
--             # # przed odkomentowaniem wiadomosci wybranego typu:
--             (u'PomorskaRSS', u'http://www.pomorska.pl/rss.xml')
--
--             # # wiadomosci z regionu nie przypisane do okreslonego miasta:
--             # (u'Region', u'http://www.pomorska.pl/region.xml'),
--
--             # # wiadomosci przypisane do miast:
--             # (u'Bydgoszcz', u'http://www.pomorska.pl/bydgoszcz.xml'),
--             # (u'Nak\u0142o', u'http://www.pomorska.pl/naklo.xml'),
--             # (u'Koronowo', u'http://www.pomorska.pl/koronowo.xml'),
--             # (u'Solec Kujawski', u'http://www.pomorska.pl/soleckujawski.xml'),
--             # (u'Grudzi\u0105dz', u'http://www.pomorska.pl/grudziadz.xml'),
--             # (u'Inowroc\u0142aw', u'http://www.pomorska.pl/inowroclaw.xml'),
--             # (u'Toru\u0144', u'http://www.pomorska.pl/torun.xml'),
--             # (u'W\u0142oc\u0142awek', u'http://www.pomorska.pl/wloclawek.xml'),
--             # (u'Aleksandr\u00f3w Kujawski', u'http://www.pomorska.pl/aleksandrow.xml'),
--             # (u'Brodnica', u'http://www.pomorska.pl/brodnica.xml'),
--             # (u'Che\u0142mno', u'http://www.pomorska.pl/chelmno.xml'),
--             # (u'Chojnice', u'http://www.pomorska.pl/chojnice.xml'),
--             # (u'Ciechocinek', u'http://www.pomorska.pl/ciechocinek.xml'),
--             # (u'Golub Dobrzy\u0144', u'http://www.pomorska.pl/golubdobrzyn.xml'),
--             # (u'Mogilno', u'http://www.pomorska.pl/mogilno.xml'),
--             # (u'Radziej\u00f3w', u'http://www.pomorska.pl/radziejow.xml'),
--             # (u'Rypin', u'http://www.pomorska.pl/rypin.xml'),
--             # (u'S\u0119p\u00f3lno', u'http://www.pomorska.pl/sepolno.xml'),
--             # (u'\u015awiecie', u'http://www.pomorska.pl/swiecie.xml'),
--             # (u'Tuchola', u'http://www.pomorska.pl/tuchola.xml'),
--             # (u'\u017bnin', u'http://www.pomorska.pl/znin.xml')
--
--             # # wiadomosci tematyczne (redundancja z region/miasta):
--             # (u'Sport', u'http://www.pomorska.pl/sport.xml'),
--             # (u'Zdrowie', u'http://www.pomorska.pl/zdrowie.xml'),
--             # (u'Auto', u'http://www.pomorska.pl/moto.xml'),
--             # (u'Dom', u'http://www.pomorska.pl/dom.xml'),
--             # (u'Reporta\u017c', u'http://www.pomorska.pl/reportaz.xml'),
--             # (u'Gospodarka', u'http://www.pomorska.pl/gospodarka.xml')
--           ]
--
--    keep_only_tags = [dict(name='div', attrs={'id':'article'})]
--
--    remove_tags = [
--        dict(name='p', attrs={'id':'articleTags'}),
--        dict(name='div', attrs={'id':'articleEpaper'}),
--        dict(name='div', attrs={'id':'articleConnections'}),
--        dict(name='div', attrs={'class':'articleFacts'}),
--        dict(name='div', attrs={'id':'articleExternalLink'}),
--        dict(name='div', attrs={'id':'articleMultimedia'}),
--        dict(name='div', attrs={'id':'articleGalleries'}),
--        dict(name='div', attrs={'id':'articleAlarm'}),
--        dict(name='div', attrs={'id':'adholder_srodek1'}),
--        dict(name='div', attrs={'id':'articleVideo'}),
--        dict(name='a', attrs={'name':'fb_share'})]
--
--    extra_css = '''h1 { font-size: 1.4em; }
--                        h2 { font-size: 1.0em; }'''
++    ignore_duplicate_articles = {'title', 'url'}
++
++    preprocess_regexps = [(re.compile(ur'Czytaj:.*?</a>', re.DOTALL), lambda match: ''), (re.compile(ur'Przeczytaj także:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''),
++                          (re.compile(ur'Przeczytaj również:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''), (re.compile(ur'Zobacz też:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: '')]
++
++    keep_only_tags = [dict(id=['article', 'cover', 'photostory'])]
++    remove_tags = [dict(id=['articleTags', 'articleMeta', 'boxReadIt', 'articleGalleries', 'articleConnections',
++                  'ForumArticleComments', 'articleRecommend', 'jedynkiLinks', 'articleGalleryConnections',
++                  'photostoryConnections', 'articleEpaper', 'articlePoll', 'articleAlarm', 'articleByline']),
++                  dict(attrs={'class':'articleFunctions'})]
++
++    feeds          = [(u'Wszystkie', u'http://www.pomorska.pl/rss.xml'),
++                      (u'Region', u'http://www.pomorska.pl/region.xml'),
++                      (u'Bydgoszcz', u'http://www.pomorska.pl/bydgoszcz.xml'),
++                      (u'Nakło', u'http://www.pomorska.pl/naklo.xml'),
++                      (u'Koronowo', u'http://www.pomorska.pl/koronowo.xml'),
++                      (u'Solec Kujawski', u'http://www.pomorska.pl/soleckujawski.xml'),
++                      (u'Grudziądz', u'http://www.pomorska.pl/grudziadz.xml'),
++                      (u'Inowrocław', u'http://www.pomorska.pl/inowroclaw.xml'),
++                      (u'Toruń', u'http://www.pomorska.pl/torun.xml'),
++                      (u'Włocławek', u'http://www.pomorska.pl/wloclawek.xml'),
++                      (u'Aleksandrów Kujawski', u'http://www.pomorska.pl/aleksandrow.xml'),
++                      (u'Brodnica', u'http://www.pomorska.pl/brodnica.xml'),
++                      (u'Chełmno', u'http://www.pomorska.pl/chelmno.xml'),
++                      (u'Chojnice', u'http://www.pomorska.pl/chojnice.xml'),
++                      (u'Ciechocinek', u'http://www.pomorska.pl/ciechocinek.xml'),
++                      (u'Golub-Dobrzyń', u'http://www.pomorska.pl/golubdobrzyn.xml'),
++                      (u'Mogilno', u'http://www.pomorska.pl/mogilno.xml'),
++                      (u'Radziejów', u'http://www.pomorska.pl/radziejow.xml'),
++                      (u'Rypin', u'http://www.pomorska.pl/rypin.xml'),
++                      (u'Sępólno', u'http://www.pomorska.pl/sepolno.xml'),
++                      (u'Świecie', u'http://www.pomorska.pl/swiecie.xml'),
++                      (u'Tuchola', u'http://www.pomorska.pl/tuchola.xml'),
++                      (u'Żnin', u'http://www.pomorska.pl/znin.xml'),
++                      (u'Sport', u'http://www.pomorska.pl/sport.xml'),
++                      (u'Zdrowie', u'http://www.pomorska.pl/zdrowie.xml'),
++                      (u'Auto', u'http://www.pomorska.pl/moto.xml'),
++                      (u'Dom', u'http://www.pomorska.pl/dom.xml'),
++                     #(u'Reporta\u017c', u'http://www.pomorska.pl/reportaz.xml'),
++                      (u'Gospodarka', u'http://www.pomorska.pl/gospodarka.xml')]
++
++    def get_cover_url(self):
++        soup = self.index_to_soup(self.INDEX + '/apps/pbcs.dll/section?Category=JEDYNKI')
++        nexturl = self.INDEX + soup.find(id='covers').find('a')['href']
++        soup = self.index_to_soup(nexturl)
++        self.cover_url = self.INDEX + soup.find(id='cover').find(name='img')['src']
++        return getattr(self, 'cover_url', self.cover_url)
++
++    def append_page(self, soup, appendtag):
++        tag = soup.find('span', attrs={'class':'photoNavigationPages'})
++        if tag:
++            number = int(tag.string.rpartition('/')[-1].replace('&nbsp;', ''))
++            baseurl = self.INDEX + soup.find(attrs={'class':'photoNavigationNext'})['href'][:-1]
++
++            for r in appendtag.findAll(attrs={'class':'photoNavigation'}):
++                r.extract()
++            for nr in range(2, number+1):
++                soup2 = self.index_to_soup(baseurl + str(nr))
++                pagetext = soup2.find(id='photoContainer')
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoMeta'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoStoryText'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++
++    def preprocess_html(self, soup):
++        self.append_page(soup, soup.body)
++        return soup
 === added file 'recipes/gazeta_wroclawska.recipe'
 --- recipes/gazeta_wroclawska.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/gazeta_wroclawska.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,34 @@
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class GazetaWroclawska(BasicNewsRecipe):
++    title          = u'Gazeta Wroc\u0142awska'
++    __author__        = 'fenuks'
++    description   = u'Gazeta Regionalna Gazeta Wrocławska. Najnowsze Wiadomości Wrocław, Informacje Wrocław. Czytaj!'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    masthead_url = 'http://s.polskatimes.pl/g/logo_naglowek/gazetawroclawska.png?24'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds = True
++    no_stylesheets = True
++    use_embedded_content = False
++    ignore_duplicate_articles = {'title', 'url'}
++    #preprocess_regexps = [(re.compile(ur'<b>Czytaj także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur',<b>Czytaj też:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>Zobacz także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<center><h4><a.*?</a></h4></center>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TEŻ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ WIĘCEJ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TAKŻE:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>\* CZYTAJ KONIECZNIE:.*', re.DOTALL), lambda match: '</body>'), (re.compile(ur'<b>Nasze serwisy:</b>.*', re.DOTALL), lambda match: '</body>') ]
++    remove_tags_after= dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})
++    remove_tags=[dict(id='mat-podobne'), dict(name='a', attrs={'class':'czytajDalej'}), dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})]
++
++    feeds          = [(u'Fakty24', u'http://gazetawroclawska.feedsportal.com/c/32980/f/533775/index.rss?201302'), (u'Region', u'http://www.gazetawroclawska.pl/rss/gazetawroclawska_region.xml?201302'), (u'Kultura', u'http://gazetawroclawska.feedsportal.com/c/32980/f/533777/index.rss?201302'), (u'Sport', u'http://gazetawroclawska.feedsportal.com/c/32980/f/533776/index.rss?201302'), (u'Z archiwum', u'http://www.gazetawroclawska.pl/rss/gazetawroclawska_zarchiwum.xml?201302'), (u'M\xf3j reporter', u'http://www.gazetawroclawska.pl/rss/gazetawroclawska_mojreporter.xml?201302'), (u'Historia', u'http://www.gazetawroclawska.pl/rss/gazetawroclawska_historia.xml?201302'), (u'Listy do redakcji', u'http://www.gazetawroclawska.pl/rss/gazetawroclawska_listydoredakcji.xml?201302'), (u'Na drogach', u'http://www.gazetawroclawska.pl/rss/gazetawroclawska_nadrogach.xml?201302')]
++
++    def print_version(self, url):
++        return url.replace('artykul', 'drukuj')
++
++    def skip_ad_pages(self, soup):
++        if 'Advertisement' in soup.title:
++            nexturl=soup.find('a')['href']
++            return self.index_to_soup(nexturl, raw=True)
++
++    def get_cover_url(self):
++        soup = self.index_to_soup('http://www.prasa24.pl/gazeta/gazeta-wroclawska/')
++        self.cover_url=soup.find(id='pojemnik').img['src']
++        return getattr(self, 'cover_url', self.cover_url)
 === added file 'recipes/gazeta_wspolczesna.recipe'
 --- recipes/gazeta_wspolczesna.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/gazeta_wspolczesna.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,63 @@
++import re
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class GazetaWspolczesna(BasicNewsRecipe):
++    title          = u'Gazeta Wsp\xf3\u0142czesna'
++    __author__        = 'fenuks'
++    description   = u'Gazeta Współczesna - portal regionalny.'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    extra_css = 'ul {list-style: none; padding:0; margin:0;}'
++    INDEX = 'http://www.wspolczesna.pl'
++    masthead_url = INDEX + '/images/top_logo.png'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds = True
++    no_stylesheets = True
++    ignore_duplicate_articles = {'title', 'url'}
++
++    preprocess_regexps = [(re.compile(ur'Czytaj:.*?</a>', re.DOTALL), lambda match: ''), (re.compile(ur'Przeczytaj także:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''),
++                          (re.compile(ur'Przeczytaj również:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''), (re.compile(ur'Zobacz też:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: '')]
++
++    keep_only_tags = [dict(id=['article', 'cover', 'photostory'])]
++    remove_tags = [dict(id=['articleTags', 'articleMeta', 'boxReadIt', 'articleGalleries', 'articleConnections',
++                  'ForumArticleComments', 'articleRecommend', 'jedynkiLinks', 'articleGalleryConnections',
++                  'photostoryConnections', 'articleEpaper', 'articlePoll', 'articleAlarm', 'articleByline']),
++                  dict(attrs={'class':'articleFunctions'})]
++
++    feeds          = [(u'Wszystkie', u'http://www.wspolczesna.pl/rss.xml'), (u'August\xf3w', u'http://www.wspolczesna.pl/augustow.xml'), (u'Bia\u0142ystok', u'http://www.wspolczesna.pl/bialystok.xml'), (u'Bielsk Podlaski', u'http://www.wspolczesna.pl/bielsk.xml'), (u'E\u0142k', u'http://www.wspolczesna.pl/elk.xml'), (u'Grajewo', u'http://www.wspolczesna.pl/grajewo.xml'), (u'Go\u0142dap', u'http://www.wspolczesna.pl/goldap.xml'), (u'Hajn\xf3wka', u'http://www.wspolczesna.pl/hajnowka.xml'), (u'Kolno', u'http://www.wspolczesna.pl/kolno.xml'), (u'\u0141om\u017ca', u'http://www.wspolczesna.pl/lomza.xml'), (u'Mo\u0144ki', u'http://www.wspolczesna.pl/monki.xml'), (u'Olecko', u'http://www.wspolczesna.pl/olecko.xml'), (u'Ostro\u0142\u0119ka', u'http://www.wspolczesna.pl/ostroleka.xml'), (u'Powiat Bia\u0142ostocki', u'http://www.wspolczesna.pl/powiat.xml'), (u'Sejny', u'http://www.wspolczesna.pl/sejny.xml'), (u'Siemiatycze', u'http://www.wspolczesna.pl/siemiatycze.xml'), (u'Sok\xf3\u0142ka', u'http://www.wspolczesna.pl/sokolka.xml'), (u'Suwa\u0142ki', u'http://www.wspolczesna.pl/suwalki.xml'), (u'Wysokie Mazowieckie', u'http://www.wspolczesna.pl/wysokie.xml'), (u'Zambr\xf3w', u'http://www.wspolczesna.pl/zambrow.xml'), (u'Sport', u'http://www.wspolczesna.pl/sport.xml'), (u'Praca', u'http://www.wspolczesna.pl/praca.xml'), (u'Dom', u'http://www.wspolczesna.pl/dom.xml'), (u'Auto', u'http://www.wspolczesna.pl/auto.xml'), (u'Zdrowie', u'http://www.wspolczesna.pl/zdrowie.xml')]
++
++    def get_cover_url(self):
++        soup = self.index_to_soup(self.INDEX + '/apps/pbcs.dll/section?Category=JEDYNKI')
++        nexturl = self.INDEX + soup.find(id='covers').find('a')['href']
++        soup = self.index_to_soup(nexturl)
++        self.cover_url = self.INDEX + soup.find(id='cover').find(name='img')['src']
++        return getattr(self, 'cover_url', self.cover_url)
++
++    def append_page(self, soup, appendtag):
++        tag = soup.find('span', attrs={'class':'photoNavigationPages'})
++        if tag:
++            number = int(tag.string.rpartition('/')[-1].replace('&nbsp;', ''))
++            baseurl = self.INDEX + soup.find(attrs={'class':'photoNavigationNext'})['href'][:-1]
++
++            for r in appendtag.findAll(attrs={'class':'photoNavigation'}):
++                r.extract()
++            for nr in range(2, number+1):
++                soup2 = self.index_to_soup(baseurl + str(nr))
++                pagetext = soup2.find(id='photoContainer')
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoMeta'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoStoryText'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++
++    def preprocess_html(self, soup):
++        self.append_page(soup, soup.body)
++        return soup
 === modified file 'recipes/gazeta_wyborcza.recipe'
 --- recipes/gazeta_wyborcza.recipe	2012-11-11 12:08:23 +0000
 +++ recipes/gazeta_wyborcza.recipe	2013-03-06 19:41:20 +0000
@@ -6,7 +6,7 @@
      title = u'Gazeta.pl'
      __author__ = 'fenuks, Artur Stachecki'
      language = 'pl'
--    description = 'news from gazeta.pl'
++    description = 'Wiadomości z Polski i ze świata. Serwisy tematyczne i lokalne w 20 miastach.'
      category = 'newspaper'
      publication_type = 'newspaper'
      masthead_url = 'http://bi.gazeta.pl/im/5/10285/z10285445AA.jpg'
 === added file 'recipes/gcn.recipe'
 --- recipes/gcn.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/gcn.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,83 @@
++import re
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class GCN(BasicNewsRecipe):
++    title          = u'Gazeta Codziennej Nowiny'
++    __author__        = 'fenuks'
++    description   = u'nowiny24.pl - portal regionalny województwa podkarpackiego.'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    extra_css = 'ul {list-style: none; padding:0; margin:0;}'
++    INDEX = 'http://www.nowiny24.pl'
++    masthead_url = INDEX + '/images/top_logo.png'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds = True
++    no_stylesheets = True
++    ignore_duplicate_articles = {'title', 'url'}
++
++    preprocess_regexps = [(re.compile(ur'Czytaj:.*?</a>', re.DOTALL), lambda match: ''), (re.compile(ur'Przeczytaj także:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''),
++                          (re.compile(ur'Przeczytaj również:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''), (re.compile(ur'Zobacz też:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: '')]
++
++    keep_only_tags = [dict(id=['article', 'cover', 'photostory'])]
++    remove_tags = [dict(id=['articleTags', 'articleMeta', 'boxReadIt', 'articleGalleries', 'articleConnections',
++                  'ForumArticleComments', 'articleRecommend', 'jedynkiLinks', 'articleGalleryConnections',
++                  'photostoryConnections', 'articleEpaper', 'articlePoll', 'articleAlarm', 'articleByline']),
++                  dict(attrs={'class':'articleFunctions'})]
++
++    feeds          = [(u'Wszystkie', u'http://www.nowiny24.pl/rss.xml'),
++                      (u'Podkarpacie', u'http://www.nowiny24.pl/podkarpacie.xml'),
++                      (u'Bieszczady', u'http://www.nowiny24.pl/bieszczady.xml'),
++                      (u'Rzeszów', u'http://www.nowiny24.pl/rzeszow.xml'),
++                      (u'Przemyśl', u'http://www.nowiny24.pl/przemysl.xml'),
++                      (u'Leżajsk', u'http://www.nowiny24.pl/lezajsk.xml'),
++                      (u'Łańcut', u'http://www.nowiny24.pl/lancut.xml'),
++                      (u'Dębica', u'http://www.nowiny24.pl/debica.xml'),
++                      (u'Jarosław', u'http://www.nowiny24.pl/jaroslaw.xml'),
++                      (u'Krosno', u'http://www.nowiny24.pl/krosno.xml'),
++                      (u'Mielec', u'http://www.nowiny24.pl/mielec.xml'),
++                      (u'Nisko', u'http://www.nowiny24.pl/nisko.xml'),
++                      (u'Sanok', u'http://www.nowiny24.pl/sanok.xml'),
++                      (u'Stalowa Wola', u'http://www.nowiny24.pl/stalowawola.xml'),
++                      (u'Tarnobrzeg', u'http://www.nowiny24.pl/tarnobrzeg.xml'),
++                      (u'Sport', u'http://www.nowiny24.pl/sport.xml'),
++                      (u'Dom', u'http://www.nowiny24.pl/dom.xml'),
++                      (u'Auto', u'http://www.nowiny24.pl/auto.xml'),
++                      (u'Praca', u'http://www.nowiny24.pl/praca.xml'),
++                      (u'Zdrowie', u'http://www.nowiny24.pl/zdrowie.xml'),
++                      (u'Wywiady', u'http://www.nowiny24.pl/wywiady.xml')]
++
++    def get_cover_url(self):
++        soup = self.index_to_soup(self.INDEX + '/apps/pbcs.dll/section?Category=JEDYNKI')
++        nexturl = self.INDEX + soup.find(id='covers').find('a')['href']
++        soup = self.index_to_soup(nexturl)
++        self.cover_url = self.INDEX + soup.find(id='cover').find(name='img')['src']
++        return getattr(self, 'cover_url', self.cover_url)
++
++    def append_page(self, soup, appendtag):
++        tag = soup.find('span', attrs={'class':'photoNavigationPages'})
++        if tag:
++            number = int(tag.string.rpartition('/')[-1].replace('&nbsp;', ''))
++            baseurl = self.INDEX + soup.find(attrs={'class':'photoNavigationNext'})['href'][:-1]
++
++            for r in appendtag.findAll(attrs={'class':'photoNavigation'}):
++                r.extract()
++            for nr in range(2, number+1):
++                soup2 = self.index_to_soup(baseurl + str(nr))
++                pagetext = soup2.find(id='photoContainer')
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoMeta'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoStoryText'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++
++    def preprocess_html(self, soup):
++        self.append_page(soup, soup.body)
++        return soup
 === added file 'recipes/glos_wielkopolski.recipe'
 --- recipes/glos_wielkopolski.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/glos_wielkopolski.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,34 @@
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class GlosWielkopolski(BasicNewsRecipe):
++    title          = u'G\u0142os Wielkopolski'
++    __author__        = 'fenuks'
++    description   = u'Gazeta Regionalna Głos Wielkopolski. Najnowsze Wiadomości Poznań. Czytaj Informacje Poznań!'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    masthead_url = 'http://s.polskatimes.pl/g/logo_naglowek/gloswielkopolski.png?24'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds= True
++    no_stylesheets = True
++    use_embedded_content = False
++    ignore_duplicate_articles = {'title', 'url'}
++    #preprocess_regexps = [(re.compile(ur'<b>Czytaj także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur',<b>Czytaj też:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>Zobacz także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<center><h4><a.*?</a></h4></center>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TEŻ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ WIĘCEJ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TAKŻE:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>\* CZYTAJ KONIECZNIE:.*', re.DOTALL), lambda match: '</body>'), (re.compile(ur'<b>Nasze serwisy:</b>.*', re.DOTALL), lambda match: '</body>') ]
++    remove_tags_after= dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})
++    remove_tags=[dict(id='mat-podobne'), dict(name='a', attrs={'class':'czytajDalej'}), dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})]
++
++    feeds          = [(u'Wszystkie', u'http://gloswielkopolski.feedsportal.com/c/32980/f/533779/index.rss?201302'), (u'Wiadomo\u015bci', u'http://gloswielkopolski.feedsportal.com/c/32980/f/533780/index.rss?201302'), (u'Sport', u'http://gloswielkopolski.feedsportal.com/c/32980/f/533781/index.rss?201302'), (u'Kultura', u'http://gloswielkopolski.feedsportal.com/c/32980/f/533782/index.rss?201302'), (u'Porady', u'http://www.gloswielkopolski.pl/rss/gloswielkopolski_porady.xml?201302'), (u'Blogi', u'http://www.gloswielkopolski.pl/rss/gloswielkopolski_blogi.xml?201302'), (u'Nasze akcje', u'http://www.gloswielkopolski.pl/rss/gloswielkopolski_naszeakcje.xml?201302'), (u'Opinie', u'http://www.gloswielkopolski.pl/rss/gloswielkopolski_opinie.xml?201302'), (u'Magazyn', u'http://www.gloswielkopolski.pl/rss/gloswielkopolski_magazyn.xml?201302')]
++
++    def print_version(self, url):
++        return url.replace('artykul', 'drukuj')
++
++    def skip_ad_pages(self, soup):
++        if 'Advertisement' in soup.title:
++            nexturl=soup.find('a')['href']
++            return self.index_to_soup(nexturl, raw=True)
++
++    def get_cover_url(self):
++        soup = self.index_to_soup('http://www.prasa24.pl/gazeta/glos-wielkopolski/')
++        self.cover_url=soup.find(id='pojemnik').img['src']
++        return getattr(self, 'cover_url', self.cover_url)
 === modified file 'recipes/gram_pl.recipe'
 --- recipes/gram_pl.recipe	2012-12-14 09:32:01 +0000
 +++ recipes/gram_pl.recipe	2013-03-06 19:41:20 +0000
@@ -11,15 +11,14 @@
      max_articles_per_feed = 100
      ignore_duplicate_articles = {'title', 'url'}
      no_stylesheets= True
++    remove_empty_feeds = True
      #extra_css = 'h2 {font-style: italic;  font-size:20px;} .picbox div {float: left;}'
      cover_url=u'http://www.gram.pl/www/01/img/grampl_zima.png'
      keep_only_tags= [dict(id='articleModule')]
--    remove_tags = [dict(attrs={'class':['breadCrump', 'dymek', 'articleFooter']})]
++    remove_tags = [dict(attrs={'class':['breadCrump', 'dymek', 'articleFooter', 'twitter-share-button']})]
      feeds          = [(u'Informacje', u'http://www.gram.pl/feed_news.asp'),
--	      (u'Publikacje', u'http://www.gram.pl/feed_news.asp?type=articles'),
--                (u'Kolektyw- Indie Games', u'http://indie.gram.pl/feed/'),
--                #(u'Kolektyw- Moto Games', u'http://www.motogames.gram.pl/news.rss')
--                ]
++                        (u'Publikacje', u'http://www.gram.pl/feed_news.asp?type=articles')
++                        ]
      def parse_feeds (self):
        feeds = BasicNewsRecipe.parse_feeds(self)
 === modified file 'recipes/gry_online_pl.recipe'
 --- recipes/gry_online_pl.recipe	2012-10-17 14:12:08 +0000
 +++ recipes/gry_online_pl.recipe	2013-03-06 19:41:20 +0000
@@ -1,20 +1,23 @@
++import time
  from calibre.web.feeds.recipes import BasicNewsRecipe
  class GryOnlinePl(BasicNewsRecipe):
      title          = u'Gry-Online.pl'
      __author__        = 'fenuks'
--    description   = 'Gry-Online.pl - computer games'
++    description   = u'Wiadomości o grach, recenzje, zapowiedzi. Encyklopedia Gier zawiera opisy gier na PC, konsole Xbox360, PS3 i inne platformy.'
      category       = 'games'
      language       = 'pl'
      oldest_article = 13
--    INDEX= 'http://www.gry-online.pl/'
--    masthead_url='http://www.gry-online.pl/im/gry-online-logo.png'
--    cover_url='http://www.gry-online.pl/im/gry-online-logo.png'
++    INDEX = 'http://www.gry-online.pl/'
++    masthead_url = 'http://www.gry-online.pl/im/gry-online-logo.png'
++    cover_url = 'http://www.gry-online.pl/im/gry-online-logo.png'
      max_articles_per_feed = 100
--    no_stylesheets= True
--    keep_only_tags=[dict(name='div', attrs={'class':['gc660', 'gc660 S013']})]
--    remove_tags=[dict({'class':['nav-social', 'add-info', 'smlb', 'lista lista3 lista-gry', 'S013po', 'S013-npb', 'zm_gfx_cnt_bottom', 'ocen-txt', 'wiecej-txt', 'wiecej-txt2']})]
--    feeds          = [(u'Newsy', 'http://www.gry-online.pl/rss/news.xml'), ('Teksty', u'http://www.gry-online.pl/rss/teksty.xml')]
++    no_stylesheets = True
++    keep_only_tags = [dict(name='div', attrs={'class':['gc660', 'gc660 S013', 'news_endpage_tit', 'news_container', 'news']})]
++    remove_tags = [dict({'class':['nav-social', 'add-info', 'smlb', 'lista lista3 lista-gry', 'S013po', 'S013-npb', 'zm_gfx_cnt_bottom', 'ocen-txt', 'wiecej-txt', 'wiecej-txt2']})]
++    feeds          = [
++                     (u'Newsy', 'http://www.gry-online.pl/rss/news.xml'),
++                     ('Teksty', u'http://www.gry-online.pl/rss/teksty.xml')]
      def append_page(self, soup, appendtag):
@@ -24,7 +27,14 @@
              url_part = soup.find('link', attrs={'rel':'canonical'})['href']
              url_part = url_part[25:].rpartition('?')[0]
              for nexturl in nexturls[1:-1]:
--                soup2 = self.index_to_soup('http://www.gry-online.pl/' + url_part + nexturl['href'])
++                finalurl = 'http://www.gry-online.pl/' + url_part + nexturl['href']
++                for i in range(10):
++                    try:
++                        soup2 = self.index_to_soup(finalurl)
++                        break
++                    except:
++                        print 'retrying in 0.5s'
++                        time.sleep(0.5)
                  pagetext = soup2.find(attrs={'class':'gc660'})
                  for r in pagetext.findAll(name='header'):
                      r.extract()
@@ -34,7 +44,42 @@
                  appendtag.insert(pos, pagetext)
              for r in appendtag.findAll(attrs={'class':['n5p', 'add-info', 'twitter-share-button', 'lista lista3 lista-gry']}):
                  r.extract()
++        else:
++            tag = appendtag.find('div', attrs={'class':'S018stronyr'})
++            if tag:
++                nexturl = tag.a
++                url_part = soup.find('link', attrs={'rel':'canonical'})['href']
++                url_part = url_part[25:].rpartition('?')[0]
++                while tag:
++                    end = tag.find(attrs={'class':'right left-dead'})
++                    if end:
++                        break
++                    else:
++                        nexturl = tag.a
++                    finalurl = 'http://www.gry-online.pl/' + url_part + nexturl['href']
++                    for i in range(10):
++                        try:
++                            soup2 = self.index_to_soup(finalurl)
++                            break
++                        except:
++                            print 'retrying in 0.5s'
++                            time.sleep(0.5)
++                    tag = soup2.find('div', attrs={'class':'S018stronyr'})
++                    pagetext = soup2.find(attrs={'class':'gc660'})
++                    for r in pagetext.findAll(name='header'):
++                        r.extract()
++                    for r in pagetext.findAll(attrs={'itemprop':'description'}):
++                        r.extract()
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                for r in appendtag.findAll(attrs={'class':['n5p', 'add-info', 'twitter-share-button', 'lista lista3 lista-gry', 'S018strony']}):
++                    r.extract()
++    def image_url_processor(self, baseurl, url):
++        if url.startswith('..'):
++            return url[2:]
++        else:
++	  return url
      def preprocess_html(self, soup):
           self.append_page(soup, soup.body)
 === added file 'recipes/icons/biweekly.png'
 Binary files recipes/icons/biweekly.png	1970-01-01 00:00:00 +0000 and recipes/icons/biweekly.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/dwutygodnik.png'
 Binary files recipes/icons/dwutygodnik.png	1970-01-01 00:00:00 +0000 and recipes/icons/dwutygodnik.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/dziennik_baltycki.png'
 Binary files recipes/icons/dziennik_baltycki.png	1970-01-01 00:00:00 +0000 and recipes/icons/dziennik_baltycki.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/dziennik_lodzki.png'
 Binary files recipes/icons/dziennik_lodzki.png	1970-01-01 00:00:00 +0000 and recipes/icons/dziennik_lodzki.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/dziennik_wschodni.png'
 Binary files recipes/icons/dziennik_wschodni.png	1970-01-01 00:00:00 +0000 and recipes/icons/dziennik_wschodni.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/dziennik_zachodni.png'
 Binary files recipes/icons/dziennik_zachodni.png	1970-01-01 00:00:00 +0000 and recipes/icons/dziennik_zachodni.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/echo_dnia.png'
 Binary files recipes/icons/echo_dnia.png	1970-01-01 00:00:00 +0000 and recipes/icons/echo_dnia.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/emuzica_pl.png'
 Binary files recipes/icons/emuzica_pl.png	1970-01-01 00:00:00 +0000 and recipes/icons/emuzica_pl.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/film_org_pl.png'
 Binary files recipes/icons/film_org_pl.png	1970-01-01 00:00:00 +0000 and recipes/icons/film_org_pl.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/gazeta_krakowska.png'
 Binary files recipes/icons/gazeta_krakowska.png	1970-01-01 00:00:00 +0000 and recipes/icons/gazeta_krakowska.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/gazeta_lubuska.png'
 Binary files recipes/icons/gazeta_lubuska.png	1970-01-01 00:00:00 +0000 and recipes/icons/gazeta_lubuska.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/gazeta_wroclawska.png'
 Binary files recipes/icons/gazeta_wroclawska.png	1970-01-01 00:00:00 +0000 and recipes/icons/gazeta_wroclawska.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/gazeta_wspolczesna.png'
 Binary files recipes/icons/gazeta_wspolczesna.png	1970-01-01 00:00:00 +0000 and recipes/icons/gazeta_wspolczesna.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/gcn.png'
 Binary files recipes/icons/gcn.png	1970-01-01 00:00:00 +0000 and recipes/icons/gcn.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/glos_wielkopolski.png'
 Binary files recipes/icons/glos_wielkopolski.png	1970-01-01 00:00:00 +0000 and recipes/icons/glos_wielkopolski.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/kurier_lubelski.png'
 Binary files recipes/icons/kurier_lubelski.png	1970-01-01 00:00:00 +0000 and recipes/icons/kurier_lubelski.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/kurier_poranny.png'
 Binary files recipes/icons/kurier_poranny.png	1970-01-01 00:00:00 +0000 and recipes/icons/kurier_poranny.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/kurier_szczecinski.png'
 Binary files recipes/icons/kurier_szczecinski.png	1970-01-01 00:00:00 +0000 and recipes/icons/kurier_szczecinski.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/nowa_fantastyka.png'
 Binary files recipes/icons/nowa_fantastyka.png	1970-01-01 00:00:00 +0000 and recipes/icons/nowa_fantastyka.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/nto.png'
 Binary files recipes/icons/nto.png	1970-01-01 00:00:00 +0000 and recipes/icons/nto.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/tablety_pl.png'
 Binary files recipes/icons/tablety_pl.png	1970-01-01 00:00:00 +0000 and recipes/icons/tablety_pl.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/trojmiasto_pl.png'
 Binary files recipes/icons/trojmiasto_pl.png	1970-01-01 00:00:00 +0000 and recipes/icons/trojmiasto_pl.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/icons/zycie_warszawy.png'
 Binary files recipes/icons/zycie_warszawy.png	1970-01-01 00:00:00 +0000 and recipes/icons/zycie_warszawy.png	2013-03-06 19:41:20 +0000 differ
 === added file 'recipes/jazzpress.recipe'
 --- recipes/jazzpress.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/jazzpress.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,50 @@
++#!/usr/bin/env python
++# -*- coding: utf-8 -*-
++
++__license__ = 'GPL v3'
++__copyright__ = u'Łukasz Grąbczewski 2011-2013'
++__version__ = '2.0'
++
++import re, zipfile, os
++from calibre.ptempfile import PersistentTemporaryFile
++from calibre.ebooks.conversion.cli import main
++
++class jazzpress(BasicNewsRecipe):
++	__author__ = u'Łukasz Grąbczewski'
++	title = 'JazzPRESS'
++	language = 'pl'
++	publisher = 'Fundacja Popularyzacji Muzyki Jazzowej EuroJAZZ'
++	publication_type = 'magazine'
++	description = u'Internetowa gazeta poświęcona muzyce improwizowanej'
++
++	conversion_options = {
++		'authors' : 'Fundacja Popularyzacji Muzyki Jazzowej EuroJAZZ'
++		,'publisher' : publisher
++		,'language' : language
++		,'preserve_cover_aspect_ratio': True
++		,'remove_first_image': True
++	}
++
++	def build_index(self):
++		browser = self.get_browser()
++		rc = browser.open('http://radiojazz.fm/')
++
++		# find the link
++		epublink = browser.find_link(url_regex=re.compile('e_jazzpress\d\d\d\d\_epub'))
++
++		# download ebook
++		self.report_progress(0,_('Downloading ePUB'))
++		response = browser.follow_link(epublink)
++		book_file = PersistentTemporaryFile(suffix='.epub')
++		book_file.write(response.read())
++		book_file.close()
++
++		# convert
++		self.report_progress(0.2,_('Converting to OEB'))
++		oebdir = self.output_dir + '/INPUT/'
++		main(['ebook-convert', book_file.name, oebdir])
++
++		# feed calibre
++		index = os.path.join(oebdir, 'content.opf')
++
++		return index
 === modified file 'recipes/konflikty_zbrojne.recipe'
 --- recipes/konflikty_zbrojne.recipe	2012-10-17 14:12:08 +0000
 +++ recipes/konflikty_zbrojne.recipe	2013-03-06 19:41:20 +0000
@@ -7,7 +7,7 @@
      __author__        = 'fenuks'
      cover_url      = 'http://www.konflikty.pl/images/tapety_logo.jpg'
      language       = 'pl'
--    description ='military news'
++    description = u'Zbiór ciekawych artykułów historycznych, militarnych oraz recenzji książek, gier i filmów. Najświeższe informacje o lotnictwie, wojskach lądowych i polityce.'
      category='military, history'
      oldest_article = 7
      max_articles_per_feed = 100
 === modified file 'recipes/kosmonauta_pl.recipe'
 --- recipes/kosmonauta_pl.recipe	2012-12-14 09:32:01 +0000
 +++ recipes/kosmonauta_pl.recipe	2013-03-06 19:41:20 +0000
@@ -7,7 +7,7 @@
      description   = u'polskojęzyczny portal w całości dedykowany misjom kosmicznym i badaniom kosmosu.'
      category       = 'astronomy'
      language       = 'pl'
--    cover_url='http://bi.gazeta.pl/im/4/10393/z10393414X,Kosmonauta-net.jpg'
++    cover_url = 'http://bi.gazeta.pl/im/4/10393/z10393414X,Kosmonauta-net.jpg'
      no_stylesheets = True
      INDEX = 'http://www.kosmonauta.net'
      oldest_article = 7
@@ -24,6 +24,5 @@
                 href = a['href']
                 if not href.startswith('http'):
                     a['href'] = self.INDEX + href
--                   print '%%%%%%%%%%%%%%%%%%%%%%%%%', a['href']
          return soup
--
 \ No newline at end of file
++
 === added file 'recipes/kurier_lubelski.recipe'
 --- recipes/kurier_lubelski.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/kurier_lubelski.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,34 @@
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class KurierLubelski(BasicNewsRecipe):
++    title          = u'Kurier Lubelski'
++    __author__        = 'fenuks'
++    description   = u'Gazeta Regionalna Kurier Lubelski. Najnowsze Wiadomości Lublin. Czytaj Informacje Lublin!'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    masthead_url = 'http://s.polskatimes.pl/g/logo_naglowek/kurierlubelski.png?24'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds = True
++    no_stylesheets = True
++    use_embedded_content = False
++    ignore_duplicate_articles = {'title', 'url'}
++    #preprocess_regexps = [(re.compile(ur'<b>Czytaj także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur',<b>Czytaj też:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>Zobacz także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<center><h4><a.*?</a></h4></center>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TEŻ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ WIĘCEJ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TAKŻE:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>\* CZYTAJ KONIECZNIE:.*', re.DOTALL), lambda match: '</body>'), (re.compile(ur'<b>Nasze serwisy:</b>.*', re.DOTALL), lambda match: '</body>') ]
++    remove_tags_after= dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})
++    remove_tags=[dict(id='mat-podobne'), dict(name='a', attrs={'class':'czytajDalej'}), dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})]
++
++    feeds          = [(u'Wiadomo\u015bci', u'http://kurierlubelski.feedsportal.com/c/32980/f/533785/index.rss?201302'), (u'Region', u'http://www.kurierlubelski.pl/rss/kurierlubelski_region.xml?201302'), (u'Sport', u'http://kurierlubelski.feedsportal.com/c/32980/f/533786/index.rss?201302'), (u'Kultura', u'http://kurierlubelski.feedsportal.com/c/32980/f/533787/index.rss?201302'), (u'Rozmaito\u015bci', u'http://www.kurierlubelski.pl/rss/kurierlubelski_rozmaitosci.xml?201302'), (u'Dom', u'http://www.kurierlubelski.pl/rss/kurierlubelski_dom.xml?201302'), (u'Serwisy', u'http://www.kurierlubelski.pl/rss/kurierlubelski_serwisy.xml?201302'), (u'Motofakty', u'http://www.kurierlubelski.pl/rss/kurierlubelski_motofakty.xml?201302'), (u'M\xf3j Reporter', u'http://www.kurierlubelski.pl/rss/kurierlubelski_mojreporter.xml?201302'), (u'Praca', u'http://www.kurierlubelski.pl/rss/kurierlubelski_praca.xml?201302')]
++
++    def print_version(self, url):
++        return url.replace('artykul', 'drukuj')
++
++    def skip_ad_pages(self, soup):
++        if 'Advertisement' in soup.title:
++            nexturl=soup.find('a')['href']
++            return self.index_to_soup(nexturl, raw=True)
++
++    def get_cover_url(self):
++        soup = self.index_to_soup('http://www.prasa24.pl/gazeta/kurier-lubelski/')
++        self.cover_url=soup.find(id='pojemnik').img['src']
++        return getattr(self, 'cover_url', self.cover_url)
 === added file 'recipes/kurier_poranny.recipe'
 --- recipes/kurier_poranny.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/kurier_poranny.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,78 @@
++import re
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class KurierPoranny(BasicNewsRecipe):
++    title          = u'Kurier Poranny'
++    __author__        = 'fenuks'
++    description   = u'Kurier Poranny | poranny.pl - portal miejski Białegostoku,informacje,wydarzenia'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    extra_css = 'ul {list-style: none; padding:0; margin:0;}'
++    INDEX = 'http://www.poranny.pl'
++    masthead_url = INDEX + '/images/top_logo.png'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds = True
++    no_stylesheets = True
++    ignore_duplicate_articles = {'title', 'url'}
++
++    preprocess_regexps = [(re.compile(ur'Czytaj:.*?</a>', re.DOTALL), lambda match: ''), (re.compile(ur'Przeczytaj także:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''),
++                          (re.compile(ur'Przeczytaj również:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''), (re.compile(ur'Zobacz też:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: '')]
++
++    keep_only_tags = [dict(id=['article', 'cover', 'photostory'])]
++    remove_tags = [dict(id=['articleTags', 'articleMeta', 'boxReadIt', 'articleGalleries', 'articleConnections',
++                  'ForumArticleComments', 'articleRecommend', 'jedynkiLinks', 'articleGalleryConnections',
++                  'photostoryConnections', 'articleEpaper', 'articlePoll', 'articleAlarm', 'articleByline']),
++                  dict(attrs={'class':'articleFunctions'})]
++
++
++    feeds          = [(u'Wszystkie', u'http://www.poranny.pl/rss.xml'),
++                      (u'Białystok', u'http://www.poranny.pl/bialystok.xml'),
++                      (u'Bielsk Podlaski', u'http://www.poranny.pl/bielskpodlaski.xml'),
++                      (u'Czarna Białostocka', u'http://www.poranny.pl/czarnabialostocka.xml'),
++                      (u'Hajnówka', u'http://www.poranny.pl/hajnowka.xml'),
++                      (u'Łapy', u'http://www.poranny.pl/lapy.xml'),
++                      (u'Sokółka', u'http://www.poranny.pl/sokolka.xml'),
++                      (u'Supraśl', u'http://www.poranny.pl/suprasl.xml'),
++                      (u'Wasilków', u'http://www.poranny.pl/wasilkow.xml'),
++                      (u'Sport', u'http://www.poranny.pl/sport.xml'),
++                      (u'Praca', u'http://www.poranny.pl/praca.xml'),
++                      (u'Kultura', u'http://www.poranny.pl/kultura.xml'),
++                      (u'Dom', u'http://www.poranny.pl/dom.xml'),
++                      (u'Auto', u'http://www.poranny.pl/auto.xml'),
++                      (u'Polityka', u'http://www.poranny.pl/polityka.xml')]
++
++    def get_cover_url(self):
++        soup = self.index_to_soup(self.INDEX + '/apps/pbcs.dll/section?Category=JEDYNKI')
++        nexturl = self.INDEX + soup.find(id='covers').find('a')['href']
++        soup = self.index_to_soup(nexturl)
++        self.cover_url = self.INDEX + soup.find(id='cover').find(name='img')['src']
++        return getattr(self, 'cover_url', self.cover_url)
++
++    def append_page(self, soup, appendtag):
++        tag = soup.find('span', attrs={'class':'photoNavigationPages'})
++        if tag:
++            number = int(tag.string.rpartition('/')[-1].replace('&nbsp;', ''))
++            baseurl = self.INDEX + soup.find(attrs={'class':'photoNavigationNext'})['href'][:-1]
++
++            for r in appendtag.findAll(attrs={'class':'photoNavigation'}):
++                r.extract()
++            for nr in range(2, number+1):
++                soup2 = self.index_to_soup(baseurl + str(nr))
++                pagetext = soup2.find(id='photoContainer')
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoMeta'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoStoryText'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++
++    def preprocess_html(self, soup):
++        self.append_page(soup, soup.body)
++        return soup
 === added file 'recipes/kurier_szczecinski.recipe'
 --- recipes/kurier_szczecinski.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/kurier_szczecinski.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,27 @@
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class KurierSzczecinski(BasicNewsRecipe):
++    title          = u'Kurier Szczeci\u0144ski'
++    __author__        = 'fenuks'
++    description   = u'24Kurier jest portalem Kuriera Szczecińskiego. Zawiera aktualności ze Szczecina oraz wiadomości regionalne z  województwa zachodniopomorskiego. '
++    category       = 'newspaper'
++    #publication_type = ''
++    language       = 'pl'
++    #encoding = ''
++    #extra_css = ''
++    cover_url = 'http://www.24kurier.pl/Administracja/Img/24kurier_logo-copy-po-zapis'
++    #masthead_url = ''
++    use_embedded_content = False
++    oldest_article = 7
++    max_articles_per_feed = 100
++    no_stylesheets = True
++    remove_empty_feeds = True
++    remove_javascript = True
++    remove_attributes = ['style', 'font']
++    ignore_duplicate_articles = {'title', 'url'}
++
++    keep_only_tags = [dict(attrs={'class':'section'})]
++    remove_tags = [dict(attrs={'class':['Ikonki', 'rek', 'artComments']})]
++    remove_tags_after = dict(attrs={'class':'artComments'})
++    #remove_tags_before = dict()
++    feeds          = [(u'Aktualno\u015bci', u'http://www.24kurier.pl/cmspages/articles_rss.aspx'), (u'Kraj', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=kraj'), (u'\u015awiat', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=swiat'), (u'Sport', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=sport'), (u'Kultura', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=kultura'), (u'Gospodarka', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=gospodarka'), (u'Nauka', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=nauka'), (u'Region', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=region'), (u'Szczecin', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=szczecin'), (u'Bia\u0142ogard', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=bialogard'), (u'Choszczno', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=choszczno'), (u'Drawsko', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=drawsko'), (u'Goleni\xf3w', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=goleniow'), (u'Gryfice', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=gryfice'), (u'Gryfino', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=gryfino'), (u'Kamie\u0144 Pomorski', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=kamien'), (u'Ko\u0142obrzeg', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=kolobrzeg'), (u'Koszalin', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=koszalin'), (u'\u0141obez', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=lobez'), (u'My\u015blib\xf3rz', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=mysliborz'), (u'Police', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=police'), (u'Pyrzyce', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=pyrzyce'), (u'S\u0142awno', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=slawno'), (u'Stargard', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=stargard'), (u'Szczecinek', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=szczecinek'), (u'\u015awidwin', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=swidwin'), (u'\u015awinouj\u015bcie', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=swinoujscie'), (u'Wa\u0142cz', u'http://www.24kurier.pl/cmspages/articles_rss.aspx?dzial=walcz')]
 === modified file 'recipes/lomza.recipe'
 --- recipes/lomza.recipe	2012-10-17 14:12:08 +0000
 +++ recipes/lomza.recipe	2013-03-06 19:41:20 +0000
@@ -3,7 +3,7 @@
  class Lomza(BasicNewsRecipe):
      title          = u'4Lomza'
      __author__        = 'fenuks'
--    description   = u'4Łomża - regional site'
++    description   = u'Regionalny portal. Najświeższe informacje z regionu, kulturalne, sportowe. Ogłoszenia, baza biznesu, forum.'
      cover_url      = 'http://www.4lomza.pl/i/logo4lomza_m.jpg'
      language       = 'pl'
      oldest_article = 15
 === modified file 'recipes/mlody_technik_pl.recipe'
 --- recipes/mlody_technik_pl.recipe	2013-02-16 15:44:46 +0000
 +++ recipes/mlody_technik_pl.recipe	2013-03-06 19:41:20 +0000
@@ -7,7 +7,7 @@
      description   = u'Młody technik'
      category       = 'science'
      language       = 'pl'
--    cover_url='http://science-everywhere.pl/wp-content/uploads/2011/10/mt12.jpg'
++    #cover_url = 'http://science-everywhere.pl/wp-content/uploads/2011/10/mt12.jpg'
      no_stylesheets = True
      preprocess_regexps = [(re.compile(r"<h4>Podobne</h4>", re.IGNORECASE), lambda m: '')]
      oldest_article = 7
@@ -18,10 +18,17 @@
      remove_tags = [dict(attrs={'class':'st-related-posts'})]
      remove_tags_after = dict(attrs={'class':'entry-content clearfix'})
      feeds          = [(u'Wszystko', u'http://www.mt.com.pl/feed'),
--		(u'MT NEWS 24/7', u'http://www.mt.com.pl/kategoria/mt-newsy-24-7/feed'),
++		#(u'MT NEWS 24/7', u'http://www.mt.com.pl/kategoria/mt-newsy-24-7/feed'),
  		(u'Info zoom', u'http://www.mt.com.pl/kategoria/info-zoom/feed'),
  		(u'm.technik', u'http://www.mt.com.pl/kategoria/m-technik/feed'),
  		(u'Szkoła', u'http://www.mt.com.pl/kategoria/szkola-2/feed'),
  		(u'Na Warsztacie', u'http://www.mt.com.pl/kategoria/na-warsztacie/feed'),
  		(u'Z pasji do...', u'http://www.mt.com.pl/kategoria/z-pasji-do/feed'),
  		(u'MT testuje', u'http://www.mt.com.pl/kategoria/mt-testuje/feed')]
++
++    def get_cover_url(self):
++        soup = self.index_to_soup('http://www.mt.com.pl/')
++        tag = soup.find(attrs={'class':'xoxo'})
++        if tag:
++            self.cover_url = tag.find('img')['src']
++        return getattr(self, 'cover_url', self.cover_url)
 === modified file 'recipes/niebezpiecznik.recipe'
 --- recipes/niebezpiecznik.recipe	2011-08-20 17:45:04 +0000
 +++ recipes/niebezpiecznik.recipe	2013-03-06 19:41:20 +0000
@@ -9,8 +9,8 @@
      oldest_article = 8
      max_articles_per_feed = 100
      no_stylesheets = True
--    cover_url =u'http://userlogos.org/files/logos/Karmody/niebezpiecznik_01.png'
--    remove_tags=[dict(name='div', attrs={'class':['sociable']}), dict(name='h4'), dict(attrs={'class':'similar-posts'})]
--    keep_only_tags= [dict(name='div', attrs={'class':['title', 'entry']})]
++    cover_url = u'http://userlogos.org/files/logos/Karmody/niebezpiecznik_01.png'
++    remove_tags = [dict(name='div', attrs={'class':['sociable']}), dict(name='h4'), dict(attrs={'class':'similar-posts'})]
++    keep_only_tags = [dict(name='div', attrs={'class':['title', 'entry']})]
      feeds          = [(u'Wiadomości', u'http://feeds.feedburner.com/niebezpiecznik/'),
        	      ('Blog', 'http://feeds.feedburner.com/niebezpiecznik/linkblog/')]
 === modified file 'recipes/nowa_fantastyka.recipe'
 --- recipes/nowa_fantastyka.recipe	2013-01-25 12:49:36 +0000
 +++ recipes/nowa_fantastyka.recipe	2013-03-06 19:41:20 +0000
@@ -9,7 +9,7 @@
      __modified_by__   = 'zaslav'
      language       = 'pl'
      encoding='latin2'
--    description ='site for fantasy readers'
++    description = u'Strona dla miłośników fantastyki'
      category='fantasy'
      masthead_url='http://farm5.static.flickr.com/4133/4956658792_7ba7fbf562.jpg'
      #extra_css='.tytul {font-size: 20px;}' #not working
 === added file 'recipes/nto.recipe'
 --- recipes/nto.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/nto.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,63 @@
++import re
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class NTO(BasicNewsRecipe):
++    title          = u'Nowa Trybuna Opolska'
++    __author__        = 'fenuks'
++    description   = u'Nowa Trybuna Opolska - portal regionalny województwa opolskiego.'
++    category       = 'newspaper'
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    extra_css = 'ul {list-style: none; padding:0; margin:0;}'
++    INDEX = 'http://www.nto.pl'
++    masthead_url = INDEX + '/images/top_logo.png'
++    oldest_article = 7
++    max_articles_per_feed = 100
++    remove_empty_feeds = True
++    no_stylesheets = True
++    ignore_duplicate_articles = {'title', 'url'}
++
++    preprocess_regexps = [(re.compile(ur'Czytaj:.*?</a>', re.DOTALL), lambda match: ''), (re.compile(ur'Przeczytaj także:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''),
++                          (re.compile(ur'Przeczytaj również:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''), (re.compile(ur'Zobacz też:.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: '')]
++
++    keep_only_tags = [dict(id=['article', 'cover', 'photostory'])]
++    remove_tags = [dict(id=['articleTags', 'articleMeta', 'boxReadIt', 'articleGalleries', 'articleConnections',
++                  'ForumArticleComments', 'articleRecommend', 'jedynkiLinks', 'articleGalleryConnections',
++                  'photostoryConnections', 'articleEpaper', 'articlePoll', 'articleAlarm', 'articleByline']),
++                  dict(attrs={'class':'articleFunctions'})]
++
++    feeds          = [(u'Wszystkie', u'http://www.nto.pl/rss.xml'), (u'Region', u'http://www.nto.pl/region.xml'), (u'Brzeg', u'http://www.nto.pl/brzeg.xml'), (u'G\u0142ubczyce', u'http://www.nto.pl/glubczyce.xml'), (u'K\u0119dzierzyn-Ko\u017ale', u'http://www.nto.pl/kedzierzynkozle.xml'), (u'Kluczbork', u'http://www.nto.pl/kluczbork.xml'), (u'Krapkowice', u'http://www.nto.pl/krapkowice.xml'), (u'Namys\u0142\xf3w', u'http://www.nto.pl/namyslow.xml'), (u'Nysa', u'http://www.nto.pl/nysa.xml'), (u'Olesno', u'http://www.nto.pl/olesno.xml'), (u'Opole', u'http://www.nto.pl/opole.xml'), (u'Prudnik', u'http://www.nto.pl/prudnik.xml'), (u'Strzelce Opolskie', u'http://www.nto.pl/strzelceopolskie.xml'), (u'Sport', u'http://www.nto.pl/sport.xml'), (u'Polska i \u015bwiat', u'http://www.nto.pl/apps/pbcs.dll/section?Category=RSS&channel=KRAJSWIAT'), (u'Zdrowy styl', u'http://www.nto.pl/apps/pbcs.dll/section?Category=rss_zdrowystyl'), (u'Reporta\u017c', u'http://www.nto.pl/reportaz.xml'), (u'Studia', u'http://www.nto.pl/akademicka.xml')]
++
++    def get_cover_url(self):
++        soup = self.index_to_soup(self.INDEX + '/apps/pbcs.dll/section?Category=JEDYNKI')
++        nexturl = self.INDEX + soup.find(id='covers').find('a')['href']
++        soup = self.index_to_soup(nexturl)
++        self.cover_url = self.INDEX + soup.find(id='cover').find(name='img')['src']
++        return getattr(self, 'cover_url', self.cover_url)
++
++    def append_page(self, soup, appendtag):
++        tag = soup.find('span', attrs={'class':'photoNavigationPages'})
++        if tag:
++            number = int(tag.string.rpartition('/')[-1].replace('&nbsp;', ''))
++            baseurl = self.INDEX + soup.find(attrs={'class':'photoNavigationNext'})['href'][:-1]
++
++            for r in appendtag.findAll(attrs={'class':'photoNavigation'}):
++                r.extract()
++            for nr in range(2, number+1):
++                soup2 = self.index_to_soup(baseurl + str(nr))
++                pagetext = soup2.find(id='photoContainer')
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoMeta'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++                pagetext = soup2.find(attrs={'class':'photoStoryText'})
++                if pagetext:
++                    pos = len(appendtag.contents)
++                    appendtag.insert(pos, pagetext)
++
++    def preprocess_html(self, soup):
++        self.append_page(soup, soup.body)
++        return soup
 === modified file 'recipes/pc_foster.recipe'
 --- recipes/pc_foster.recipe	2012-02-20 05:41:32 +0000
 +++ recipes/pc_foster.recipe	2013-03-06 19:41:20 +0000
@@ -7,12 +7,12 @@
      description   = u'Vortal technologiczny: testy, recenzje sprzętu komputerowego i telefonów, nowinki hardware, programy i gry dla Windows. Podkręcanie, modding i Overclocking.'
      category       = 'IT'
      language       = 'pl'
--    masthead_url='http://pcfoster.pl/public/images/logo.png'
--    cover_url= 'http://pcfoster.pl/public/images/logo.png'
--    no_stylesheets= True
--    remove_empty_feeds= True
--    keep_only_tags= [dict(id=['news_details', 'review_details']), dict(attrs={'class':'pager more_top'})]
--    remove_tags=[dict(name='p', attrs={'class':'right'})]
++    masthead_url = 'http://pcfoster.pl/public/images/logo.png'
++    cover_url = 'http://pcfoster.pl/public/images/logo.png'
++    no_stylesheets = True
++    remove_empty_feeds = True
++    keep_only_tags = [dict(id=['news_details', 'review_details']), dict(attrs={'class':'pager more_top'})]
++    remove_tags = [dict(name='p', attrs={'class':'right'})]
      feeds          = [(u'G\u0142\xf3wny', u'http://pcfoster.pl/public/rss/main.xml')]
@@ -32,4 +32,4 @@
      def preprocess_html(self, soup):
          self.append_page(soup, soup.body)
--        return soup
 \ No newline at end of file
++        return soup
 === modified file 'recipes/polska_times.recipe'
 --- recipes/polska_times.recipe	2012-10-17 14:12:08 +0000
 +++ recipes/polska_times.recipe	2013-03-06 19:41:20 +0000
@@ -7,9 +7,11 @@
      language       = 'pl'
      masthead_url = 'http://s.polskatimes.pl/g/logo_naglowek/polska.gif?17'
      oldest_article = 7
++    encoding = 'iso-8859-2'
      max_articles_per_feed = 100
--    remove_emty_feeds= True
++    remove_empty_feeds = True
      no_stylesheets = True
++    use_embedded_content = False
      ignore_duplicate_articles = {'title', 'url'}
      #preprocess_regexps = [(re.compile(ur'<b>Czytaj także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur',<b>Czytaj też:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>Zobacz także:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<center><h4><a.*?</a></h4></center>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TEŻ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ WIĘCEJ:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>CZYTAJ TAKŻE:.*?</b>', re.DOTALL), lambda match: ''), (re.compile(ur'<b>\* CZYTAJ KONIECZNIE:.*', re.DOTALL), lambda match: '</body>'), (re.compile(ur'<b>Nasze serwisy:</b>.*', re.DOTALL), lambda match: '</body>') ]
      remove_tags_after= dict(attrs={'src':'http://nm.dz.com.pl/dz.png'})
 === modified file 'recipes/spiders_web_pl.recipe'
 --- recipes/spiders_web_pl.recipe	2013-03-05 20:11:58 +0000
 +++ recipes/spiders_web_pl.recipe	2013-03-06 19:41:20 +0000
@@ -4,7 +4,7 @@
      title          = u"Spider's Web"
      oldest_article = 7
      __author__        = 'fenuks'
--    description   = u'Opinie i analizy na temat technologii'
++    description   = u'Autorskie teksty popularnych blogerów, testy sprzętu i aplikacji, oraz wiele więcej.'
      cover_url      = 'http://www.spidersweb.pl/wp-content/themes/new_sw/images/spidersweb.png'
      category       = 'IT, WEB'
      language       = 'pl'
 === modified file 'recipes/tablety_pl.recipe'
 --- recipes/tablety_pl.recipe	2012-03-21 03:22:11 +0000
 +++ recipes/tablety_pl.recipe	2013-03-06 19:41:20 +0000
@@ -3,7 +3,7 @@
  class Tablety_pl(BasicNewsRecipe):
      title          = u'Tablety.pl'
      __author__        = 'fenuks'
--    description   = u'tablety.pl - latest tablet news'
++    description   = u'Tablety, gry i aplikacje na tablety.'
      masthead_url= 'http://www.tablety.pl/wp-content/themes/kolektyw/img/logo.png'
      cover_url      = 'http://www.tablety.pl/wp-content/themes/kolektyw/img/logo.png'
      category       = 'IT'
 === modified file 'recipes/tanuki.recipe'
 --- recipes/tanuki.recipe	2012-04-18 04:03:44 +0000
 +++ recipes/tanuki.recipe	2013-03-06 19:41:20 +0000
@@ -4,6 +4,7 @@
      title          = u'Tanuki'
      oldest_article = 7
      __author__        = 'fenuks'
++    description = u'Tanuki - portal o anime i mandze.'
      category       = 'anime, manga'
      language       = 'pl'
      max_articles_per_feed = 100
@@ -42,4 +43,4 @@
                      a['href']='http://manga.tanuki.pl' + a['href']
                  elif 'tanuki-czytelnia' in soup.title.string.lower():
                      a['href']='http://czytelnia.tanuki.pl' + a['href']
--        return soup
 \ No newline at end of file
++        return soup
 === added file 'recipes/trojmiasto_pl.recipe'
 --- recipes/trojmiasto_pl.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/trojmiasto_pl.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,37 @@
++import re
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class Trojmiasto(BasicNewsRecipe):
++    title          = u'Tr\xf3jmiasto.pl'
++    __author__        = 'fenuks'
++    description   = u'Wiadomości, imprezy, wydarzenia, spektakle.Gdańsk, Gdynia, Sopot - NOCLEGI, Katalog firm, repertuar kin, wydarzenia, przewodnik, mapa, kwatery, hotele. Portal regionalny trojmiasto.pl'
++    category       = ''
++    #publication_type = ''
++    language       = 'pl'
++    encoding = 'iso-8859-2'
++    extra_css = 'ul {list-style: none; padding:0; margin:0;}'
++    cover_url = 'http://www.trojmiasto.pl/_img/toplong2/logo_trojmiasto.gif'
++    #masthead_url = ''
++    use_embedded_content = False
++    oldest_article = 7
++    max_articles_per_feed = 100
++    no_stylesheets = True
++    remove_empty_feeds = True
++    remove_javascript = True
++    remove_attributes = ['style', 'font']
++    ignore_duplicate_articles = {'title', 'url'}
++
++    preprocess_regexps = [(re.compile(ur'<strong>Czytaj więcej.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''), (re.compile(ur'<strong>Zobacz też.*?</a>', re.DOTALL|re.IGNORECASE), lambda match: ''),
++    (re.compile(ur'<b>[A-ZĄĆĘŁŃÓŚŹŻ \-,.:]*?</b>', re.DOTALL), lambda match: ''),]
++
++    #keep_only_tags = []
++    remove_tags = [dict(id=['logo', 'font_small', 'font_big']), dict(attrs={'class':['title-long', 'ankieta', 'newsletter-inside-content newsletter-wrap', 'copyright_box',
++    'logo', 'btn btn-photo-add', 'related-info-wrap', 'nTabs', 'article-list', 'rate-player horizontal', 'type-box', 'rate-player'
++    'hover-nav', 'live-head tC', 'prev-link', 'next-link', 'ie6']}), dict(attrs={'title':[u'drukuj artykuł', u'podziel się na Facebooku', u'prześlij artykuł']})]
++    remove_tags_after = dict(attrs={'class':'author-wrap'})
++    remove_tags_before = dict(attrs={'class':'text-container'})
++
++    feeds          = [(u'Wszystkie', u'http://rss.trojmiasto.pl/rss,0.xml'), (u'Fakty i opinie', u'http://rss.trojmiasto.pl/rss,1.xml'), (u'Sport', u'http://rss.trojmiasto.pl/rss,2.xml'), (u'Dom', u'http://rss.trojmiasto.pl/rss,3.xml'), (u'Moto', u'http://rss.trojmiasto.pl/rss,4.xml'), (u'Nauka', u'http://rss.trojmiasto.pl/rss,5.xml'), (u'Rozrywka', u'http://rss.trojmiasto.pl/rss,6.xml'), (u'Kultura', u'http://rss.trojmiasto.pl/rss,7.xml'), (u'Rowery', u'http://rss.trojmiasto.pl/rss,8.xml'), (u'Dziecko', u'http://rss.trojmiasto.pl/rss,9.xml'), (u'Zdrowie i uroda', u'http://rss.trojmiasto.pl/rss,10.xml'), (u'Praca', u'http://rss.trojmiasto.pl/rss,11.xml'), (u'Artyku\u0142y czytelnik\xf3w', u'http://rss.trojmiasto.pl/rss,12.xml'), (u'Korki', u'http://rss.trojmiasto.pl/rss,13.xml'), (u'Historia', u'http://rss.trojmiasto.pl/rss,14.xml'), (u'Biznes', u'http://rss.trojmiasto.pl/rss,16.xml'), (u'Kryminalne Tr\xf3jmiasto', u'http://rss.trojmiasto.pl/rss,17.xml'), (u'Przewodnik', u'http://rss.trojmiasto.pl/rss,18.xml'), (u'Aktywne Tr\xf3jmiasto', u'http://rss.trojmiasto.pl/rss,19.xml'), (u'Delux', u'http://rss.trojmiasto.pl/rss,20.xml')]
++
++    def print_version(self, url):
++        return url + '?print=1'
 === modified file 'recipes/tvn24.recipe'
 --- recipes/tvn24.recipe	2013-03-04 03:41:11 +0000
 +++ recipes/tvn24.recipe	2013-03-06 19:41:20 +0000
@@ -8,8 +8,8 @@
      description   = u'Sport, Biznes, Gospodarka, Informacje, Wiadomości Zawsze aktualne wiadomości z Polski i ze świata'
      category       = 'news'
      language       = 'pl'
--    masthead_url= 'http://www.tvn24.pl/_d/topmenu/logo2.gif'
--    cover_url= 'http://www.tvn24.pl/_d/topmenu/logo2.gif'
++    #masthead_url= 'http://www.tvn24.pl/_d/topmenu/logo2.gif'
++    cover_url= 'http://www.qzdrowiu.pl/Upload/KnowQZdrowiu_PressOffice/TVN24_logo_575702b7-edce-4b6f-a41b-4395f9456f96_ff6d6ccf-528a-4b94-9e61-2fed727aba35.png'
      extra_css= 'ul {list-style: none; padding: 0; margin: 0;} li {float: left;margin: 0 0.15em;}'
      remove_empty_feeds = True
      remove_javascript = True
 === modified file 'recipes/ubuntu_pl.recipe'
 --- recipes/ubuntu_pl.recipe	2012-02-20 04:34:40 +0000
 +++ recipes/ubuntu_pl.recipe	2013-03-06 19:41:20 +0000
@@ -3,7 +3,7 @@
  class Ubuntu_pl(BasicNewsRecipe):
      title          = u'UBUNTU.pl'
      __author__        = 'fenuks'
--    description   = 'UBUNTU.pl - polish ubuntu community site'
++    description   = 'Polskie forum użytkowników Ubuntu Linux. Projekty, porady i dyskusje, gotowe rozwiązania problemów.'
      masthead_url= 'http://ubuntu.pl/img/logo.jpg'
      cover_url      = 'http://ubuntu.pl/img/logo.jpg'
      category       = 'linux, IT'
 === added file 'recipes/zycie_warszawy.recipe'
 --- recipes/zycie_warszawy.recipe	1970-01-01 00:00:00 +0000
 +++ recipes/zycie_warszawy.recipe	2013-03-06 19:41:20 +0000
@@ -0,0 +1,46 @@
++#!/usr/bin/env python
++# -*- coding: utf-8 -*-
++
++__license__   = 'GPL v3'
++__copyright__   = u'Łukasz Grąbczewski 2012-2013'
++__version__   = '1.1'
++
++'''
++zw.com.pl
++'''
++
++from calibre.web.feeds.news import BasicNewsRecipe
++
++class zyciewarszawy(BasicNewsRecipe):
++	__author__ = u'Łukasz Grączewski'
++	title = u'Życie Warszawy'
++	description = u'Wiadomości z Warszawy'
++	language = 'pl'
++	publisher = 'Presspublica'
++	publication_type = 'newspapper'
++	masthead_url = 'http://www.zw.com.pl/static/img/logo_zw.gif'
++	no_stylesheets = True
++	remove_javascript = True
++
++	oldest_article = 1 #daily news only
++	max_articles_per_feed = 100
++
++	feeds = [(u'Najnowsze', u'http://www.zw.com.pl/rss/1.html')]
++
++	keep_only_tags = []
++	keep_only_tags.append(dict(name = 'div', attrs = {'id' : 'storyp'}))
++
++	remove_tags = []
++	remove_tags.append(dict(name = 'div', attrs = {'class' : 'authordate'}))
++	remove_tags.append(dict(name = 'div', attrs = {'class' : 'author'}))
++	'''remove_tags.append(dict(name = 'div', attrs = {'class' : 'seealso'}))'''
++	remove_tags.append(dict(name = 'div', attrs = {'class' : 'more'}))
++	remove_tags.append(dict(name = 'div', attrs = {'class' : 'clr'}))
++	remove_tags.append(dict(name = 'div', attrs = {'id' : 'adk_0'}))
++	remove_tags.append(dict(name = 'div', attrs = {'id' : 'adsense_0'}))
++	remove_tags.append(dict(name = 'div', attrs = {'id' : 'share_bottom'}))
++	remove_tags.append(dict(name = 'div', attrs = {'id' : 'copyright_law'}))
++
++	def print_version(self, url):
++		url += "?print=tak"
++		return url

calibre

Merge lp:~tomek3d/calibre/kalibrator into lp:calibre

Commit message

Description of the change

Preview Diff

Subscribers