5 Commits

Author SHA1 Message Date
bas smit
8d0e643637 Explicitly set the html parser to make sure no extra tags get added.
BeautifulSoup supports multiple html parsers. Some of those parsers
try to make the html valid by adding/removing tags[1]. This can lead
to useless html, head & body tags in the final document. By explicitly
setting the parser to 'html.parser' this behaviour can be avoided.

[1] http://www.crummy.com/software/BeautifulSoup/bs4/doc/#differences-between-parsers
2013-05-24 11:12:51 +02:00
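
A minimal sketch of the behaviour described in commit 8d0e643637, assuming BeautifulSoup 4 (`bs4`) is installed: passing 'html.parser' as the second argument parses a fragment without wrapping it in html, head or body tags. The fragment and the helper `wrapper_tags` are hypothetical and exist only for illustration; they are not taken from the plugin's code.

```python
from bs4 import BeautifulSoup

# Hypothetical fragment, similar in shape to a generated table of contents.
fragment = '<div class="toc"><ul><li><a href="#intro">Intro</a></li></ul></div>'

# Naming the parser explicitly keeps the result independent of which
# optional parsers (lxml, html5lib) happen to be installed.
soup = BeautifulSoup(fragment, "html.parser")


def wrapper_tags(parsed):
    """Return the document-level tags present in the parsed tree (illustrative helper)."""
    return [name for name in ("html", "head", "body") if parsed.find(name) is not None]


rendered = str(soup)
print(wrapper_tags(soup))
print(rendered)
```

According to the BeautifulSoup documentation linked in [1], the lxml and html5lib parsers would instead return this fragment wrapped in document-level tags, which is the extra markup the commit avoids.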
Justin Mayer
e11c18bf48 Merge pull request #27 from fbs/etoc_doc
Update the extract_toc documentation with a md/rst toc example
2013-05-20 03:46:02 -07:00
bas smit
f920f0ec9e Update the extract_toc documentation with a md/rst toc example
2013-05-19 14:59:36 +02:00
bas smit
e07dac8799 Skip static content for toc extraction
2013-05-19 13:37:15 +02:00
Talha Mansoor
45b3094247 Adds Extract table of contents plugin
2013-04-16 10:29:57 -07:00