Table des matières

How to wget blog content

Blog download

 get --recursive --no-clobber --page-requisites --html-extension --convert-links --restrict-file-names=windows --domains free.fr --no-parent gurau-audibert.hd.free.fr/josdblog/category/logiciels/ 
 gurau-audibert.hd.free.fr/josdblog/category/logiciels/ 

it might better to download the whole blog without any category specification whatsoever…

Category and tag identification

Categories

Categorie are downloaded on the file structure, at least in Wordpress. TODO: check how does it work with other blog content managers

 gurau-audibert.hd.free.fr/josdblog/category/logiciels 

Tags

For this post about Eclipse and Ant, we found the tag in the following file

 gurau-audibert.hd.free.fr/josdblog/category/logiciels/eclipse/index.html 

The tag is contained in this particular anchor

 <a href="http://gurau-audibert.hd.free.fr/josdblog/tag/ant/" rel="tag">Ant</a>