How to wget blog content

Blog download

wget --recursive --no-clobber --page-requisites --html-extension --convert-links --restrict-file-names=windows --domains free.fr --no-parent gurau-audibert.hd.free.fr/josdblog/category/logiciels/
  • Although this command asks only for the posts in the logiciels category
gurau-audibert.hd.free.fr/josdblog/category/logiciels/

it might be better to download the whole blog without specifying any category.
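
A whole-blog download could be sketched as below. The options are the ones used in the command above; only the target URL changes. It assumes the blog root is gurau-audibert.hd.free.fr/josdblog/, and the function name mirror_blog is a hypothetical helper, not part of wget.

```shell
#!/bin/sh
# Sketch only: mirror_blog is a hypothetical wrapper around the wget
# options already used on this page.
#   $1: URL to start from (assumed here to be the blog root)
#   $2: domain that wget is allowed to follow links into
mirror_blog() {
    wget --recursive --no-clobber --page-requisites --html-extension \
         --convert-links --restrict-file-names=windows \
         --domains "$2" --no-parent "$1"
}

# Usage (this performs network access, so it is left commented out):
# mirror_blog gurau-audibert.hd.free.fr/josdblog/ free.fr
```

Because --no-parent is kept, wget should still stay below the josdblog/ directory, while categories, tags and posts under it are all followed.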

Category and tag identification

Categories

Categories are reflected in the downloaded file structure, at least with WordPress. TODO: check how this works with other blog content managers.

gurau-audibert.hd.free.fr/josdblog/category/logiciels
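
Since each category appears as a directory, the categories of a downloaded blog can be listed from the file system. This is a minimal sketch, assuming the WordPress layout shown above (one directory per category under category/); list_categories is a hypothetical helper name.

```shell
#!/bin/sh
# Sketch only: list the category names found in a mirrored blog.
#   $1: local directory of the mirrored blog,
#       e.g. gurau-audibert.hd.free.fr/josdblog
list_categories() {
    for dir in "$1"/category/*/; do
        # If nothing matches, the pattern is left as-is; skip it.
        [ -d "$dir" ] || continue
        basename "$dir"
    done
}

# Usage, run from the directory where wget was started:
# list_categories gurau-audibert.hd.free.fr/josdblog
```

Only the first level under category/ is listed; whether deeper directories are sub-categories or something else has not been checked here.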

Tags

For this post about Eclipse and Ant, we found the tag in the following file

gurau-audibert.hd.free.fr/josdblog/category/logiciels/eclipse/index.html

The tag is contained in this particular anchor

<a href="http://gurau-audibert.hd.free.fr/josdblog/tag/ant/" rel="tag">Ant</a> 
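
Anchors of this form can be pulled out of a downloaded page with standard text tools. The sketch below assumes every tag link carries rel="tag" and that each anchor sits on a single line, as in the example above; extract_tags is a hypothetical helper name, and grep -o is a GNU/BSD extension rather than strict POSIX.

```shell
#!/bin/sh
# Sketch only: print the tag names found in one downloaded HTML file.
#   $1: path to the HTML file
extract_tags() {
    # Keep only anchors that carry rel="tag", one per output line,
    # then strip the markup so that only the link text remains.
    grep -o '<a [^>]*rel="tag"[^>]*>[^<]*</a>' "$1" |
        sed 's/<[^>]*>//g' |
        sort -u
}

# Usage:
# extract_tags gurau-audibert.hd.free.fr/josdblog/category/logiciels/eclipse/index.html
```

A regular expression is enough for this one fixed pattern; for anything less regular, a real HTML parser would be the safer choice.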

Links