[Members] wiki.xmpp.org data recovery
Goffi
goffi at goffi.org
Thu Jun 22 15:03:45 UTC 2017
Le jeudi 22 juin 2017, 10:06:05 CEST Guus der Kinderen a écrit :
> Oh, that's actually handy. I'm not much of a bash scripter, but by
> combining xidel (to select the part of the HTML that is the article
> content) and pandoc (for conversion to the Mediawiki format), I'm getting
> something that is pretty close. Example:
>
> $ xidel --html Edwin_Mons_Application_2011.html --css "#mw-content-text" |
> pandoc --from html --to mediawiki
>
> Can someone improve on that?
We can also use weboob with webcontentedit to automatize publishing on the
wiki, something like
$ xidel --html Edwin_Mons_Application_2011.html --css "#mw-content-text" |
pandoc --from html --to mediawiki |
webcontentedit edit Edwin_Mons_Application_2011
Add curl or wget to the game, and I think we can make a script to handle this
not too badly, we can fix issues after by hand.
I'm too busy right now to work on a script, but it should not be really
complicated to do.
Goffi
More information about the Members
mailing list