[Members] wiki.xmpp.org data recovery

Goffi goffi at goffi.org
Thu Jun 22 15:03:45 UTC 2017


On Thursday, 22 June 2017, 10:06:05 CEST Guus der Kinderen wrote:
> Oh, that's actually handy. I'm not much of a bash scripter, but by
> combining xidel (to select the part of the HTML that is the article
> content) and pandoc (for conversion to the Mediawiki format), I'm getting
> something that is pretty close. Example:
> 
> $ xidel --html Edwin_Mons_Application_2011.html --css "#mw-content-text" |
> pandoc --from html --to mediawiki
> 
> Can someone improve on that?

We can also use weboob with webcontentedit to automate publishing to the 
wiki, something like:

$ xidel --html Edwin_Mons_Application_2011.html --css "#mw-content-text" |
  pandoc --from html --to mediawiki |
  webcontentedit edit Edwin_Mons_Application_2011

Add curl or wget to the mix, and I think we can write a script that handles 
this reasonably well; we can then fix any remaining issues by hand.

I'm too busy right now to work on a script, but it should not be very 
complicated to write.
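
As a rough, untested sketch of what such a script might look like: it wraps
the pipeline above in a loop over HTML files that have already been saved
locally (for example with curl or wget). The function names page_name and
convert_page are made up for illustration, and it assumes that each file is
named after its wiki page and that webcontentedit accepts the page text on
stdin as in the pipeline above.

```shell
#!/bin/sh
# Sketch only: convert saved wiki pages and publish them with webcontentedit.
# Assumptions: xidel, pandoc and weboob's webcontentedit are installed, and
# each saved file is named <PageName>.html.

# Derive the wiki page name from a saved file name (hypothetical helper).
page_name() {
    basename "$1" .html
}

# Extract the article body and convert it to MediaWiki markup
# (same xidel/pandoc invocation as in the pipeline above).
convert_page() {
    xidel --html "$1" --css "#mw-content-text" |
        pandoc --from html --to mediawiki
}

# Process every file given on the command line; report failures and carry on
# so that the remaining pages can be fixed by hand afterwards.
for file in "$@"; do
    page=$(page_name "$file")
    if convert_page "$file" | webcontentedit edit "$page"; then
        echo "published: $page"
    else
        echo "failed: $page (fix by hand)" >&2
    fi
done
```

It could be run as "sh recover.sh *.html" from the directory holding the saved
pages (recover.sh being whatever name the script is saved under).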

Goffi


