Skip to content

mediawiki-client-tools/mediawiki-dump-generator

 
 

Repository files navigation

MediaWiki Dump Generator

MediaWiki Dump Generator can archive wikis from the largest to the tiniest.

MediaWiki Dump Generator is an ongoing project to port the legacy wikiteam toolset to Python 3 and PyPI to make it more accessible for today's archivers.

Most of the focus has been on the core dumpgenerator tool. Python 3 versions of the other wikiteam tools may be added over time.

MediaWiki Dump Generator Toolset

MediaWiki Dump Generator is a set of tools for archiving wikis. The main general-purpose module of MediaWiki Dump Generator is dumpgenerator, which can download XML dumps of MediaWiki sites that can then be parsed or redeployed elsewhere.

Wikipedia is far too large to manage the dump easily and dumps are already freely available.

Installing the tools

For prerequisites and installation see Installation

Using the tools

For usage see Usage

Publishing the dump

Please consider publishing your wiki dump(s). You can do it yourself as explained in Publishing.

Getting help

  • You can read and post in MediaWiki Client Tools' GitHub Discussions.
  • If you need help (other than reporting a bug), you can reach out on MediaWiki Client Tools' Discussions/Q&A.

Contributing

For information on reporting bugs and proposing changes, please see the Contributing guide.

Code of Conduct

mediawiki-client-tools has a Code of Conduct.

At the moment the only person responsible for reviewing CoC reports is the repository administrator, Elsie Hupp, but we will work towards implementing a broader-based approach to reviews.

You can contact Elsie Hupp directly via email at mediawiki-client-tools@elsiehupp.com or on Matrix at @elsiehupp:beeper.com. (Please state up front if your message concerns the Code of Conduct, as these messages are confidential.)

Contributors

WikiTeam is the Archive Team [GitHub] subcommittee on wikis. It was founded and originally developed by Emilio J. Rodríguez-Posada, a Wikipedia veteran editor and amateur archivist. Thanks to people who have helped, especially to: Federico Leva, Alex Buie, Scott Boyd, Hydriz, Platonides, Ian McEwen, Mike Dupont, balr0g and PiRSquared17.

MediaWiki Dump Generator The Python 3 initiative is currently being led by Elsie Hupp, with contributions from Victor Gambier, Thomas Karcher, Janet Cobb, yzqzss, NyaMisty and Rob Kam

Languages

  • HTML 73.5%
  • Python 26.5%