jon-edward/wiki_categories_datastore

wiki_categories_datastore

This repository contains automatic deployments of the Wikipedia category tree, trimmed by the algorithm defined in main.

The data stored here is not a faithful reproduction of the Wikipedia category trees. Producing a trimmed category tree excludes many categories and adds many edges that do not exist in the corresponding true category graph.

To construct the full (untrimmed) category tree, see CategoryTree and Assets.

Assets are sourced from the Wikimedia data dumps.

Format

Each language directory under ./data/ contains three kinds of files: _meta.json, which records information about the last successful run; _index.bytes, a concatenation of the available category IDs, each stored as a zero-padded, big-endian, unsigned 4-byte integer; and one file per category, named to match [0-9]+\.json, which describes that category's name, successors, and predecessors.
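
As a rough illustration of the index layout described above, the following Python sketch decodes a run of big-endian unsigned 4-byte integers into category IDs. The function names `parse_index` and `category_path` are hypothetical and not part of this repository. It also assumes that each category file is named after its decimal ID and that `data/en` is a language directory, neither of which is confirmed above.

```python
import struct
from pathlib import Path


def parse_index(raw: bytes) -> list[int]:
    """Decode concatenated big-endian unsigned 4-byte integers."""
    if len(raw) % 4 != 0:
        raise ValueError("index length is not a multiple of 4 bytes")
    count = len(raw) // 4
    # ">" selects big-endian byte order, "I" an unsigned 4-byte integer.
    return list(struct.unpack(f">{count}I", raw))


def category_path(language_dir: Path, category_id: int) -> Path:
    """Build the path of a category file.

    Assumption: the file is named after the decimal category ID.
    """
    return language_dir / f"{category_id}.json"


if __name__ == "__main__":
    # Assumption: "data/en" stands in for one of the language directories.
    language_dir = Path("data") / "en"
    index_file = language_dir / "_index.bytes"
    if index_file.exists():
        ids = parse_index(index_file.read_bytes())
        print(f"{len(ids)} category IDs available")
```

Reading the whole index with a single `struct.unpack` call keeps the decoding short. For very large indexes, iterating with `struct.iter_unpack(">I", raw)` would avoid building the full tuple first.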

Languages

Deployments are available for the following Wikipedia languages:

Disclaimer

The author of this software is not affiliated with, associated with, authorized by, endorsed by, or in any way officially connected with the Wikimedia Foundation or any of its affiliates. This software is independently owned and created.
