Releases: spencermountain/wtf_wikipedia

8.1.1

29 Mar 15:47
2cafa91

bugfix for table parser

8.1.0

25 Mar 17:22

8.1.0

  • [major] fix Link json object in .json() result
  • [major] fix inconsistent response for singular method aliases like .template('foo')
  • [major] change in rowspan behaviour to support covid table
  • support <noinclude>
  • add .url() and .language() methods
    • support setters on Link methods
    • add Link.href() method
    • support proper urls for interwiki links
  • replicate wikipedia behaviour for apostrophe-s after link
  • new plugins: summary, classify, category, and i18n
  • Link hrefs are not titlecased anymore by default

8.0.0

11 Mar 21:52
75c90fc

8.0.0

  • [breaking] move .html(), .latex(), and .markdown() to their respective plugins
    • drop header/footer boilerplate from outputs
  • [breaking] .templates() and .links() return Template and Link objects, and not bare JSON (use .map(l=> l.json()))
  • [breaking] refactor inputs for .fetch()
    • no longer support 'enwikiquote' etc format as input
    • use 'wiki' instead of undocumented 'wikiUrl' param
    • no more automatic throttling/rate-limiting
  • [breaking] move Image.exists() method to a plugin
  • [major] create separate client/server-side build formats (use native fetch/node lib)
  • [major] support deep (infinite) recursion in templates
  • [major] much-stronger i18n support
  • no longer automatically titlecase links
  • support adding template parsers through plugins in .extend()
    • support array, number, and string shorthand for template parsers
  • deprecate .plaintext() in favour of .text()
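
The change to .templates() and .links() mostly affects code that read plain fields straight off the results. A minimal migration sketch, assuming only what the note above states (each returned object has a .json() method). The helper names `linksAsJson` and `templatesAsJson` are hypothetical and not part of the library; `doc` is any parsed document:

```javascript
// Hypothetical helpers (not part of wtf_wikipedia) for code written against 7.x,
// which expected bare JSON from .links() and .templates().
// Assumes only that each returned object exposes .json(), per the 8.0.0 note.
function linksAsJson(doc) {
  return doc.links().map(l => l.json())
}

function templatesAsJson(doc) {
  return doc.templates().map(t => t.json())
}
```

With the real library, `doc` would be the result of `wtf(wikitext)`; the shape of each JSON object is whatever the installed version returns.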

7.2.10

24 Feb 18:12
706de04

7.1.0

  • some template fixes
  • add a 'number' field in sentence json, when it looks like a number
  • slight change in coordinate result format, support inline coordinate text
  • handle fetching a large list of titles in sequence

7.1.1

  • support population, weatherbox templates

7.2.0

  • improved date templates, bugfixes

7.2.9

  • a few more sports templates
  • rowspan parsing fix
  • no longer include package.json in builds
  • use full template-parser for image captions
  • support manually setting doc.title()

7.2.10

  • improved unicode support for sentence/paragraph splitting
  • support more formatting templates, like Mono
  • more flexible reference support in .json()

7.0.0

04 Dec 01:04
aabde6e

6.0.0 🚨

  • support .paragraphs()
  • ⚠️ major changes to output of .json(), cleaning up redundant data ⚠️
    • remove top-level templates data (found in section) - resume it with {templates:true}
    • remove top-level coordinates data (found in templates) - resume it with {coordinates:true}
    • remove top-level citations data (found in section) - resume it with {citations:true}
  • return empty arrays in .json() again ¯\_(:/)_/¯
  • remove title on html output
  • change ambiguous options.title for sections to options.headers
  • support lists of 1
  • begin removing empty references section by default
  • begin support for rendering citations at the bottom of documents
  • begin first-class references-parsing as objects at paragraph-level
    • use this: .citations() --> .citations().map(c => c.json());
  • remove .wikitext() and .reparse() methods - keeping wikitext stateful caused too many issues
  • turn Image.file into a function
  • include interwiki() results in .links()
  • support follow_redirects option to fetch
  • hide object data in console.logs
  • move ALL image urls from upload.wikimedia.org/wikipedia/commons to wikipedia.org/wiki/Special:Redirect/file/ via #86
  • image captions are now Sentence objects
  • rename citation → reference internally, and in json output
  • remove references inside section titles
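
For code that relied on the old top-level fields, the three opt-in flags listed above can be passed together. A minimal sketch, assuming the option names exactly as written in this note; `legacyJson` and `LEGACY_JSON_OPTIONS` are hypothetical names, not library API. Since this release also renames citation to reference in the json output, the `citations` key in particular may behave differently in other versions:

```javascript
// Hypothetical helper: ask .json() for the top-level fields that 6.0.0
// stopped returning by default. Option names are copied from the release note.
const LEGACY_JSON_OPTIONS = { templates: true, coordinates: true, citations: true }

function legacyJson(doc, extra = {}) {
  // caller-supplied options win over the legacy defaults
  return doc.json(Object.assign({}, LEGACY_JSON_OPTIONS, extra))
}
```

Passing a second argument such as `{ citations: false }` switches a single field back off while keeping the others.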

6.1.0

  • titlecase internal link destinations #192

6.2.0

  • support categories in redirects
  • add mongo-encoding from dumpster-dive

6.3.0

  • support way (+20%?) more templates.

7.0.0

  • change result-format in a lot of templates, for more consistency.
    • notably: reference format, see also, IPA, main
  • support colspan/rowspan in tables (a little!)
  • support implicit first-row headers for some tables
  • return templates even if they have no data
  • begin support for some well-used {{foo start}}...{{foo end}} templates
  • remove empty [] for some more section properties in .json() response
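
To make the colspan/rowspan item concrete: a spanning cell has to be copied into every grid slot it covers before rows can be read as records. The following is a generic sketch of that idea, not the library's parser. The function name `expandSpans` and the cell fields `text`, `rowspan`, and `colspan` are assumptions made for illustration:

```javascript
// Generic sketch, NOT the library's implementation:
// expand rowspan/colspan cells into a full rectangular grid.
// Each input cell is { text, rowspan?, colspan? } (assumed field names).
function expandSpans(rows) {
  const grid = rows.map(() => [])
  rows.forEach((cells, r) => {
    let c = 0
    cells.forEach(cell => {
      // skip slots already filled by a rowspan from an earlier row
      while (grid[r][c] !== undefined) c += 1
      const rowspan = cell.rowspan || 1
      const colspan = cell.colspan || 1
      // copy the cell text into every slot it covers, clipped to the table height
      for (let dr = 0; dr < rowspan && r + dr < grid.length; dr += 1) {
        for (let dc = 0; dc < colspan; dc += 1) {
          grid[r + dr][c + dc] = cell.text
        }
      }
      c += colspan
    })
  })
  return grid
}
```

For example, `expandSpans([[{ text: 'A', rowspan: 2 }, { text: 'B' }], [{ text: 'C' }]])` returns `[['A', 'B'], ['A', 'C']]`.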

5.3.1

21 Sep 16:57
1e5f3e1

last stable release before v6

from changelog:

5.1.0

  • improved support for gallery tag
  • more support for wiktionary grammar templates
  • tweak some regexes

5.2.0

  • make .json() results return proper json for tables

5.3.0

  • add infobox html back into html output (tentative)
  • redirect support in .json(), .html() output
  • remove empty [] properties in .json() results (saves disk space!)
  • keep # anchor data in .links()
  • show links default-on in latex output, like in md and html
  • render html/latex/json 'soft redirect', instead of blank pages

5.0.0

08 Aug 22:17
b7807a4

3.0.0

  • move .parse() to main wtf() method
  • allow repeated processes without a pre-parse of the document
  • wtf.fetch() uses promises, and native fetch() method (when available)
  • allow per-section images, lists, tables + templates
  • section depth values now start at 0
  • infobox values now return sentence objects
  • latex output (thanks @niebert!)
  • refactor shell scripts to wtf_wikipedia Toronto --plaintext
  • use babel-preset-env cause it's new-new
  • update deps

3.1.0

  • improved .json() results
  • guess a page's title based on bold formatting in first sentence
  • make section.title a function

4.0.0

  • 🚨 non-api changing, but large result-format change
  • add .wikitext() method to Document, Section, Sentence (thanks @niebert)
  • move infobox, citation parser/data to Section class
  • .templates() are now an ordered array, instead of an object, and include infoboxes and citations
  • add (early) support for 'generic' key-value template parsing
  • normalize/lowercase template/infobox properties - add loose .get('key') method to Infobox class
  • mess-around with citation-template formatting
  • beginning to support unknown template forms
  • move date data from Sentence to Section object.
  • rollback of awkward+undocumented options param in parser (but keep options param for output methods)
  • add support for about a hundred new templates
  • templates, including citations, try to be flat-text, and no longer return Sentence objects

4.1.0

  • remove repeated/redundant text in .links() results
  • don't automatically titlecase link srcs anymore

4.2.0

  • return a result or undefined for sentences.bolds(0), and the like

4.2.2

  • support dollar templates

4.5.0

  • support section(0).wikitext()
  • support inline {{marriage}} template
  • handle dangling semi-colons in first-sentence parentheses

4.6.0

  • <gallery> tag support in .images()
  • support pageids again in .fetch()
  • better disambiguation-page detection in english
  • remove wikitext from caption titles
  • support 3-level templates (whew!)

5.0.0

  • new Table class and List classes
  • improved table-parser - generate name col1 instead of col-0
  • support options.verbose_template for debugging
  • support recursive tables
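
The column-naming change can be pictured as a fallback applied when a table has no header for a column. This is a generic sketch rather than the library's table parser. The name `rowsToObjects` is hypothetical, and the 1-based numbering is an assumption read off "col1 instead of col-0" above:

```javascript
// Generic sketch, not the library's table parser.
// Turns rows of cell strings into key-value objects; a column without a header
// gets the fallback name col1, col2, ... (1-based numbering is an assumption).
function rowsToObjects(rows, headers = []) {
  return rows.map(cells => {
    const obj = {}
    cells.forEach((text, i) => {
      const key = headers[i] || `col${i + 1}`
      obj[key] = text
    })
    return obj
  })
}
```

Here `rowsToObjects([['a', 'b']])` returns `[{ col1: 'a', col2: 'b' }]`, and a partial header list such as `['name']` only renames the first column.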

1.0.0

22 Jun 21:27

breaking change from 0.x: sections are now formatted as an array of objects, with depth information.

tables are parsed into an array of key-value pairs.

the options object is removed.

everything is refactored.