Skip to content

cawfree/reuters-dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

6 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

reuters-dataset

๐Ÿ—ž๏ธ A tool for downloading and parsing Reuters-21578. These are a collection of documents that appeared on Reuters newswire back in 1987.

code style: prettier

๐Ÿ”ฅ Features

  • Asynchronously caches the full dataset to your temporary directory.
    • This reduces your project size.
  • Prettifies the results.
    • Uses proper JSON naming conventions and common-sense values.

๐Ÿš€ Getting Started

Using npm:

npm install --save reuters-dataset

Using yarn:

yarn add reuters-dataset

โœ๏ธ Usage

import getReutersDataset from 'reuters-dataset';

(
  async () => {
    const { exchanges, orgs, people, places, topics, articles } = await getReutersDataset();
  }
)();

๐Ÿ“Œ Example

{
  "$": {
    "topics": true,
    "lewissplit": "TRAIN",
    "cgisplit": "TRAINING-SET",
    "oldid": "5544",
    "newid": "1"
  },
  "topics": ["cocoa"],
  "places": ["el-salvador", "usa", "uruguay"],
  "people": [],
  "orgs": [],
  "exchanges": [],
  "companies": [],
  "text": {
    "title": "BAHIA COCOA REVIEW",
    "dateline": "SALVADOR, Feb 26 -",
    "body": "Showers continued throughout [...]"
  },
  "date": "1987-02-26T15:01:01.790Z"
}

โœŒ๏ธ License

MIT

Buy @cawfree a coffee

About

๐Ÿ—ž๏ธ A tool for downloading and parsing Reuters-21578. These are a collection of documents that appeared on Reuters newswire back in 1987.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published