Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unify WARC writing + CDXJ indexing into single class #507

Merged
merged 5 commits into from Mar 26, 2024

Commits on Mar 23, 2024

  1. remove warcresourcewriter

    unify warc-writing into single WARCWriter class to support cdx indexing for all records
    create dedicated writers for screenshots and text
    ikreymer committed Mar 23, 2024
    Configuration menu
    Copy the full SHA
    adbcf76 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    750d51a View commit details
    Browse the repository at this point in the history
  3. close files on interrupt

    ikreymer committed Mar 23, 2024
    Configuration menu
    Copy the full SHA
    6e708f0 View commit details
    Browse the repository at this point in the history

Commits on Mar 26, 2024

  1. Update src/crawler.ts

    Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>
    ikreymer and tw4l committed Mar 26, 2024
    Configuration menu
    Copy the full SHA
    9566073 View commit details
    Browse the repository at this point in the history
  2. Fix linting

    tw4l committed Mar 26, 2024
    Configuration menu
    Copy the full SHA
    8b1ea95 View commit details
    Browse the repository at this point in the history