Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug: Deleted items re-appear if they are in the indexing queue #1309

Open
235 opened this issue Jan 4, 2024 · 1 comment
Open

Bug: Deleted items re-appear if they are in the indexing queue #1309

235 opened this issue Jan 4, 2024 · 1 comment
Labels
expected: release after next size: hard status: blocked Work is stalled because it's blocked on some other issue or architectural challenge touches: data/schema/architecture type: bug report why: functionality Intended to improve ArchiveBox functionality or features

Comments

@235
Copy link

235 commented Jan 4, 2024

Describe the bug

Deleted items re-appear with a newer date when their indexing has commenced. Likely cause - the deleted items were sitting in the indexer queue, and deletion in the list did not renew the indexing queue.

Steps to reproduce

  1. Import a set of URLS (~200 in my case)
  2. Delete some items, as they are being retrieved
  3. If a deleted item has not been indexed yet - it will re-appear with a newer date when indexer will get to it in the queue

ArchiveBox version

v0.7.1.

This bug could be related, as I've independently from this problem observed it on the latest version: #433

@pirate
Copy link
Member

pirate commented Jan 4, 2024

Ah yeah unfortunately this is a known bug. It's exactly as you guessed, the deleted items are not cleared from the import queue when deleted via the UI.

As a workaround I recommend tagging them with a tag to-delete instead, and then after the import completes you can filter for that tag and delete all the Snapshots that show up.

This should be fixed in the process of my upcoming refactor that moves us to a real job queue system https://github.com/gaiacoop/django-huey.

@pirate pirate added type: bug report size: hard why: functionality Intended to improve ArchiveBox functionality or features touches: data/schema/architecture status: blocked Work is stalled because it's blocked on some other issue or architectural challenge expected: release after next labels Jan 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
expected: release after next size: hard status: blocked Work is stalled because it's blocked on some other issue or architectural challenge touches: data/schema/architecture type: bug report why: functionality Intended to improve ArchiveBox functionality or features
Projects
None yet
Development

No branches or pull requests

2 participants