Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Load Sentiment140 failed with HTTP 404 #5289

Open
rayk opened this issue Feb 13, 2024 · 1 comment
Open

Load Sentiment140 failed with HTTP 404 #5289

rayk opened this issue Feb 13, 2024 · 1 comment
Assignees
Labels
bug Something isn't working

Comments

@rayk
Copy link

rayk commented Feb 13, 2024

/!\ PLEASE INCLUDE THE FULL STACKTRACE AND CODE SNIPPET

Short description
Loading the Stanford Sentiment 140 dataset (via Tensorflow data) fails within 404; it is not there.

Environment information

  • Operating System: MACOS
  • Python version: 3.11
tensorboard-data-server      0.7.2
tensorflow                   2.15.0
tensorflow-datasets          4.9.4
  • Does the issue still exists with the last tfds-nightly package (pip install --upgrade tfds-nightly) ? Appears so

Reproduction instructions

ds_builder = tds.builder('sentiment140')

Stack

File ~/Projects/rmap_repo/botty/.venv/lib/python3.11/site-packages/tensorflow_datasets/core/download/downloader.py:331, in _assert_status(response)
    329 """Ensure the URL response is 200."""
    330 if response.status_code != 200:
--> 331   raise download_utils_lib.DownloadError(
    332       'Failed to get url {}. HTTP code: {}.'.format(
    333           response.url, response.status_code
    334       )
    335   )

DownloadError: Failed to get url https://www.cs.stanford.edu/people/alecmgo/trainingandtestdata.zip. HTTP code: 404.

Expected behaviour
What you expected to happen.

Additional context
It looks like a back URL; who would move this without a redirect?

@rayk rayk added the bug Something isn't working label Feb 13, 2024
@ccl-core
Copy link
Collaborator

ccl-core commented Feb 14, 2024

Hi @rayk , thank you for contacting us regarding the Sentiment140 dataset.

I was wondering why TFDS is trying to download https://www.cs.stanford.edu/people/alecmgo/trainingandtestdata.zip while the URL on the builder code is http://cs.stanford.edu/people/alecmgo/trainingandtestdata.zip (without www)?

Have you by any chance modified your local copy of the code?

@ccl-core ccl-core self-assigned this Feb 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants