Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problem loading dataset in huggingface #4

Open
dimakarp1996 opened this issue Sep 9, 2022 · 1 comment
Open

Problem loading dataset in huggingface #4

dimakarp1996 opened this issue Sep 9, 2022 · 1 comment

Comments

@dimakarp1996
Copy link

Hello. I faced a problem while using russian SuperGlue in HuggingFace. While having 1.16.0 version of HuggingFace library datasets, and trying to make command

from datasets import load_dataset
load_dataset('russian_super_glue','rcb')

I face the ArrowTypeError mentioned on the screenshot. Cleaning up the directory where the dataset file was cached (.cache/huggingface/datasets/json) didn't solve this issue.
Screenshot from 2022-09-09 12-58-53

Note that the similar command
load_dataset('super_glue','cb')
was completed successfully, so the problem is not on the HuggingFace side.

When I use (on different machine) datasets 2.4.0, I get the following error.
Screenshot from 2022-09-09 13-30-52
And again, similar command with any english SuperGLUE dataset works well.

@averkij
Copy link

averkij commented Jan 12, 2023

Hello! Thanks, we'll dig into the certificate issue.

You can use the following code as a workaround with the latest datasets version:

import os
os.environ["CURL_CA_BUNDLE"]=""

load_dataset('russian_super_glue','rcb')

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants