Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Read NetCDF files from S3 bucket without downloading the files in the local system #23

Open
najiya-12 opened this issue Jun 5, 2022 · 1 comment
Labels
enhancement New feature or request

Comments

@najiya-12
Copy link

najiya-12 commented Jun 5, 2022

I am looking for one year of ABI-L2-CODC data, however, it will be a very large volume of data using goes2go.
data for one month is around 8000 files
image

@blaylockbk
Copy link
Owner

Hi @najiya-12, thanks for your question.

goes2go doesn't currently have this capability and I'm not currenlty planning to implement this (pull request is welcome!).
goes2go can give you a list of file paths for a time range:

# note: this uses the GOES class from the most recent push to the main branch
from goes2go import GOES
G = GOES(16, "ABI-L2-COD", domain='C')
df = G.timerange('2021-01-01', '2021-02-01', download=False)

image

With that info, xarray does have some capability of reading data directly from S3 without downloading it (see here). (I'm run into errors when I do that ValueError: I/O operation on closed file.; I wonder if that's an issue with my virtual environment or if it's a window problem.)


What you are trying to do sounds like a "big data" problem. I would recommend checking out some other tools like https://github.com/fsspec/kerchunk. They might have the right tooling for your needs.

@blaylockbk blaylockbk added the enhancement New feature or request label Jun 5, 2022
@blaylockbk blaylockbk changed the title is it possible to use goes2go to read netcdf files from S3 bucket without downloading the files in the local system Read NetCDF files from S3 bucket without downloading the files in the local system Jun 5, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants