Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

jupyter-ai for a geospatial foundation model -- allow rendering of geojson file types? #737

Open
k-y-le opened this issue Apr 19, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@k-y-le
Copy link

k-y-le commented Apr 19, 2024

Problem

Hi- I’m working with Clay, a foundation model for Earth Observation data. I’m exploring what an integration with Jupyter-ai would look like, but have a couple questions due to the differences between a vision based model (Clay) and a language model such as all the integrations currently available within jupyter-ai.

There are a couple tasks I would like to accomplish with jupyter-ai and Clay, to start. First, to create embeddings, which currently is possible via an API — inputs are a geojson polygon and outputs are a list of geojsons.

The second task is to query embeddings (do similarity search) — in this case, the user offers the ID of an embedding as an input to the API (can be created in the task above ^ or otherwise and receives back a list of recommended similar embeddings (which can then be converted to geojson)

Proposed Solution

I can think of a few ways to make this happen:

  • on Clay's end, we're working on translating text to Earth embeddings, which means in the future a language based query "identify plastic pollution sites in Indonesia" would be functional. Outputs would still need to be in a geojson format to be most useful -- this is a WIP and not ready for integration yet
  • including geojson-extension to render geojson files within the notebook, and allowing it as valid output when using Clay as the model
  • from the UI perspective, there are some mentions of language models explicitly (selecting the model) that could be changed to be more agnostic, or the selection of Clay could operate as a different dropdown / UX

Additional context

I understand that jupyter-ai is meant to be vendor-agnostic, so perhaps the best option is to stick with text-based outputs, as best possible (within Clay, we’re working on translating EO data into text formats, which would be helpful, but is still a ways off), but I think it would be a loss if we didn’t consider the ways in which non-text inputs and outputs can be made available. This is probably part of a larger conversation about how jupyter-ai is set up, and so I wanted to create an issue to hear if there are other thoughts on the subject. Happy to provide more context as needed.

@k-y-le k-y-le added the enhancement New feature or request label Apr 19, 2024
Copy link

welcome bot commented Apr 19, 2024

Thank you for opening your first issue in this project! Engagement like this is essential for open source projects! 🤗

If you haven't done so already, check out Jupyter's Code of Conduct. Also, please try to follow the issue template as it helps other other community members to contribute more effectively.
welcome
You can meet the other Jovyans by joining our Discourse forum. There is also an intro thread there where you can stop by and say Hi! 👋

Welcome to the Jupyter community! 🎉

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant