[Feature Request]: Docs to be crawler friendly, and LLM discoverable #2203

Open
nikhil-swamix opened this issue May 15, 2024 · 2 comments
Labels
enhancement New feature or request wontfix This will not be worked on

Comments

@nikhil-swamix

Describe the problem

I tried loading the docs with the requests library and parsing them, but because of the tabbed JS and Python content, the pages need a browser to render.
example
image
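
To illustrate the symptom described above, here is a minimal sketch of a check for whether fetched HTML already contains readable text or is mostly an empty shell waiting for JavaScript. It uses only the Python standard library. The names `visible_text` and `looks_client_rendered` and the 200-character threshold are assumptions made for this example; they are not part of requests or of the docs site.

```python
from html.parser import HTMLParser


class VisibleTextExtractor(HTMLParser):
    """Collect text a non-JS client would see, skipping script/style bodies."""

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style elements.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def visible_text(html: str) -> str:
    """Return the text present in the raw HTML, joined by single spaces."""
    parser = VisibleTextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def looks_client_rendered(html: str, min_chars: int = 200) -> bool:
    """Heuristic (assumed threshold): very little static text suggests JS rendering."""
    return len(visible_text(html)) < min_chars
```

Passing the body of an HTTP response (for example `response.text` from requests) to `looks_client_rendered` gives a rough signal only; a page can contain plenty of static text and still hide its tabbed code samples behind JavaScript.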

Describe the proposed solution

Use static site generation and drop Next and React once and for all! That stack is meant for really huge projects with heavy components, not documentation, so it is not the right tech stack here (my opinion; I prefer Svelte). Granted, I can copy the same content from the Markdown source files on GitHub, but that does not allow systematic discovery via crawlers, whose output can be indexed for LLM RAG purposes to prevent hallucinations.

In short:
enable the prerender flag or server-side rendering in Next.

Alternatives considered

Hugo or some other well-known SSG.

Importance

nice to have

Additional Information

Minor priority, but very useful.

@nikhil-swamix nikhil-swamix added the enhancement New feature or request label May 15, 2024
@tazarov
Contributor

tazarov commented May 17, 2024

@nikhil-swamix, thanks for your explanation. There are several reasons why we use Markdoc and Next, and this is unlikely to change.

  • We value user experience over bot/crawler experience (as you pointed out, anyone who needs to index the docs can use the GH markdown files).
  • We want visual continuity between the Chroma docs and the hosted platform (which will be coming out soon).
  • While we value your and the rest of the community's opinions, we do certain things a certain way 😀
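
For the first point, a minimal sketch of how the markdown files could be prepared for indexing: split each file into one section per heading, so each section can be embedded or stored separately. It uses only the Python standard library. The function name `split_markdown_sections` and the returned dictionary shape are assumptions for illustration, not an existing Chroma or Markdoc API, and Markdoc-specific tags are left in the text untouched.

```python
import re

# ATX headings such as "# Title" or "## Title ##" (1 to 6 leading hashes).
HEADING = re.compile(r"^(#{1,6})\s+(.*?)\s*#*\s*$")
# Opening or closing line of a fenced code block.
FENCE = re.compile(r"^\s*(```|~~~)")


def split_markdown_sections(text):
    """Split markdown into a list of {"title", "text"} dicts, one per heading.

    Lines inside fenced code blocks are never treated as headings, so a
    Python comment like "# note" stays part of its section.
    """
    sections = []
    title, body = "", []
    in_fence = False

    def flush():
        # Store the current section unless it is completely empty.
        if title or any(line.strip() for line in body):
            sections.append({"title": title, "text": "\n".join(body).strip()})

    for line in text.splitlines():
        if FENCE.match(line):
            in_fence = not in_fence
        match = None if in_fence else HEADING.match(line)
        if match:
            flush()
            title, body = match.group(2), []
        else:
            body.append(line)
    flush()
    return sections
```

Text that appears before the first heading is returned as a section with an empty title, so nothing from the file is dropped.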

Have you considered using a library other than requests? Have a look here for inspiration: https://python.langchain.com/v0.1/docs/integrations/document_loaders/url/

@tazarov tazarov added the wontfix This will not be worked on label May 17, 2024
@nikhil-swamix
Author

nikhil-swamix commented May 17, 2024 via email
