Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SCRAPER] - www.colruyt.be/nl/lekker-koken recipe scraping returns failed to scrape url #3524

Closed
3 tasks done
melsenm opened this issue Apr 27, 2024 · 2 comments · Fixed by #3457 · May be fixed by #3529
Closed
3 tasks done

[SCRAPER] - www.colruyt.be/nl/lekker-koken recipe scraping returns failed to scrape url #3524

melsenm opened this issue Apr 27, 2024 · 2 comments · Fixed by #3457 · May be fixed by #3529
Labels
bug Something isn't working scraper triage

Comments

@melsenm
Copy link

melsenm commented Apr 27, 2024

First Check

  • I used the GitHub search to find a similar issue and didn't find it.

  • I have verified that this issue is not related to the underlying library
    hhyrsev/recipe-scrapers by 1) checking
    the debugger and data is returned, 2)
    verifying that there are errors in the log related to application level code, or
    3) verified that the site provides recipe data, or is otherwise supported by
    hhyrsev/recipe-scrapers

  • This issue can be replicated on the demo site (https://demo.mealie.io/)

Please provide 1-5 example URLs that are having errors

https://www.colruyt.be/nl/lekker-koken/recept/scampi-torpedo-met-chimichurri
https://www.colruyt.be/nl/lekker-koken/recept/one-pot-pasta-met-4-groenten

Please provide your logs for the Mealie container docker logs <container-id> > mealie.logs

INFO 2024-04-27T12:29:09 - HTTP Request: GET https://www.colruyt.be/nl/lekker-koken/recept/scampi-torpedo-met-chimichurri "HTTP/1.1 456 "
ERROR 2024-04-27T12:29:09 - Recipe Scraper was unable to extract a recipe from https://www.colruyt.be/nl/lekker-koken/recept/scampi-torpedo-met-chimichurri
ERROR 2024-04-27T12:29:09 - failed to scrape url during bulk url import https://www.colruyt.be/nl/lekker-koken/recept/scampi-torpedo-met-chimichurri
Traceback (most recent call last):
ERROR 2024-04-27T12:29:09 - 400: {'details': 'BAD_RECIPE_DATA'}
fastapi.exceptions.HTTPException: 400: {'details': 'BAD_RECIPE_DATA'}
raise HTTPException(status.HTTP_400_BAD_REQUEST, {"details": ParserErrors.BAD_RECIPE_DATA.value})
File "/app/mealie/services/scraper/scraper.py", line 37, in create_from_url
recipe, _ = await create_from_url(url, self.translator)
File "/app/mealie/services/scraper/recipe_bulk_scraper.py", line 88, in _do

Deployment

Docker (Synology)

@melsenm melsenm added bug Something isn't working scraper triage labels Apr 27, 2024
@melsenm
Copy link
Author

melsenm commented Apr 27, 2024

2 weeks ago it worked adding recipes from this website

@melsenm melsenm changed the title [SCRAPER] - www.colruyt.be/nl/lekker-koken recipe scraping returns 403 Forbidden [SCRAPER] - www.colruyt.be/nl/lekker-koken recipe scraping returns failed to scrape url Apr 27, 2024
@ndragon798
Copy link

I think my PR #3526 will fix this. My mealie instance pulled the recipe just fine with the change in place.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working scraper triage
Projects
None yet
2 participants