[SCRAPER] - www.colruyt.be/nl/lekker-koken recipe scraping returns failed to scrape url #3524

melsenm · 2024-04-27T10:39:22Z

First Check

I used the GitHub search to find a similar issue and didn't find it.
I have verified that this issue is not related to the underlying library
hhyrsev/recipe-scrapers by 1) checking
the debugger and data is returned, 2)
verifying that there are errors in the log related to application level code, or
3) verified that the site provides recipe data, or is otherwise supported by
hhyrsev/recipe-scrapers
This issue can be replicated on the demo site (https://demo.mealie.io/)

Please provide 1-5 example URLs that are having errors

https://www.colruyt.be/nl/lekker-koken/recept/scampi-torpedo-met-chimichurri
https://www.colruyt.be/nl/lekker-koken/recept/one-pot-pasta-met-4-groenten

Please provide your logs for the Mealie container `docker logs <container-id> > mealie.logs`

INFO 2024-04-27T12:29:09 - HTTP Request: GET https://www.colruyt.be/nl/lekker-koken/recept/scampi-torpedo-met-chimichurri "HTTP/1.1 456 "
ERROR 2024-04-27T12:29:09 - Recipe Scraper was unable to extract a recipe from https://www.colruyt.be/nl/lekker-koken/recept/scampi-torpedo-met-chimichurri
ERROR 2024-04-27T12:29:09 - failed to scrape url during bulk url import https://www.colruyt.be/nl/lekker-koken/recept/scampi-torpedo-met-chimichurri
Traceback (most recent call last):
ERROR 2024-04-27T12:29:09 - 400: {'details': 'BAD_RECIPE_DATA'}
fastapi.exceptions.HTTPException: 400: {'details': 'BAD_RECIPE_DATA'}
raise HTTPException(status.HTTP_400_BAD_REQUEST, {"details": ParserErrors.BAD_RECIPE_DATA.value})
File "/app/mealie/services/scraper/scraper.py", line 37, in create_from_url
recipe, _ = await create_from_url(url, self.translator)
File "/app/mealie/services/scraper/recipe_bulk_scraper.py", line 88, in _do

Deployment

Docker (Synology)

The text was updated successfully, but these errors were encountered:

melsenm · 2024-04-27T10:40:38Z

2 weeks ago it worked adding recipes from this website

ndragon798 · 2024-04-29T00:37:12Z

I think my PR #3526 will fix this. My mealie instance pulled the recipe just fine with the change in place.

melsenm added bug Something isn't working scraper triage labels Apr 27, 2024

melsenm changed the title ~~[SCRAPER] - www.colruyt.be/nl/lekker-koken recipe scraping returns 403 Forbidden~~ [SCRAPER] - www.colruyt.be/nl/lekker-koken recipe scraping returns failed to scrape url Apr 27, 2024

ndragon798 mentioned this issue Apr 29, 2024

fix: Update user-agent #3526

Closed

hay-kot linked a pull request Apr 29, 2024 that will close this issue

chore: bump user agent #3457

Merged

2 tasks

michael-genson closed this as completed in #3457 Apr 29, 2024

ndragon798 mentioned this issue Apr 29, 2024

feat: Automatically keep user-agent up to date with latest user-agent #3529

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SCRAPER] - www.colruyt.be/nl/lekker-koken recipe scraping returns failed to scrape url #3524

[SCRAPER] - www.colruyt.be/nl/lekker-koken recipe scraping returns failed to scrape url #3524

melsenm commented Apr 27, 2024

melsenm commented Apr 27, 2024

ndragon798 commented Apr 29, 2024

[SCRAPER] - www.colruyt.be/nl/lekker-koken recipe scraping returns failed to scrape url #3524

[SCRAPER] - www.colruyt.be/nl/lekker-koken recipe scraping returns failed to scrape url #3524

Comments

melsenm commented Apr 27, 2024

First Check

Please provide 1-5 example URLs that are having errors

Please provide your logs for the Mealie container docker logs <container-id> > mealie.logs

Deployment

melsenm commented Apr 27, 2024

ndragon798 commented Apr 29, 2024

Please provide your logs for the Mealie container `docker logs <container-id> > mealie.logs`