Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feature request - clean for char(160) \xa0 #198

Open
dornech opened this issue Sep 20, 2020 · 1 comment
Open

feature request - clean for char(160) \xa0 #198

dornech opened this issue Sep 20, 2020 · 1 comment

Comments

@dornech
Copy link

dornech commented Sep 20, 2020

Hi there, many webpages use non-breaking space in textelements, however for subsequent processes this is sometimes troublesome. What's about an option for get() to clean a returned string value, i. e. to replace \xa0 with a normal space automatically?

@Gallaecio
Copy link
Member

I see your point, however I’m not sure it’s worth it doing at the Parsel level. I think it makes sense for post-processing to happen out of Parsel, at a later stage (e.g. using https://github.com/scrapy/itemloaders).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants