Character encoding on Windows #129

Open
rhartig-ct opened this issue Dec 9, 2021 · 0 comments · May be fixed by #130

Comments

@rhartig-ct
Contributor

While writing my own plugin, I ran into an issue with the results generated by my worker plugin. On Windows, the default text encoding is cp1252 (in most Western locales), a single-byte encoding limited to 256 characters. Malware samples commonly contain special characters or text in other languages, and writing a character that cp1252 cannot represent to a file opened with that encoding fails. The results file is currently opened without an explicit encoding:

with open(path.joinpath(filename), 'x') as outfile:
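The failure can be reproduced on any platform by naming the encoding explicitly. This is only a minimal sketch: the `encodable` helper and the sample string are hypothetical and not taken from the project.

```python
import locale

# Encoding that open() uses when none is given (typically cp1252 on
# Western-locale Windows unless Python's UTF-8 mode is enabled).
print(locale.getpreferredencoding(False))

# Hypothetical result text: Latin-1 accent, Cyrillic, and Chinese characters.
sample = "caf\u00e9 \u0416 \u4f60\u597d"


def encodable(text: str, encoding: str) -> bool:
    """Return True if every character in text can be encoded with encoding."""
    try:
        text.encode(encoding)
        return True
    except UnicodeEncodeError:
        return False


print(encodable("caf\u00e9", "cp1252"))  # accented Latin letters fit in cp1252
print(encodable(sample, "cp1252"))       # Cyrillic and Chinese do not
print(encodable(sample, "utf-8"))        # UTF-8 covers all of Unicode
```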

Furthermore, it is not obvious to a user which component is at fault: the worker plugin (of which there may be many, making it even more ambiguous) or the connector plugin.
To better support Windows, the code that writes results should either:

  1. Write in binary mode
  2. Specify an encoding to accommodate more characters, such as utf-8
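Both options could look roughly like the sketch below. It keeps the exclusive-create mode (`'x'`) from the original call; the function names, file names, and sample text are hypothetical, and the choice of UTF-8 for the binary variant is an assumption rather than what the linked pull request does.

```python
import tempfile
from pathlib import Path

# Hypothetical result text with characters that cp1252 cannot represent.
result = "sample: \u0416 \u4f60\u597d"


def write_result_binary(path: Path, filename: str, text: str) -> None:
    # Option 1: binary mode. The caller controls the bytes, so the
    # platform's default text encoding is never involved.
    with open(path.joinpath(filename), "xb") as outfile:
        outfile.write(text.encode("utf-8"))


def write_result_utf8(path: Path, filename: str, text: str) -> None:
    # Option 2: text mode with an explicit encoding instead of the default.
    with open(path.joinpath(filename), "x", encoding="utf-8") as outfile:
        outfile.write(text)


with tempfile.TemporaryDirectory() as tmp:
    out_dir = Path(tmp)
    write_result_binary(out_dir, "binary.txt", result)
    write_result_utf8(out_dir, "utf8.txt", result)

    # Read both files back to confirm nothing was lost.
    binary_roundtrip = out_dir.joinpath("binary.txt").read_bytes().decode("utf-8")
    utf8_roundtrip = out_dir.joinpath("utf8.txt").read_text(encoding="utf-8")
```

Either way, whatever reads the results back has to use the same encoding, so stating it explicitly on both sides avoids depending on the platform default.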