Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhance Shapefile Reader to handle UTF-8 Code Points #44

Open
gwlucastrig opened this issue Feb 21, 2019 · 1 comment
Open

Enhance Shapefile Reader to handle UTF-8 Code Points #44

gwlucastrig opened this issue Feb 21, 2019 · 1 comment

Comments

@gwlucastrig
Copy link
Owner

The Shapefile Reader / DBF file reader should be able to detect code-point files (CPG file extension) and adjust reading to handle non-ASCII cases.

Currently, the reader expect String inputs to be either ASCII or ISO-8859-1 (it doesn't distinguish between them). It should probably be able to handle at least UTF-8 and maybe in ISO-8859-1 mode make sure that it screens out non-valid characters.

Not sure I would bother with all the many variations. Just get these major ones likely to be used by Esri ArcGIS users. Something like Windows 1252 could wait until somebody identifies an actual data set that uses it.

@gwlucastrig
Copy link
Owner Author

gwlucastrig commented Jul 31, 2023

For abbreviation "cpg", is the correct term "code page" not "code point" ?

A partial update was made for Issue 99, change to DbfField

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant