in2csv: GeoJSON metadata discarded #870

jayvdb · 2017-07-24T10:42:24Z

The following metadata fields from the "FeatureCollection" object are discarded:

-  "generator": "overpass-ide",
-  "copyright": "....",
-  "timestamp": "2017-07-17T03:42:02Z",

Probably others also.

Perhaps they could be stored in a column header, or a special row, which would assist with Round-tripping data ( #868 ).

The text was updated successfully, but these errors were encountered:

jpmckinney · 2017-07-24T21:29:44Z

It's not clear how to store these in a way that still produces a generic CSV (rather than a CSV that only csvkit knows how to read), but I'll leave the issue open for creative suggestions.

jayvdb · 2017-07-25T03:48:45Z

I suggested embedding in column header.

jpmckinney · 2017-07-25T13:44:56Z

I suppose it could be an opt-in flag - as otherwise most users will find it surprising to have extra columns in their CSV output.

jayvdb · 2017-07-26T06:41:22Z

I was not meaning an extra column; instead add it into a existing column header.

anyway ... a better idea would be to read the metadata from the original. Assuming csvkit has a sensible streaming json reader, it could read only as much of the original to capture the metadata at the top, and emit that in the output.

jpmckinney · 2017-07-26T14:23:00Z

Yeah, my challenge is determining where to output it. in2csv with GeoJSON just outputs a row for each feature (as you know). If we put metadata in a special row or in a special header, then that pollutes the data with non-data that other tools won't know how to parse, because storing metadata like that in CSVs is not standardized. And then we'd have to add support to all csvkit tools for parsing that metadata to avoid it interfering with other operations.

jayvdb · 2017-07-26T14:43:42Z

Another, probably better idea, is adding an "append" / "merge" option, so that I can provide a very small .geojson file with the metadata to be used and then the extra records can be merged into it (in-place rather than stdout would be my preference).

Then in2csv could also emit the metadata as a small geojson file, containing any data it couldnt put into the csv.

jpmckinney · 2017-08-04T13:13:20Z

That sounds reasonable. Like a --write-metadata option for in2csv, and a --read-metadata for csvjson.

jayvdb mentioned this issue Jul 24, 2017

Round-trip GeoJSON #868

Closed

jpmckinney added feature Low Priority labels Jul 24, 2017

jpmckinney modified the milestone: 1.0.3 Jan 28, 2018

jpmckinney added in2csv and removed Low Priority labels Oct 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

in2csv: GeoJSON metadata discarded #870

in2csv: GeoJSON metadata discarded #870

jayvdb commented Jul 24, 2017

jpmckinney commented Jul 24, 2017

jayvdb commented Jul 25, 2017

jpmckinney commented Jul 25, 2017

jayvdb commented Jul 26, 2017

jpmckinney commented Jul 26, 2017

jayvdb commented Jul 26, 2017

jpmckinney commented Aug 4, 2017

in2csv: GeoJSON metadata discarded #870

in2csv: GeoJSON metadata discarded #870

Comments

jayvdb commented Jul 24, 2017

jpmckinney commented Jul 24, 2017

jayvdb commented Jul 25, 2017

jpmckinney commented Jul 25, 2017

jayvdb commented Jul 26, 2017

jpmckinney commented Jul 26, 2017

jayvdb commented Jul 26, 2017

jpmckinney commented Aug 4, 2017