Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data format #722

Open
sakshamg94 opened this issue Aug 7, 2020 · 4 comments
Open

Data format #722

sakshamg94 opened this issue Aug 7, 2020 · 4 comments

Comments

@sakshamg94
Copy link

sakshamg94 commented Aug 7, 2020

among the pms_pm_01_0 and similarly for other radiation and PM fields, the numbers have commas in them such that they land as strings when i import data from the CSVs
like 1,000 instead of 1000
commas are not supported in float dtype. so i think if json to csv results in figures with commas in them, then irrespective of the platform (I have checked on Python), we should see those inconsistencies

Here's how I fix it using Pandas library (Py):
df['pms_pm01_0'] = df['pms_pm01_0'].astype(str).str.replace(',', '').astype(float)

@matschaffer
Copy link
Contributor

@sakshamg94 can you post a sample CSV with a comma in it? Could be something going on in the exporting step.

@sakshamg94
Copy link
Author

sakshamg94 commented Aug 7, 2020

see row. 1348 (pms payloads). Converted csv to xlsx for posting on github (csv not supported)
output-2020-07-01T00_00_00+00_00 2.xlsx

@matschaffer
Copy link
Contributor

I’d recommend zipping the file for attachment to avoid the possibility of excel changing the formatting on its own (this happened with @nokton’s investigation)

@sakshamg94
Copy link
Author

I see. good idea. Here (should be row 1347 or 1348)
output-2020-07-01T00_00_00+00_00.csv.zip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants