You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The deduplication logic for evictions dataset must take into account both the court index number and borough code. The current logic assumes that court index number is unique across the dataset and therefore is dropping data.
The scope of this issue would add the borough code and court index number to the logic.
Additional context:
There are about 100 rows per month that are missing in the nycdb but are present in the Open Data source file due to the current deduplication logic that uses only the court index number.
The deduplication logic for evictions dataset must take into account both the court index number and borough code. The current logic assumes that court index number is unique across the dataset and therefore is dropping data.
The scope of this issue would add the borough code and court index number to the logic.
Additional context:
Link to EDA notebook: https://colab.research.google.com/drive/1sLET77zixEa_bDzbaqsWuUwbsUKwhM7z?usp=sharing
The text was updated successfully, but these errors were encountered: