-
Notifications
You must be signed in to change notification settings - Fork 188
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reporting progress of get_significant_points_gdf #361
Comments
Use tqdm? |
tqdm's API looks much like rich's. The issue is how to tie this into MP's aggregation calls. There is no iterator exposed that I could use to keep track of what's happening and I can't pass an object into the aggregation call to report on its progress either. It took all night for my 2020 MBP to generate the generalised route maps for the top 25 airlines in ADSB.lol's dataset. It would be great to know roughly how long is left on jobs like this. https://tech.marksblogg.com/global-flight-tracking-adsb.html#generalising-routes |
Nice blog post, @marklit.
Heads up: The TrajectoryCollection from these 40M points has to fit into your RAM. The script will crash otherwise. Looking at the current implementation, a meaningful progress indicator will be challenging.
the remaining 50% would be
Both of these happen on init |
Is there anywhere deeper when records are iterated over one at a time? This could be a place to add a hook to a progress counter. |
Records are iterated for each trajectory individually. It would be hard to keep track of the overall progress for the whole trajectory collection |
If I feed up to 100K points into MP,
get_significant_points_gdf
will finish in a few minutes. I'm keen to potentially feed upwards 40M points. Is there some way thatget_significant_points_gdf
could be passed a function and it would report its progress?I usually use https://rich.readthedocs.io/en/stable/progress.html for tracking progress in long-running Python scripts.
The text was updated successfully, but these errors were encountered: