List of possible enhancements #84

agentmorris · 2023-05-20T18:01:31Z

Sometimes folks ping us to ask how they can contribute code to the MegaDetector project, and we don't really have a place to point them right now. Combined with the fact that a couple of important open issues have been languishing for a few weeks (months?), I got motivated to create this issue as a snapshot of our internal todo list, so I have somewhere to point folks who want to get involved. I'm making only a weak attempt at prioritization here, instead I'm just trying to sort them into logical buckets.

If you're interested in trying your hand at any of these, email us!

Feature additions for existing scripts/tools
Refactoring or re-writing stuff
Infrastructure things
Feature additions for existing scripts/tools
Cleaning up the classifier training pipeline
Miscellaneous things that are more exploratory
Other projects that could use your help
Random models someone should train

Feature additions for existing scripts/tools

Most of these would qualify as a "good first issue".

postprocess_batch_results.py currently has no support for video. When run on a .json file that points to videos, extract frames in a sensible way to generate previews.
run_detector_batch.py currently does not support checkpointing when --use_image_queue is enabled, but the image queue is helpful when running off of a slow drive. Add checkpointing support when --use_image_queue is enabled. (This is not a significant issue when using manage_local_batch.py/.ipynb to create and run jobs; the user can just use lots of tasks in lieu of checkpointing, so the priority of this issue is pretty low.)
run_inference_with_yolov5_val.py currently does not support the same checkpointing feature that run_detector_batch.py does; add checkpointing support in run_inference_with_yolov5_val.py. (This is not a significant issue when using manage_local_batch.py/.ipynb to create and run jobs; the user can just use lots of tasks in lieu of checkpointing, so the priority of this issue is pretty low.)
compare_batch_results.py supports comparing detections, but not species classifications. Add support for species classification results.
Allow run_detector_batch.py to use multiple GPUs. (This is not quite as critical as it sounds; large jobs are best run via manage_local_batch.py or manage_local_batch.ipynb anyway, and splitting jobs across multiple GPUs is handled there.)
Regarding the repeat detection elimination process... currently if you run the "find repeat detections" portion of the pipeline, and decide you just want a different threshold for the number of repeat detections you want to use to call a detection "suspicious", you have to run the whole process again. This is silly; you should be able to just change the threshold. In fact, even better, you should be able to specify around the number of suspicious detections you feel like dealing with (typically around 1000, which is around 5-10 minutes of manual review), and have that threshold determined automatically.
Allow postprocess_batch_results.py to operate on sequences, rather than just images. Sample based on sequences, do precision/recall analysis based on sequences, and render sequences in a sensible way on the output page.
In repeat detection elimination and sequence-based classification smoothing, write the smoothing parameters into the output file.
The json manager app currently hard-codes the expected structure, so we have to keep it up to date with minor additions to the .json format. This should only hard-code parameters it actually needs to operate on, and pass everything else through unmodified. Json.NET supports all the right things, we're just not doing those things right now.

Refactoring or re-writing stuff

postprocess_batch_results.py is an absurd use of Pandas right now, and has an absurd level of duplication between the code paths (with/without ground truth, with/without classification results). This could use a re-write from scratch.
repeat_detections_core.py isn't nearly as bad, but it's not ideal, and it has some really bizarre properties right now, like the fact that when you run the main function a second time to apply a set of changes after the manual review step, it repeats all the folder-separation stuff it did the first time, which is brittle and silly. Not quite a total re-write, but a significant cleanup.
The sequence-based classification smoothing process is a useful and relatively standalone piece of code that is currently buried in the notebook I use to run MegaDetector and MegaClassifier. This should be refactored into its own script, and possibly updated to operate on detection data too.

Infrastructure things

In certain Mac M1 environments, MD produces incorrect results. It is unlikely that this is specific to MD, this is likely a corner case for YOLOv5. This does not appear to happen in the recommended Python environment, but if the user upgrades YOLOv5 and/or certain Python dependencies, bizarre things might happen. See this issue and this question on the YOLOv5 repo for details and status.
A substantial number (most?) of our users prefer R, and we're forcing them to run a bunch of Python code. It would be great to either wrap the inference process in R, or port the inference code to R. IMO it's not urgent to do this for anything other than the inference code (run_detector_batch.py) and maybe separate_detections_into_folders.py.

Update: after adding this item to the list, I discovered the animl R package, which supports R-based inference for both MDv4 and MDv5. TBD whether any more porting to R is required.

Miscellaneous things that are more exploratory

Though we recently added support for running inference on small patches and stitching the results together via run_tiled_inference.py, after doing that, I discovered SAHI, which does the same thing, only with far more thought. If someone has a dataset with lots of small things (especially rodents), it would be interesting to compare the two approaches and assess whether we should drop run_tiled_inference.py in favor of SAHI.

Other projects that could use your help

If you found this text because you want to work on open-source code related to conservation, and everything I just listed is either too boring or too daunting, please don't give up! Depending on your specific skill set, maybe our close collaborators who maintain EcoAssist, CamTrap Detector, Timelapse, MEWC, or any of the platforms listed here could use contributions. Or head over to the "Open Source Solutions" forum at WILDLABS, and offer your skills there!

Random models someone should train

Now I'm letting this thread really veer off into a tangent, but FWIW, people frequently ask us "can MegaDetector do [x]?", where [x] is something MegaDetector definitely can't do. But there are some values of [x] that have come up a bunch of times and feel like the right balance of "tractable" and "useful", where there's sort of the right training data in the universe, and a focused student project could really get something going. So, to finish up this long post with lots of random ideas:

A model to classify camera trap images as obscured due to fog or snow, knocked over and staring at the sky, and/or completely obscured by vegetation
A model that runs as part of postprocess_batch_results.py to pick out "fun" images (currently we do this manually from the output of postprocess_batch_results, which is fast, but it means we're only ever searching over the ~7500 images we sample for postprocessing)
"MegaDetector for snakes"
"MegaDetector for fish"

Update: after I wrote this item, someone actually did release... wait for it... MegaFishDetector). Still, plenty of work to be done in this area. List of models and data in this space here.
Lots of camera trap data was recently ingested into Hugging Face, with the hope that someone might train a super-giant species classifier for camera trap data, and/or document a nice process for training regional classifiers. AFAIK no one has done either of the above yet.

Note to self: tags for this issue

Issue cloned from Microsoft/CameraTraps, original issue posted by agentmorris on Jan 27, 2023.

agentmorris · 2023-05-20T18:01:32Z

Another to-do might be to rewrite batch detection scripts to use PyTorch Dataloader instead of managing image I/O manually. This will also allow performing batch inference instead of looping over each image one by one. It should significantly improve inference performance.

(Comment originally posted by patelvyom)

agentmorris · 2023-05-20T18:02:34Z

Updating my response to this suggestion: rather than investing time in using the PyTorch data loader, I'd like to see someone experiment with YOLOv5's native inference tools (val.py and detect.py) as a total replacement for our inference scripts. These have all the benefits of "proper" PyTorch data loading, but also have a zillion bells and whistles, especially test-time augmentation that could improve accuracy.

--

That's a great suggestion, I'll add an item to the list... more specifically, though, the item is to do a performance test (which can be arbitrarily inelegant) to see what the benefit would be, with and without a GPU, and make sure results are identical. If the benefit is more than around a 25% speedup, it's probably worth it. If it's less than that, it may be preferable to keep the current approach, which is easier to debug and maintain, and keeps a much longer shared code path across PyTorch and TF. Also I vaguely remember that images in a batch need to be the same size, which isn't guaranteed, so either the test would need to verify that this isn't the case, or the implementation would need to break batches when the image size changes.

(Comment originally posted by agentmorris)

agentmorris · 2023-06-14T19:40:41Z

Closed the following issues:

agentmorris · 2023-06-18T18:24:11Z

Closed:

compare_batch_results.py also lacks a command-line driver (i.e., a main() wrapper)... so, add a main() function.

agentmorris · 2023-07-28T18:48:56Z

Closed the issue re: checkpoint support for multicore inference (thanks, Alex Morling!), and the issue with passing force_cpu deeper into the call stack (easier to just do that with CUDA_VISIBLE_DEVICES).

agentmorris · 2023-07-30T23:40:18Z

v0 release of a script to break images up into tiles, run MD on tiles, and stitch the results back together (run_tiled_inference.py). Useful when a user has Very Large Images and Very Small Animals.

agentmorris · 2023-08-12T02:40:53Z

Closed the issues re: merge_detections.py (a) not knowing how to merge at the level of individual detections, and (b) not having a command-line driver. Thanks, @atmorling!

agentmorris · 2023-08-19T17:39:40Z

Closed the issue re: saving datetime, image size, and EXIF metadata in run_detector_batch. Thanks, @atmorling!

agentmorris · 2023-08-23T22:33:14Z

Closed the issue re: running YOLOv5 val.py inference (run_inference_with_yolov5_val.py) on Windows, which was really just a "does this work?" issue. The answer for now is "yes, it works, but it requires admin privileges".

agentmorris · 2023-09-07T20:46:27Z

Closed and removed the issue re: sorting by confidence in postprocess_batch_results.py.

agentmorris · 2023-09-22T17:56:38Z

Closed issues related to making requirements.txt work, and therefore supporting newer versions of dependent packages, and therefore making the pip package work... better.

agentmorris · 2023-12-06T00:57:01Z

Closed the issue related to MD --> YOLO conversion; there's almost never a reason to do this directly, instead there are new scripts to convert MD results to COCO or labelme format, and scripts to convert to YOLO from COCO and labelme format, typically after editing and/or adding class information.

agentmorris · 2023-12-06T19:52:04Z

Closed:

Most scripts that render lots of images support (optional) parallelization using Python multiprocessing; visualize_detector_output.py is an exception. Support parallelization in visualize_detector_output.py, the same way we do in scripts like postprocess_batch_results.py.
run_detector_batch.py supports resuming from a checkpoint, but right now you have to manually find the checkpoint file, and provide it to the "--resume_from_checkpoint" option. Find the most recent checkpoint automatically if "--resume_from_checkpoint" is specified with no file.
Allow compare_batch_results.py to sort output by confidence (so it's easier to scan for misses).

agentmorris · 2023-12-26T00:37:48Z

Closed everything related to rebooting the classifier training page; the classification page has been updated, with all new training pointed to MEWC.

agentmorris · 2024-01-08T01:11:01Z

Closed issues related to:

Chaotic predictions on some Snapshot Serengeti data (explanation here).
Documentation for fine-tuning MegaDetector; I'm using this as a reference example and will update the MD User Guide accordingly.

agentmorris · 2024-01-15T18:30:19Z

Closed the issue related to creating N! category combinations in postprocess_batch_results. This has been handled now as well as it can be without making opinionated decisions about which combinations to show.

agentmorris · 2024-01-15T22:02:27Z

Closed:

Allow postprocess_batch_results.py to use different confidence thresholds for different categories.

agentmorris · 2024-01-31T00:15:37Z

Removed the item under "random models" about supporting vehicle classification with stock YOLOv5 and YOLOv8 models; this is supported now (and fun!).

agentmorris · 2024-05-10T00:02:45Z

Removing items related to cleanup in process_video.py

--

There are some TODO's in process_video.py related to cleaning up intermediate files; handle these. In the same script, there is some duplicated code across the functions used for handling a video vs. a folder of videos; refactor those functions to have common code paths for individual videos.

agentmorris added enhancement New feature or request good first issue Good for newcomers help wanted Extra attention is needed labels May 20, 2023

agentmorris changed the title ~~Meta-issue: list of open issues, random todo's, and half-baked ideas~~ Open issues and half-baked ideas Jun 16, 2023

agentmorris changed the title ~~Open issues and half-baked ideas~~ List of open issues Sep 19, 2023

agentmorris changed the title ~~List of open issues~~ List of possible enhancements Oct 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

List of possible enhancements #84

List of possible enhancements #84

agentmorris commented May 20, 2023 •

edited

agentmorris commented May 20, 2023

agentmorris commented May 20, 2023

agentmorris commented Jun 14, 2023 •

edited

agentmorris commented Jun 18, 2023 •

edited

agentmorris commented Jul 28, 2023 •

edited

agentmorris commented Jul 30, 2023

agentmorris commented Aug 12, 2023 •

edited

agentmorris commented Aug 19, 2023 •

edited

agentmorris commented Aug 23, 2023 •

edited

agentmorris commented Sep 7, 2023

agentmorris commented Sep 22, 2023 •

edited

agentmorris commented Dec 6, 2023

agentmorris commented Dec 6, 2023

agentmorris commented Dec 26, 2023 •

edited

agentmorris commented Jan 8, 2024 •

edited

agentmorris commented Jan 15, 2024

agentmorris commented Jan 15, 2024

agentmorris commented Jan 31, 2024

agentmorris commented May 10, 2024

List of possible enhancements #84

List of possible enhancements #84

Comments

agentmorris commented May 20, 2023 • edited

Table of contents

Feature additions for existing scripts/tools

Refactoring or re-writing stuff

Infrastructure things

Miscellaneous things that are more exploratory

Other projects that could use your help

Random models someone should train

Note to self: tags for this issue

agentmorris commented May 20, 2023

agentmorris commented May 20, 2023

agentmorris commented Jun 14, 2023 • edited

agentmorris commented Jun 18, 2023 • edited

agentmorris commented Jul 28, 2023 • edited

agentmorris commented Jul 30, 2023

agentmorris commented Aug 12, 2023 • edited

agentmorris commented Aug 19, 2023 • edited

agentmorris commented Aug 23, 2023 • edited

agentmorris commented Sep 7, 2023

agentmorris commented Sep 22, 2023 • edited

agentmorris commented Dec 6, 2023

agentmorris commented Dec 6, 2023

agentmorris commented Dec 26, 2023 • edited

agentmorris commented Jan 8, 2024 • edited

agentmorris commented Jan 15, 2024

agentmorris commented Jan 15, 2024

agentmorris commented Jan 31, 2024

agentmorris commented May 10, 2024

agentmorris commented May 20, 2023 •

edited

agentmorris commented Jun 14, 2023 •

edited

agentmorris commented Jun 18, 2023 •

edited

agentmorris commented Jul 28, 2023 •

edited

agentmorris commented Aug 12, 2023 •

edited

agentmorris commented Aug 19, 2023 •

edited

agentmorris commented Aug 23, 2023 •

edited

agentmorris commented Sep 22, 2023 •

edited

agentmorris commented Dec 26, 2023 •

edited

agentmorris commented Jan 8, 2024 •

edited