Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Features of priority for the next release #14

Open
burtonrj opened this issue Nov 13, 2020 · 1 comment
Open

Features of priority for the next release #14

burtonrj opened this issue Nov 13, 2020 · 1 comment
Labels
enhancement New feature or request

Comments

@burtonrj
Copy link
Owner

burtonrj commented Nov 13, 2020

  • User should be able to load a CSV file or pass a pandas dataframe to add_new_sample method of Experiment when populating an experiment with data
  • Often the optimal parameters for a clustering algorithm used to 'gate' single cell data in two-dimensions differ from one sample to the next, it would be great to provide the option to perform hyperparameter search - parameters chosen that minimise the Hausdorff distance between newly clustered populations and the populations originally defined
  • Improve coverage of unit tests
  • Generate a new class structure similar to CellClassifier but inspired by https://www.pnas.org/content/117/35/21373 - should be able to merge the data from multiple subjects using create_ref_sample, label single cells with their origin, and train a classifier to predict the origin of a single cell (i.e. did it originate from a diseased individual), then inspect the model for populations that contribute to that prediction
  • Port FlowAI for cleaning data prior to entry with add_new_sample
@burtonrj burtonrj added the enhancement New feature or request label Nov 13, 2020
@burtonrj
Copy link
Owner Author

burtonrj commented Apr 12, 2021

  1. Reading csv files has been addressed read in csv (or pandas DataFrame) #13 in release 2.0
  2. Optimising parameters with parameter search for auto gates is provided in release 2.0

Short term priorities:

  1. Increase test coverage
  2. Implement a multi-instance learning approach similar to CellCNN and @hzc363 https://www.pnas.org/content/117/35/21373

Long term priorities:

  1. Create a lightweight clone of CytoPy that swaps out mongoengine for PeeWee ORM, granting the use of SQLite for those that cannot host a MongoDB service on their local machine or on Mongo Atlas
  2. Graphical user interface deployed with Electron JS to expose CytoPy to scientists without training in Python

Porting FlowAI is unnecessary and would add technical debt, therefore it won't be included in any future release

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant