
Outstanding issues not specific to any tips #252

Open
SiminaB opened this issue Oct 7, 2020 · 10 comments

SiminaB commented Oct 7, 2020

This is to discuss any issues that we may think are not currently adequately covered. If they relate to specific tips, use #242 #243 #244 #245 #246 #247 #248 #249 #250 #251


SiminaB commented Oct 7, 2020

In re-reading this, I thought of two issues that we may want to cover. At minimum, I think many people reading this paper will expect them to be covered. They could be included in the Intro or Conclusion, or as part of existing tips:

  1. How does one go about fitting these models, and is special software always required? We can at least give some good references for how to do this and note the main packages and computational requirements. I know this isn't a "getting started with DL" paper, but we can still spend 2-3 sentences on it.
  2. Can DL be inadvertently used to perpetuate existing stereotypes, e.g., racist and sexist ones? We know this can happen either because of the training set (e.g., the training set consists exclusively of individuals of European descent, then the model is used on a more diverse population) or because the predictions are incorrectly interpreted due to confounding (e.g., the training set has doctors and nurses, most doctors are men and most nurses are women, so going forward gender is explicitly or implicitly given an outsized role in predicting career choice). The paper focuses on biology, so perhaps one good example would be the performance of face recognition approaches on individuals of European vs. non-European descent.

Benjamin-Lee (Owner) commented:

Some thoughts in response:

  1. We should mention these, as well as auto-ML tools like TPOT.
  2. DL fairness should probably be mentioned in the interpretation or privacy tips. Which place do you think is better?


SiminaB commented Oct 8, 2020

We could change Tip 10 to be about ethics I guess? That way both fairness and privacy would fit.

Benjamin-Lee (Owner) commented:

@SiminaB I just addressed your first point in the PR for #241. Specifically, I mentioned TF and PyTorch as well as Keras, AutoKeras, Turi Create, and TPOT. If there are any other tools you think are worth mentioning, do let me know.


SiminaB commented Oct 11, 2020

Looks good! One question as someone who doesn't use DL in research: can you actually run meaningful DL models on a laptop? The implication is that it would be hard to do so, e.g., in:

In contrast, traditional ML training can often be done on a laptop (or even a $5 computer [@arXiv:1809.00238]) in seconds to minutes.

Benjamin-Lee (Owner) commented:

It's doable in some cases but not really ideal. In my experience, I've always ended up having to use a cloud machine for training all but the simplest models. I've never done transfer learning so I can't comment on whether that brings things down to consumer-grade laptop level. @rasbt probably knows more than I do about that.
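To make the laptop question concrete, here is a minimal sketch (not from the thread, and in plain NumPy rather than a DL framework): a tiny two-layer network on synthetic data trains in well under a second on any laptop. What pushes real training onto GPUs or cloud machines is the scale of the model and data, not anything qualitatively different in the computation.

```python
import numpy as np

# Tiny two-layer network on a synthetic task: predict whether the
# product of two inputs is positive. Trains in well under a second.
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(200, 2))
y = ((X[:, 0] * X[:, 1]) > 0).astype(float).reshape(-1, 1)

# Parameters: 2 inputs -> 8 hidden units -> 1 output.
W1 = rng.normal(0, 0.5, (2, 8)); b1 = np.zeros(8)
W2 = rng.normal(0, 0.5, (8, 1)); b2 = np.zeros(1)
lr = 0.5

for _ in range(2000):
    # Forward pass: tanh hidden layer, sigmoid output.
    h = np.tanh(X @ W1 + b1)
    p = 1 / (1 + np.exp(-(h @ W2 + b2)))
    # Backward pass: gradients of binary cross-entropy loss.
    grad_out = (p - y) / len(X)
    gW2 = h.T @ grad_out; gb2 = grad_out.sum(0)
    grad_h = (grad_out @ W2.T) * (1 - h ** 2)
    gW1 = X.T @ grad_h; gb1 = grad_h.sum(0)
    # Full-batch gradient descent update.
    W1 -= lr * gW1; b1 -= lr * gb1
    W2 -= lr * gW2; b2 -= lr * gb2

acc = ((p > 0.5) == y).mean()
print(f"training accuracy: {acc:.2f}")
```

The same loop with millions of parameters and examples is what makes frameworks, GPUs, and cloud machines necessary in practice.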


SiminaB commented Oct 11, 2020

I think it would be helpful to clarify this as it would help inform someone whether they can actually do DL. If it is appropriate to their problem but not really doable on their device, of course they can look into using the cloud or initiating a collaboration.

Benjamin-Lee (Owner) commented:

Definitely a good idea to speak affirmatively to what DL needs.


agitter commented Jan 26, 2021

I'm copying my comment from #313 (comment) here so we don't lose track of it.

  • There is a lot of existing guidance about best practices for machine learning and deep learning that we do not reference
  • The examples we provide in the intro and elsewhere are fairly arbitrary, and not necessarily representative nor the most impressive applications
  • Some tips still have no biology examples
  • Second person is not used consistently (Second person or third person? #237)
  • Some tips (e.g. 4) aren't very specific to deep learning
  • There is some redundancy across tips

These are all minor enough to address after the initial submission.

Benjamin-Lee (Owner) commented:

Thank you for adding it here and glad to see nothing else is blocking. I'll work on #237 once we do the content freeze since that is cosmetic.
