To do list from beta pilot (2022-05-25-aucenter-online) #142

Open
36 of 50 tasks
ErinBecker opened this issue Jul 13, 2022 · 1 comment

ErinBecker commented Jul 13, 2022

Below is a summary of the 20 pages of notes the workshop observers took at the first beta pilot workshop teaching this lesson (2022-05-25-aucenter-online). Issues are listed by episode and are described in an actionable way wherever possible. In a few cases, more discussion may be needed within the developer team before an action can be decided.

Follow-up activity on the list below will be split among the lesson developers. To claim one or more of the items, place your name in bold at the start of the item. After you have created a PR for the item, add a link to your PR at the end of the item. Check off an item once its PR has been merged.


Introduction to the example dataset and file type

Introducing R and RStudio IDE

R Basics

R basics continued - factors and data frames

  • Add callout explaining why ?read.csv opens read.table help file #149
  • Improve the explanation of the difference between “base R” and “tidyverse” in a way that limits learner cognitive overhead. This explanation may need to come earlier in the lesson.
  • At the end of “importing tabular data into R”, include a table that describes what each column name means
  • Change the note “put the first three columns of variants into a new data frame called subset” so that it matches the code chunk, which also includes column 6
  • Explain the order of columns and rows in two-dimensional subsetting
  • str(subset) includes integer data, which hasn’t been introduced before. Explain this in the text.
  • Instead of introducing snps <- c(alt_alleles . . . ), break this into smaller demos. First show alt_alleles[alt_alleles=="A"], then show a few variations before stringing them together within c().
  • It might be good to show str() and summary() on a character vector alongside str(factor_snps) and summary(factor_snps), to reinforce the difference between factors and character vectors.
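
One possible way to sequence the smaller demos suggested in the items above is sketched below. The vector values and the toy data frame are invented for illustration and are not the lesson's data; only alt_alleles, snps, and factor_snps are names taken from the items above.

```r
# Hypothetical stand-in for the lesson's alt_alleles vector (made-up values)
alt_alleles <- c("A", "T", "G", "ACAG", "C", "A", "GT", "T")

# Demo 1: a logical test marks which elements equal "A"
alt_alleles == "A"

# Demo 2: using that test as an index keeps only the matching elements
a_only <- alt_alleles[alt_alleles == "A"]

# Demo 3: the same pattern with a different base
t_only <- alt_alleles[alt_alleles == "T"]

# Demo 4: only now combine the pieces with c()
snps <- c(a_only, t_only,
          alt_alleles[alt_alleles == "G"],
          alt_alleles[alt_alleles == "C"])

# Character vector vs. factor: compare str() and summary() on each
factor_snps <- factor(snps)
str(snps)
str(factor_snps)
summary(snps)
summary(factor_snps)

# Two-dimensional subsetting is [rows, columns].
# toy is a hypothetical data frame; POS is stored as integer (note the L suffix).
toy <- data.frame(sample_id = c("s1", "s2", "s3"),
                  POS = c(1521L, 1612L, 9092L),
                  ALT = c("A", "T", "G"),
                  stringsAsFactors = FALSE)
toy[2, 3]          # row 2, column 3
toy[, c(1, 3)]     # all rows, columns 1 and 3
str(toy)           # shows chr and int column types
```

Showing each intermediate result before the combined c() call keeps every step to a single new idea, which is the intent of the item above.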

Using packages from Bioconductor

Data wrangling and analysis with tidyverse

Data visualization with ggplot2

  • Remove the period in the facet_grid example, as it hasn’t been explained (but see PR “Minor edits” #138).

naupaka pinned this issue Sep 7, 2022
poterlowicz-lab added a commit that referenced this issue Jan 11, 2023
poterlowicz-lab added a commit that referenced this issue Jan 11, 2023
poterlowicz-lab added a commit that referenced this issue Jan 11, 2023
naupaka reopened this Jan 11, 2023
naupaka assigned naupaka and unassigned naupaka Jan 11, 2023
zkamvar pushed a commit that referenced this issue May 1, 2023
zkamvar pushed a commit that referenced this issue May 1, 2023
zkamvar pushed a commit that referenced this issue May 1, 2023
zkamvar pushed a commit that referenced this issue May 1, 2023

ytakemon commented May 10, 2023

  • Following our discussion today, we will remove the Knitr lesson to ensure that we stay within time, and will instead point to other resources (e.g., R for reproducible research).
