Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Minor improvements #5

Open
byee4 opened this issue Jan 19, 2023 · 0 comments
Open

Minor improvements #5

byee4 opened this issue Jan 19, 2023 · 0 comments

Comments

@byee4
Copy link
Member

byee4 commented Jan 19, 2023

  • Capitalize 'Snakefile' as this is the default file that Snakemake looks for
  • Add 'xmltodict' package (see ModuleNotFoundError in piechart.py environment #4)
  • Remove hardcoded "/oasis/tscc/scratch/eczhang/larp6/larp6_GRCh38/results/" from Snakefile and add to config.yaml
  • In config.yaml, let's actually use the bam file instead as inputs, and modify Snakefile accordingly (eg.
SAMPLES:
  - /oasis/tscc/scratch/eczhang/larp6/larp6_GRCh38/results/LARP6.CTRL_IN1.umi.r1.fq.genome-mapped.bam
  - /oasis/tscc/scratch/eczhang/larp6/larp6_GRCh38/results/LARP6.CTRL_IN2.umi.r1.fq.genome-mapped.bam
  - ...
  • If we can, let's add another step: try using this database (this is the 'protein' version of the nonredundant database): /projects/ps-yeolab3/bay001/annotations/nr/nr. The idea is that instead of telling us just which species has the best alignment, perhaps we can tell which protein our reads are aligning to. The command and environment will be similar, except instead of "blastn" we will want to use "blastx" along with this new database.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant