compseq: add page #12713

kamurani · 2024-05-02T08:16:54Z

The page(s) are in the correct platform directories: common, linux, osx, windows, sunos, android, etc.
The page(s) have at most 8 examples.
The page description(s) have links to documentation or a homepage.
The page(s) follow the content guidelines.
The PR title conforms to the recommended templates.
Version of the command being documented (if known): EMBOSS:6.6.0.0

CLAassistant · 2024-05-02T08:16:59Z

All committers have signed the CLA.

kamurani · 2024-05-02T08:20:11Z

Alternative documentation link (to identical webpage content):

https://emboss.sourceforge.net/apps/cvs/emboss/apps/compseq.html


Magrid0

Looks good to me, thanks for your contribution.
The only thing is that there are perhaps a few too many examples, but there are no more than 8, so it's fine.

sebastiaanspeck · 2024-05-03T00:53:13Z

pages/linux/compseq.md

+
+- Count observed frequencies of words in a FASTA file, providing parameter values with interactive prompt:
+
+`compseq {{example.fasta}}`


Suggested change

`compseq {{example.fasta}}`

`compseq {{path/to/file.fasta}}`

sebastiaanspeck · 2024-05-03T00:54:15Z

pages/linux/compseq.md

+
+- Count observed frequencies of amino acid pairs from a FASTA file, save output to a text file:
+
+`compseq {{example_protein.fasta}} -word 2 {{result1.comp}}`


Suggested change

`compseq {{example_protein.fasta}} -word 2 {{result1.comp}}`

`compseq {{path/to/input_file.fasta}} -word 2 {{path/to/output_file.comp}}`

sebastiaanspeck · 2024-05-03T00:54:41Z

pages/linux/compseq.md

+
+- Count observed frequencies of hexanucleotides from a FASTA file, save output to a text file and ignore zero counts:
+
+`compseq {{example_dna.fasta}} -word 6 {{result2.comp}} -nozero`


Suggested change

`compseq {{example_dna.fasta}} -word 6 {{result2.comp}} -nozero`

`compseq {{path/to/input_file.fasta}} -word 6 {{path/to/output_file.comp}} -nozero`

sebastiaanspeck · 2024-05-03T00:55:12Z

pages/linux/compseq.md

+
+- Count observed frequencies of codons in a particular reading frame; ignoring any overlapping counts (i.e. move window across by word-length 3):
+
+`compseq -sequence {{example_rna.fasta}} -word 3 {{result3.comp}} -nozero -frame {{1}}`


Suggested change

`compseq -sequence {{example_rna.fasta}} -word 3 {{result3.comp}} -nozero -frame {{1}}`

`compseq -sequence {{path/to/input_file.fasta}} -word 3 {{path/to/output_file.comp}} -nozero -frame {{1}}`
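
To make the difference between overlapping and per-frame counting concrete, here is a minimal Python sketch. It is not compseq's implementation: the function name `count_words` and the sample sequence are hypothetical, and the reading of `-frame N` as "start at 1-based position N, then advance by the word length" is an assumption that should be checked against the EMBOSS documentation.

```python
from collections import Counter


def count_words(seq: str, word: int, frame: int = 0) -> Counter:
    """Count words of length `word` in `seq`.

    frame == 0: slide the window one position at a time (overlapping words).
    frame >= 1: ASSUMPTION: start at 1-based position `frame` and advance
                by `word`, so the counted words never overlap.
    """
    if frame == 0:
        start, step = 0, 1
    else:
        start, step = frame - 1, word
    counts = Counter()
    for i in range(start, len(seq) - word + 1, step):
        counts[seq[i:i + word]] += 1
    return counts


rna = "AUGGCCAUGGCC"  # hypothetical sequence, for illustration only
overlapping = count_words(rna, 3)       # every 3-letter window
frame1 = count_words(rna, 3, frame=1)   # codons in reading frame 1 only
print(dict(overlapping))
print(dict(frame1))
```

With the default window, words such as `UGG` that straddle two codons are counted; in the per-frame mode only the codons read from the chosen starting position are counted.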

sebastiaanspeck · 2024-05-03T00:55:34Z

pages/linux/compseq.md

+
+- Count observed frequencies of codons frame-shifted by 3 positions; ignoring any overlapping counts (should report all codons except the first one):
+
+`compseq -sequence {{example_rna.fasta}} -word 3 {{result4.comp}} -nozero -frame 3`


Suggested change

`compseq -sequence {{example_rna.fasta}} -word 3 {{result4.comp}} -nozero -frame 3`

`compseq -sequence {{path/to/input_file.fasta}} -word 3 {{path/to/output_file.comp}} -nozero -frame 3`

sebastiaanspeck · 2024-05-03T00:56:34Z

pages/linux/compseq.md

+
+- Count amino acid triplets in a FASTA file and compare to a previous run of `compseq` to calculate expected and normalised frequency values:
+
+`compseq -sequence {{human_proteome.fasta}} -word 3 {{result5.comp}} -nozero -infile {{prev.comp}}`


Suggested change

`compseq -sequence {{human_proteome.fasta}} -word 3 {{result5.comp}} -nozero -infile {{prev.comp}}`

`compseq -sequence {{path/to/input_file.fasta}} -word 3 {{path/to/output_file1.comp}} -nozero -infile {{path/to/output_file2.comp}}`

sebastiaanspeck · 2024-05-03T00:56:54Z

pages/linux/compseq.md

+
+- Approximate the above command without a previously prepared file, by calculating expected frequencies using the single base/residue frequencies in the supplied input sequence(s):
+
+`compseq -sequence {{human_proteome.fasta}} -word 3 {{result6.comp}} -nozero -calcfreq`


Suggested change

`compseq -sequence {{human_proteome.fasta}} -word 3 {{result6.comp}} -nozero -calcfreq`

`compseq -sequence {{path/to/input_file.fasta}} -word 3 {{path/to/output_file.comp}} -nozero -calcfreq`
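
The idea behind `-calcfreq`, as the example description puts it, is to derive expected word frequencies from the single base/residue frequencies of the input. A rough Python sketch of that calculation follows. The helper name `expected_vs_observed` and the toy sequence are hypothetical, and treating the expected frequency as the plain product of the residue frequencies is an assumption about the method, not a statement of compseq's exact arithmetic.

```python
from collections import Counter
from math import prod


def expected_vs_observed(seq: str, word: int) -> dict:
    """Return {word: (observed_freq, expected_freq)} for every observed word.

    ASSUMPTION: expected_freq is the product of the single-residue
    frequencies of the letters in the word.
    """
    total_residues = len(seq)
    residue_freq = {r: n / total_residues for r, n in Counter(seq).items()}

    # Overlapping windows of length `word`, advancing one position at a time.
    word_counts = Counter(seq[i:i + word] for i in range(len(seq) - word + 1))
    total_words = sum(word_counts.values())

    return {
        w: (n / total_words, prod(residue_freq[c] for c in w))
        for w, n in word_counts.items()
    }


table = expected_vs_observed("AACA", 2)  # toy sequence, for illustration only
for w, (obs, exp) in sorted(table.items()):
    print(w, round(obs, 4), round(exp, 4), round(obs / exp, 4))
```

The last printed column is the observed/expected ratio, which corresponds to the normalised value mentioned in the `-infile` example above.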

Thanks for your suggestions, @sebastiaanspeck. I agree that showing that it accepts a path is clearer.

However, would it still be worth keeping distinct example filenames (rather than always using path/to/input_file.fasta)? For example, I want to highlight that the program can be used equivalently for amino acid and nucleotide sequences.

If that clarifies the example, that is a good thing to do.

@sebastiaanspeck If possible, can you update your suggestions to use the example names as the author suggests?

sebastiaanspeck

LGTM, after the suggestions are applied

kbdharun

LGTM, Thanks for your contribution.

sebastiaanspeck

LGTM, after the suggestions are applied

kamurani · 2024-05-15T06:31:46Z

@sebastiaanspeck I added your suggestions, but modified them to retain information such as the sequence type (amino acid / nucleotide) in the input filename, so that they match the examples' descriptions.

kamurani added 3 commits May 2, 2024 17:55

compseq: add page

7359c96

remove version from command description

0e37934

fix colon at end of command desc; remove trailing whitespace

c7a34e3

kamurani requested a review from cyqsimon as a code owner May 2, 2024 08:16

github-actions bot added the "new command" label (issues requesting creation of a new page) May 2, 2024

clarity; remove word-length placeholder syntax when a value is specified in the example description

4f4820f

Magrid0 approved these changes May 2, 2024

sebastiaanspeck requested changes May 3, 2024

Merge branch 'main' into compseq

2a5a251

kamurani requested a review from sebastiaanspeck May 8, 2024 08:07

sebastiaanspeck approved these changes May 8, 2024

kbdharun approved these changes May 9, 2024

sebastiaanspeck requested changes May 11, 2024

kamurani and others added 2 commits May 15, 2024 16:28

add (modified) suggested changes to include path/to/input_* for clarity

75ead44

Merge branch 'main' into compseq

34eeb7a

sebastiaanspeck approved these changes May 15, 2024

sebastiaanspeck merged commit 60254aa into tldr-pages:main May 15, 2024
4 checks passed


