quality filter: imbalance forward vs reverse reads - dada2- #1948

valengirardi · 2024-05-07T18:51:08Z

Hello! I'm performing quality filtering on my sequences due to low quality in the reverse reads. However, I've encountered an imbalance where I have fewer reverse reads compared to forward reads, making it impossible to proceed with DADA2. Can you advise me on how to address this issue?

This is the filtering code:
for file in "${INPUT_DIR}"/*.fastq; do
    base_name=$(basename "${file}" .fastq)
    bbduk.sh in="${file}" out="${OUTPUT_DIR}/${base_name}.fastq" qtrim=rl trimq=10
    echo "File ${file} processed."
done

Additionally, I've attempted to apply separate quality filters for the forward and reverse reads, but I'm unable to achieve a balanced number of sequences in both:

for file in "${INPUT_DIR}"/*.fastq; do
    base_name=$(basename "${file}" .fastq)
    # base_name no longer carries the .fastq suffix, so match on the trailing _2
    if [[ $base_name == *_2 ]]; then
        bbduk.sh in="${file}" out="${OUTPUT_DIR}/${base_name}.fastq" qtrim=rl trimq=10
        echo "File ${file} processed."
    else
        bbduk.sh in="${file}" out="${OUTPUT_DIR}/${base_name}.fastq" qtrim=rl trimq=15
        echo "File ${file} processed."
    fi
done

benjjneb · 2024-05-07T19:14:59Z

I'd recommend using filterAndTrim to jointly filter your forward and reverse reads (i.e. each read pair is evaluated together, and then either filtered out or kept as a pair). This maintains the matching between forward and reverse reads that is lost when you filter those files separately. The dada2 tutorial shows an example of this for Illumina paired-end data.
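To see whether a pair of files has drifted out of sync after separate filtering, a quick read count can help. The sketch below is only an illustration: `count_reads` and `check_pair` are hypothetical helper names, and it assumes uncompressed FASTQ files with exactly four lines per record and a trailing newline.

```shell
#!/usr/bin/env bash
# Count reads in an uncompressed FASTQ file.
# Assumption: four lines per record (no wrapped sequence lines).
count_reads() {
    local lines
    lines=$(wc -l < "$1")
    echo $(( lines / 4 ))
}

# Compare read counts for one forward/reverse pair.
# Returns 0 when the counts match, 1 otherwise.
check_pair() {
    local fwd=$1 rev=$2
    local n_fwd n_rev
    n_fwd=$(count_reads "$fwd")
    n_rev=$(count_reads "$rev")
    if [[ $n_fwd -eq $n_rev ]]; then
        echo "balanced: ${n_fwd} read pairs"
    else
        echo "unbalanced: ${n_fwd} forward vs ${n_rev} reverse"
        return 1
    fi
}
```

For example, `check_pair "${OUTPUT_DIR}/sample_1.fastq" "${OUTPUT_DIR}/sample_2.fastq"` (file names assumed) reports the two counts. Equal counts are necessary but not sufficient: the reads must also appear in the same order in both files, which is what joint filtering preserves.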

valengirardi · 2024-05-09T12:53:38Z

Hi! Thank you for your response. I usually run this kind of thing in the terminal, not in R. How can I adapt filterAndTrim to this situation? I tried bbduk, cutadapt, and Trimmomatic.

benjjneb · 2024-05-14T15:47:23Z

The dada2 tutorial gives a worked example of inspecting the quality profile and then running filterAndTrim on paired-end Illumina reads: https://benjjneb.github.io/dada2/tutorial.html

If you are going to use dada2 for the later steps anyway, then just run the filterAndTrim part in R the same way you would run the later parts of the workflow.
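For anyone who prefers to stay in the terminal, one possible approach is to wrap the filterAndTrim call in a small R script and run it with Rscript. This is a minimal sketch, not the tutorial's own code: `write_filter_script` and the file name `filter_pair.R` are hypothetical, and the `truncLen` and `maxEE` values are placeholders that should be chosen from your own quality profiles.

```shell
#!/usr/bin/env bash
# Write an R script that jointly filters one forward/reverse pair.
# Assumption: dada2 is installed in the R library that Rscript uses.
write_filter_script() {
    cat > "$1" <<'EOF'
library(dada2)
# Expected arguments: fwd_in rev_in fwd_out rev_out
args <- commandArgs(trailingOnly = TRUE)
# Placeholder parameters; pick truncLen from your own quality profiles.
out <- filterAndTrim(fwd = args[1], filt = args[3],
                     rev = args[2], filt.rev = args[4],
                     truncLen = c(240, 160), maxEE = c(2, 2),
                     truncQ = 2, maxN = 0, rm.phix = TRUE,
                     compress = TRUE, multithread = TRUE)
print(out)  # reads.in and reads.out for the pair
EOF
}
```

After `write_filter_script filter_pair.R`, a pair could be filtered with `Rscript filter_pair.R sample_1.fastq sample_2.fastq filtered/sample_1.fastq.gz filtered/sample_2.fastq.gz` (sample names assumed). Because both files go through a single call, each read pair is kept or dropped together, so the outputs stay matched.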
