Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reconsider rgsm value in fastq list rows #606

Open
alexiswl opened this issue Jul 13, 2023 · 13 comments
Open

Reconsider rgsm value in fastq list rows #606

alexiswl opened this issue Jul 13, 2023 · 13 comments
Assignees
Labels
change request triage & trello Triage and migrate into respective Trello project

Comments

@alexiswl
Copy link
Member

As highlighted by Stephen before running the validation samples, there exists some cases where the tumor sample id and the normal sample id are the same value.

See Subject SBJ00480 as an example:

image

This is a bit of a problem.

Some proposed solutions (to be discussed on Tuesday)

Note change of rgsm values wont impact file names as these are determined by the output-prefix parameter for dragen runs.
However it will impact the values of sample names in vcfs and bams.

  1. Replace rgsm and use the library id instead.

    The tumor and normal names in vcfs and bams would instead be the library id.

  2. Append _N or _T based on phenotype to the RGSM

    This would mean that vcfs and bams would have similar names as existing

  3. Use sample name + library id

    Would help if users are relying on the sample name in vcfs and bams for tumor and normal samples.

  4. Nothing at all and ensure that sample names differ between tumors and normals for a given subject.

@alexiswl alexiswl self-assigned this Jul 13, 2023
@ohofmann
Copy link
Member

Ouch. I can see cases where the same 'sample' was used to generate both tumor and normal DNA (in the sense of a tissue specimen) although in this particular case I would argue that the various mixes should have different sample identifiers - they are distinct mixes of two other samples. I.e., for this specific case I'd change names upstream in the tracking sheet, demultiplex again and process from there.

If we want to future-proof things option 3 feels best to me; the _N/_T suffix will run into issues at some point, but just the library ID could also work.

@alexiswl
Copy link
Member Author

I.e., for this specific case I'd change names upstream in the tracking sheet, demultiplex again and process from there.

Given there were other samples on this run, we were considering just updating the labmetadata and the fastq list row rgsm values for these entries. Saves a demultiplex.

@ohofmann
Copy link
Member

Demultiplexing is stable, I don't mind re-generating all FASTQs if it saves some extra work...

@victorskl
Copy link
Member

for this specific case I'd change names upstream in the tracking sheet, demultiplex again and process from there.

I would also recommend this and re-convert affected samples. I can help with data cleanup and, re-run SOP at Portal side, if any.

As I am aware, these samples are for validation lot. So, it has future value on re-running on them again to validate analysis pipeline changes, etc.

So, it worth the effort to make it right at best as much, I reckon.

@alexiswl
Copy link
Member Author

Also worth noting, as Florian pointed out yesterday, we may come across a situation where the tumor and normal having the same sample id is actually appropriate (same tube, and some magical process separates the too - I don't know what this process is called). So it is inevitable we will have to move away from the current set up at some point.

If you're happy to rerun the demux Victor and manage that then happy, thought it would be easier for you if we just updated the fastq-list-rows

@victorskl
Copy link
Member

Let me access the situation and impact a bit further, pls...

@reisingerf
Copy link
Member

If we want to future-proof things option 3 feels best to me

I (strongly) vote for number 1.

@ohofmann
Copy link
Member

Note that we'd still be running samples in T/N mode with the same library ID at some point (maybe not automatically, but the overall workflow should not fail if we run the same library against its copy).

@reisingerf
Copy link
Member

against its copy

Huh? A library against itself?

I can't see a reason why the orchestration should have an issue with that, but how would we distinguish between them in workflow results (not sure if all tools would support that)?

We could possibly work out a naming convention, e.g. a second (virtual) library with a different ID pointing to the same FASTQs that could be used as the copy?

@ohofmann
Copy link
Member

Part of the accreditation process - sequence the same library twice at different coverage levels and analyse, anything called is a false positive. Might ignore that for the portal and co.

@alexiswl
Copy link
Member Author

alexiswl commented Jul 19, 2023

After discussing with Victor, I don't think demultiplexing is the right way forward, for this sample alone we have 204 'libraryrun' entries

SELECT sequencerun.instrument_run_id, limsrow_metadata.phenotype, fastqlistrow.*
FROM silver.limsrow_metadata
INNER JOIN portal.libraryrun
  ON limsrow_metadata.library_id = libraryrun.library_id
INNER JOIN portal.sequencerun
  ON sequencerun.instrument_run_id = libraryrun.instrument_run_id
INNER JOIN portal.fastqlistrow 
  ON libraryrun.library_id = fastqlistrow.rglb
  AND sequencerun.id = fastqlistrow.sequence_run_id
  AND libraryrun.lane = fastqlistrow.lane
WHERE 
  limsrow_metadata.subject_id = 'SBJ00480'
AND
  fastqlistrow.rgsm = 'PTC_TsqN200511'
ORDER BY sequencerun.instrument_run_id

Gives us the following:

Output
instrument_run_id	phenotype	id	rgid	rgsm	rglb	lane	read_1	read_2	sequence_run_id	partition_name
200605_A01052_0016_AH5LYNDSXY	tumor	6278	ACTAAGAT.AACCGCGG.4.200605_A01052_0016_AH5LYNDSXY.PTC_TsqN200511_L2000431	PTC_TsqN200511	L2000431	4	gds://production/primary_data/200605_A01052_0016_AH5LYNDSXY/2022031722413b7c/WGS_TsqNano/PTC_TsqN200511_L2000431_S8_L004_R1_001.fastq.gz	gds://production/primary_data/200605_A01052_0016_AH5LYNDSXY/2022031722413b7c/WGS_TsqNano/PTC_TsqN200511_L2000431_S8_L004_R2_001.fastq.gz	1025	*
200605_A01052_0016_AH5LYNDSXY	normal	6279	GGCATTCT.CTAGCTTG.4.200605_A01052_0016_AH5LYNDSXY.PTC_TsqN200511_L2000437	PTC_TsqN200511	L2000437	4	gds://production/primary_data/200605_A01052_0016_AH5LYNDSXY/2022031722413b7c/WGS_TsqNano/PTC_TsqN200511_L2000437_S9_L004_R1_001.fastq.gz	gds://production/primary_data/200605_A01052_0016_AH5LYNDSXY/2022031722413b7c/WGS_TsqNano/PTC_TsqN200511_L2000437_S9_L004_R2_001.fastq.gz	1025	*
200612_A01052_0017_BH5LYWDSXY	tumor	6056	TCCAACGC.TTGGACTT.4.200612_A01052_0017_BH5LYWDSXY.PTC_TsqN200511_L2000434	PTC_TsqN200511	L2000434	4	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000434_S4_L004_R1_001.fastq.gz	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000434_S4_L004_R2_001.fastq.gz	1023	*
200612_A01052_0017_BH5LYWDSXY	tumor	6052	CTTGGTAT.CCAAGTCC.3.200612_A01052_0017_BH5LYWDSXY.PTC_TsqN200511_L2000433	PTC_TsqN200511	L2000433	3	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000433_S2_L003_R1_001.fastq.gz	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000433_S2_L003_R2_001.fastq.gz	1023	*
200612_A01052_0017_BH5LYWDSXY	tumor	6058	TTACAGGA.TGACAAGC.4.200612_A01052_0017_BH5LYWDSXY.PTC_TsqN200511_L2000436	PTC_TsqN200511	L2000436	4	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000436_S6_L004_R1_001.fastq.gz	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000436_S6_L004_R2_001.fastq.gz	1023	*
200612_A01052_0017_BH5LYWDSXY	tumor	6051	GTCGGAGC.GGTTATAA.3.200612_A01052_0017_BH5LYWDSXY.PTC_TsqN200511_L2000432	PTC_TsqN200511	L2000432	3	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000432_S1_L003_R1_001.fastq.gz	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000432_S1_L003_R2_001.fastq.gz	1023	*
200612_A01052_0017_BH5LYWDSXY	tumor	6057	CCGTGAAG.CAGTGGAT.4.200612_A01052_0017_BH5LYWDSXY.PTC_TsqN200511_L2000435	PTC_TsqN200511	L2000435	4	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000435_S5_L004_R1_001.fastq.gz	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000435_S5_L004_R2_001.fastq.gz	1023	*
200612_A01052_0017_BH5LYWDSXY	tumor	6053	ACTAAGAT.AACCGCGG.3.200612_A01052_0017_BH5LYWDSXY.PTC_TsqN200511_L2000431_topup	PTC_TsqN200511	L2000431	3	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000431_topup_S3_L003_R1_001.fastq.gz	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000431_topup_S3_L003_R2_001.fastq.gz	1023	*
200619_A01052_0018_AH5LYGDSXY	tumor	6004	GTCGGAGC.GGTTATAA.2.200619_A01052_0018_AH5LYGDSXY.PTC_TsqN200511_L2000432_topup	PTC_TsqN200511	L2000432	2	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L002_R1_001.fastq.gz	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L002_R2_001.fastq.gz	1024	*
200619_A01052_0018_AH5LYGDSXY	tumor	6017	GTCGGAGC.GGTTATAA.3.200619_A01052_0018_AH5LYGDSXY.PTC_TsqN200511_L2000432_topup	PTC_TsqN200511	L2000432	3	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L003_R1_001.fastq.gz	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L003_R2_001.fastq.gz	1024	*
200619_A01052_0018_AH5LYGDSXY	tumor	6030	GTCGGAGC.GGTTATAA.4.200619_A01052_0018_AH5LYGDSXY.PTC_TsqN200511_L2000432_topup	PTC_TsqN200511	L2000432	4	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L004_R1_001.fastq.gz	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L004_R2_001.fastq.gz	1024	*
200619_A01052_0018_AH5LYGDSXY	tumor	5991	GTCGGAGC.GGTTATAA.1.200619_A01052_0018_AH5LYGDSXY.PTC_TsqN200511_L2000432_topup	PTC_TsqN200511	L2000432	1	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L001_R1_001.fastq.gz	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L001_R2_001.fastq.gz	1024	*
200624_A01052_0019_BH5L7WDSXY	tumor	6137	GTCGGAGC.GGTTATAA.2.200624_A01052_0019_BH5L7WDSXY.PTC_TsqN200511_L2000432_topup2	PTC_TsqN200511	L2000432	2	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L002_R1_001.fastq.gz	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L002_R2_001.fastq.gz	1026	*
200624_A01052_0019_BH5L7WDSXY	tumor	6163	GTCGGAGC.GGTTATAA.4.200624_A01052_0019_BH5L7WDSXY.PTC_TsqN200511_L2000432_topup2	PTC_TsqN200511	L2000432	4	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L004_R1_001.fastq.gz	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L004_R2_001.fastq.gz	1026	*
200624_A01052_0019_BH5L7WDSXY	tumor	6124	GTCGGAGC.GGTTATAA.1.200624_A01052_0019_BH5L7WDSXY.PTC_TsqN200511_L2000432_topup2	PTC_TsqN200511	L2000432	1	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L001_R1_001.fastq.gz	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L001_R2_001.fastq.gz	1026	*
200624_A01052_0019_BH5L7WDSXY	tumor	6150	GTCGGAGC.GGTTATAA.3.200624_A01052_0019_BH5L7WDSXY.PTC_TsqN200511_L2000432_topup2	PTC_TsqN200511	L2000432	3	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L003_R1_001.fastq.gz	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L003_R2_001.fastq.gz	1026	*
210331_A01052_0041_BHYMHFDSXY	normal	2444	GGCATTCT.CAAGCTAG.1.210331_A01052_0041_BHYMHFDSXY.PTC_TsqN200511_L2000437_rerun	PTC_TsqN200511	L2000437	1	gds://production/primary_data/210331_A01052_0041_BHYMHFDSXY/2022010540ea2224/WGS_TsqNano/PTC_TsqN200511_L2000437_rerun_S1_L001_R1_001.fastq.gz	gds://production/primary_data/210331_A01052_0041_BHYMHFDSXY/2022010540ea2224/WGS_TsqNano/PTC_TsqN200511_L2000437_rerun_S1_L001_R2_001.fastq.gz	777	*
230602_A00130_0258_BH55TMDSX7	tumor	13801	GTCGGAGC.TTATAACC.4.230602_A00130_0258_BH55TMDSX7.PTC_TsqN200511_L2000432_rerun2	PTC_TsqN200511	L2000432	4	gds://production/primary_data/230602_A00130_0258_BH55TMDSX7/202306044d9dcbd1/WGS_TsqNano/PTC_TsqN200511_L2000432_rerun2_S14_L004_R1_001.fastq.gz	gds://production/primary_data/230602_A00130_0258_BH55TMDSX7/202306044d9dcbd1/WGS_TsqNano/PTC_TsqN200511_L2000432_rerun2_S14_L004_R2_001.fastq.gz	1981	*
230602_A00130_0258_BH55TMDSX7	tumor	13800	ACTAAGAT.CCGCGGTT.4.230602_A00130_0258_BH55TMDSX7.PTC_TsqN200511_L2000431_rerun2	PTC_TsqN200511	L2000431	4	gds://production/primary_data/230602_A00130_0258_BH55TMDSX7/202306044d9dcbd1/WGS_TsqNano/PTC_TsqN200511_L2000431_rerun2_S13_L004_R1_001.fastq.gz	gds://production/primary_data/230602_A00130_0258_BH55TMDSX7/202306044d9dcbd1/WGS_TsqNano/PTC_TsqN200511_L2000431_rerun2_S13_L004_R2_001.fastq.gz	1981	*
230602_A00130_0258_BH55TMDSX7	tumor	13797	ACTAAGAT.CCGCGGTT.3.230602_A00130_0258_BH55TMDSX7.PTC_TsqN200511_L2000431_rerun2	PTC_TsqN200511	L2000431	3	gds://production/primary_data/230602_A00130_0258_BH55TMDSX7/202306044d9dcbd1/WGS_TsqNano/PTC_TsqN200511_L2000431_rerun2_S13_L003_R1_001.fastq.gz	gds://production/primary_data/230602_A00130_0258_BH55TMDSX7/202306044d9dcbd1/WGS_TsqNano/PTC_TsqN200511_L2000431_rerun2_S13_L003_R2_001.fastq.gz	1981	*
230608_A01052_0153_AH53JWDSX7	tumor	13822	CTTGGTAT.GGACTTGG.1.230608_A01052_0153_AH53JWDSX7.PTC_TsqN200511_L2000433_rerun2	PTC_TsqN200511	L2000433	1	gds://production/primary_data/230608_A01052_0153_AH53JWDSX7/2023060984ce8fd3/WGS_TsqNano/PTC_TsqN200511_L2000433_rerun2_S5_L001_R1_001.fastq.gz	gds://production/primary_data/230608_A01052_0153_AH53JWDSX7/2023060984ce8fd3/WGS_TsqNano/PTC_TsqN200511_L2000433_rerun2_S5_L001_R2_001.fastq.gz	1991	*
230608_A01052_0153_AH53JWDSX7	tumor	13827	CTTGGTAT.GGACTTGG.2.230608_A01052_0153_AH53JWDSX7.PTC_TsqN200511_L2000433_rerun2	PTC_TsqN200511	L2000433	2	gds://production/primary_data/230608_A01052_0153_AH53JWDSX7/2023060984ce8fd3/WGS_TsqNano/PTC_TsqN200511_L2000433_rerun2_S5_L002_R1_001.fastq.gz	gds://production/primary_data/230608_A01052_0153_AH53JWDSX7/2023060984ce8fd3/WGS_TsqNano/PTC_TsqN200511_L2000433_rerun2_S5_L002_R2_001.fastq.gz	1991	*
230629_A01052_0154_BH7WF5DSX7	tumor	13948	TCCAACGC.AAGTCCAA.2.230629_A01052_0154_BH7WF5DSX7.PTC_TsqN200511_L2000434_rerun2	PTC_TsqN200511	L2000434	2	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000434_rerun2_S1_L002_R1_001.fastq.gz	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000434_rerun2_S1_L002_R2_001.fastq.gz	2034	*
230629_A01052_0154_BH7WF5DSX7	tumor	13949	CCGTGAAG.ATCCACTG.2.230629_A01052_0154_BH7WF5DSX7.PTC_TsqN200511_L2000435_rerun2	PTC_TsqN200511	L2000435	2	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000435_rerun2_S2_L002_R1_001.fastq.gz	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000435_rerun2_S2_L002_R2_001.fastq.gz	2034	*
230629_A01052_0154_BH7WF5DSX7	normal	13950	GGCATTCT.CAAGCTAG.2.230629_A01052_0154_BH7WF5DSX7.PTC_TsqN200511_L2000437_rerun3	PTC_TsqN200511	L2000437	2	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000437_rerun3_S5_L002_R1_001.fastq.gz	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000437_rerun3_S5_L002_R2_001.fastq.gz	2034	*
230629_A01052_0154_BH7WF5DSX7	normal	13952	GGCATTCT.CAAGCTAG.3.230629_A01052_0154_BH7WF5DSX7.PTC_TsqN200511_L2000437_rerun2	PTC_TsqN200511	L2000437	3	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000437_rerun2_S4_L003_R1_001.fastq.gz	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000437_rerun2_S4_L003_R2_001.fastq.gz	2034	*
230629_A01052_0154_BH7WF5DSX7	tumor	13951	TTACAGGA.GCTTGTCA.3.230629_A01052_0154_BH7WF5DSX7.PTC_TsqN200511_L2000436_rerun2	PTC_TsqN200511	L2000436	3	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000436_rerun2_S3_L003_R1_001.fastq.gz	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000436_rerun2_S3_L003_R2_001.fastq.gz	2034	*
230707_A00130_0264_BH7WJTDSX7	tumor	14027	CCGTGAAG.ATCCACTG.2.230707_A00130_0264_BH7WJTDSX7.PTC_TsqN200511_L2000435_rerun3	PTC_TsqN200511	L2000435	2	gds://production/primary_data/230707_A00130_0264_BH7WJTDSX7/2023070859f58140/WGS_TsqNano/PTC_TsqN200511_L2000435_rerun3_S3_L002_R1_001.fastq.gz	gds://production/primary_data/230707_A00130_0264_BH7WJTDSX7/2023070859f58140/WGS_TsqNano/PTC_TsqN200511_L2000435_rerun3_S3_L002_R2_001.fastq.gz	2066	*

Summarised as

SELECT sequencerun.instrument_run_id, limsrow_metadata.phenotype, COUNT(DISTINCT fastqlistrow.rglb) as libraries_in_run
FROM silver.limsrow_metadata
INNER JOIN portal.libraryrun
  ON limsrow_metadata.library_id = libraryrun.library_id
INNER JOIN portal.sequencerun
  ON sequencerun.instrument_run_id = libraryrun.instrument_run_id
INNER JOIN portal.fastqlistrow 
  ON libraryrun.library_id = fastqlistrow.rglb
  AND sequencerun.id = fastqlistrow.sequence_run_id
  AND libraryrun.lane = fastqlistrow.lane
WHERE 
  limsrow_metadata.subject_id = 'SBJ00480'
AND
  fastqlistrow.rgsm = 'PTC_TsqN200511'
GROUP BY limsrow_metadata.phenotype, sequencerun.instrument_run_id
ORDER BY sequencerun.instrument_run_id
instrument_run_id phenotype libraries_in_run
200605_A01052_0016_AH5LYNDSXY normal 1
200605_A01052_0016_AH5LYNDSXY tumor 1
200612_A01052_0017_BH5LYWDSXY tumor 6
200619_A01052_0018_AH5LYGDSXY tumor 1
200624_A01052_0019_BH5L7WDSXY tumor 1
210331_A01052_0041_BHYMHFDSXY normal 1
230602_A00130_0258_BH55TMDSX7 tumor 2
230608_A01052_0153_AH53JWDSX7 tumor 1
230629_A01052_0154_BH7WF5DSX7 tumor 3
230629_A01052_0154_BH7WF5DSX7 normal 1
230707_A00130_0264_BH7WJTDSX7 tumor 1

@alexiswl
Copy link
Member Author

Instead the short-term approach is to update these fastq list rows' rgsm values to contain the sample name and the library id

@victorskl
Copy link
Member

@victorskl victorskl added the on hold too fast, temporary on hold, to coordinate changes label Jul 21, 2023
@victorskl victorskl added triage & trello Triage and migrate into respective Trello project change request and removed on hold too fast, temporary on hold, to coordinate changes labels Aug 1, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
change request triage & trello Triage and migrate into respective Trello project
Projects
None yet
Development

No branches or pull requests

4 participants