Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about example 2 #21

Open
Yecats77 opened this issue May 7, 2024 · 3 comments
Open

Question about example 2 #21

Yecats77 opened this issue May 7, 2024 · 3 comments

Comments

@Yecats77
Copy link

Yecats77 commented May 7, 2024

Hi,

I tried your example 2. The commands and output results(adjusted_TADs2.txt_types.txt) are attached.

# ========= Example2: Testing TADs on all chromosomes ============= 

# <bed> data/GSE63525_GM12878_primary+replicate_Arrowhead_domainlist.txt
# <ofile: output file> temp/temp.txt
python3 ${diffdomain_py3}diffdomains.py dvsd multiple ${hic0} ${hic1} ${diffdomain_home}data/GSE63525_GM12878_primary+replicate_Arrowhead_domainlist.txt --ofile ${res_path}temp/temp.txt

# <input> temp/temp.txt
# <output> adjusted_TADs2.txt 
python ${diffdomain_py3}diffdomains.py adjustment fdr_bh ${res_path}temp/temp.txt ${res_path}adjusted_TADs2.txt 
# --filter true adj_pvalue<=0.05
# The final output is saved to <diff_res/reorganized_TADs_GM12878_K562.tsv>
python ${diffdomain_py3}diffdomains.py adjustment fdr_bh ${res_path}temp/temp.txt ${res_path}reorganized_TADs_GM12878_K562.tsv --filter true

cd ${res_path}
# -d diffdoamin's outcome
# -t the other tadlist
# output diffDomain/diff_res/{}_types.txt
python ${diffdomain_py3}classification.py -d ${res_path}adjusted_TADs2.txt  -t ${diffdomain_home}data/GSE63525_K562_Arrowhead_domainlist.txt
python ${diffdomain_py3}classification.py -d ${res_path}reorganized_TADs_GM12878_K562.tsv  -t ${diffdomain_home}data/GSE63525_K562_Arrowhead_domainlist.txt

In the result, it is observed that, for some regions, there are only condition 1 entries whose type is loss; while for other regions, the type of consition 2 are always nan.
I am not sure what happend. May I ask whether I made any mistakes when using the diffDomain tools? Could you please help with this problem?

Thank you.

Best regards,
Stacey LIU

image
adjusted_TADs2.txt
adjusted_TADs2.txt_types.txt

@Tian-Dechao
Copy link
Owner

Hey

Your run on the example is correct.
The subtype column was meant to further classify the type column. Missing value nan in the subtype column represents that the type results cannot be further classified into more than one subgroups, as in the case for the reorganized TADs that are loss in condition 2.

@Yecats77
Copy link
Author

Yecats77 commented May 8, 2024

Thanks for your reply!

I understand what the subtype mean at this time.

So, in this example 2, the results mean that there are no types of reorgnized TADs are identified under the condition 2, since all elements from type column under conditon 2 are nan. Is this understanding correct?

And may I ask how you compare and get this number "DiffDomain identifies that 30.771% of GM12878 TADs are reorganized in K562" based on the results in adjusted_TADs2.txt_types.txt?

Thank you.

Best regards,
Stacey LIU

@Yecats77 Yecats77 closed this as completed May 8, 2024
@Yecats77 Yecats77 reopened this May 8, 2024
@Tian-Dechao
Copy link
Owner

  1. Yes.
    Please note that this does not mean that condition 2 has no TADs that are reorganized in condition 1. To identify condition 2 TADs that are reorganized in condition 1, just treat condition 2 as a new condition 1 and condition 1 as a new condition 2 and rerun DiffDomain. In the example2, treat K562 as condition 1 and GM12878 as condition 2.

2 To get the proportion of TADs that are reorganized in condition 2, we need to count two numbers.

  1. the number of reorganized TADs
    significant column in adjusted_TADs2.txt_types.txt indicates whether a TAD is significantly reorganized, with 1 standing for significant, 0 for not significant, and nan for condition 2 TADs. Count the number of 1 should get the number of reorganized TADs.
  2. the number of TADs
    Count 'condition1' in the origin column.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants