Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

calculate cis/total during balancing and use it to filter rows #134

Closed
golobor opened this issue Sep 20, 2018 · 4 comments
Closed

calculate cis/total during balancing and use it to filter rows #134

golobor opened this issue Sep 20, 2018 · 4 comments

Comments

@golobor
Copy link
Member

golobor commented Sep 20, 2018

A major practical issue with processing Hi-C data is the presence of genomic translocations. They lead errors into the calculated genomic distances and confuse cis contacts with trans ones, thus breaking the expectations of downstream analyses (obs/exp, eigenvectors).
Historically, @mimakaev and @gfudenberg dealt with these issues by filtering out genomic bins that form an untypically high fraction of trans contacts. Recently, @Phlya pointed out the need for such filters in cooler.
A simple suggestion would be to calculate the cis/total fraction in raw bins (cis_tot_raw), detect low-value outliers using MADmax and filter them out (on top of the already used MADmax-coverage filter).It may also be useful to report both the cis_tot_raw and cis_tot_balanced (i.e. cis/total fraction after filtering/balancing).

@Phlya
Copy link
Member

Phlya commented Sep 20, 2018

Also intra-chromosomal translocations, which can, probably, be filtered similarly, but by long-range cis over total cis, or over short range cis.

@gfudenberg
Copy link
Member

gfudenberg commented Sep 20, 2018 via email

@gfudenberg
Copy link
Member

bumping this-- @golobor suggests that cis & total sums can be set as columns during balancing

@nvictus nvictus mentioned this issue Jun 30, 2020
5 tasks
@nvictus
Copy link
Member

nvictus commented Jun 30, 2020

See #210

@nvictus nvictus closed this as completed Jun 30, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants