Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doesn't work for the the sequences in variable lengths. #107

Open
ruizhideng opened this issue Mar 31, 2023 · 2 comments
Open

Doesn't work for the the sequences in variable lengths. #107

ruizhideng opened this issue Mar 31, 2023 · 2 comments

Comments

@ruizhideng
Copy link

Hi @AvantiShri,

Thanks for the amazing work!

In the following notebook, you mentioned the pipeline also works for the sequences in different lengths. However, when I was testing the notebook with input data with shape of [100, length, 4], the length ranges from 500-1000 bp, it raised the ValueError: setting an array element with a sequence. The requested array has an inhomogeneous shape after 1 dimensions. The detected shape was (100,) + inhomogeneous part.
Then if I crop the sequences and contribution scores to the same length, it works again. I am wondering is there any version available for the sequences with different length?
https://github.com/kundajelab/tfmodisco/blob/master/examples/simulated_TAL_GATA_deeplearning/TF_MoDISco_TAL_GATA.ipynb

Thank you again!

Best wishes,
Ruizhi

@AvantiShri
Copy link
Collaborator

AvantiShri commented Apr 11, 2023 via email

@ruizhideng
Copy link
Author

Thanks for the reply.

Yes, the outermost iterable is a python list. I am wondering whether you still have the pipeline for that version.

I also tried lite version, it only works for arraries in the same length.

But it's fine. If it's too tricky, I will just pad the input data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants