New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
data for pie chart proportions #105
Comments
In this case, TEstrainer has failed to produce a strained version of the input TE library. This could be because no non-satellite repeats were found in the RepeatModeler run, but this needs to be verified for the runs that failed. What version of Earl Grey are you using? Which OS? What flags did you use to run Earl Grey? Providing the whole log file should help to understand where the process has failed. Verify that the If you are happy to ignore any filtering steps, you can just rename *-families.fa to |
Thank you for the prompt response. It is possible that these samples had no non-satellite repeats, but I'd want to check before ruling that out. Also, I'm now only trying to get the raw data with which the pie charts are generated. The full log file is attached |
The script that failed was the simple repeat filter trimming step in TEstrainer... @jamesdgalbraith any ideas on this one? It looks like all previous steps of TEstrainer completed successfully, so it is something in the post-processing that has caused an issue |
Regarding the data used for the plots, this is in the |
Thank you, this is perfect for the samples that ran successfully. As for those with an unsuccessful run that didn't generate anything in the |
You will need to run the final repeatmasker step, followed by the post-filtering. The easiest way to do this is to rename the families.fa with the .strained file, delete everything in *mergedRepeats/ , *RepeatMasker_Against_Custom_Library/ (but not the directories themselves) then run exactly the same command you did before, and Earl Grey will continue as if the TEstrainer step completed successfully. I would still recommend trying to work out why the TEstrainer step failed, hopefully James could give us some more insight! |
Sorry for the delay, I think I've identified the problem. In the |
Hello again, thank you for following up on this to try to find a fix. |
Hello, I ran EarlGrey on 21 assemblies but it only produced summary files for 5/21.
All runs produced SAMPLE-families.fa files showing representative sequences for each family, but the log files show the following:
This doesn't seem to be an assembly issue, because there were good and poor assembled sequences in both the successful runs and the failed runs.
I would appreciate any guidance on how/where to find the identified TEs in the temporary files in both successful and failed runs to calculate raw values for percentage of TEs covering the genomes, similar to what the pie charts show in the completed runs.
The text was updated successfully, but these errors were encountered: