Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error running vcf2xg.sh #117

Open
mariarecuerda6 opened this issue Feb 12, 2024 · 2 comments
Open

Error running vcf2xg.sh #117

mariarecuerda6 opened this issue Feb 12, 2024 · 2 comments

Comments

@mariarecuerda6
Copy link

Hello,

I want to generate the pcf file but I get the following error when running the vcf2xg.sh in my own data. The vcf file was generated using minigraph-cactus pangenome pipeline.

./vcf2gfa/vcf2pcf.rb:24:in split': invalid byte sequence in UTF-8 (ArgumentError) from ./vcf2gfa/vcf2pcf.rb:24:in block in

'
from ./vcf2gfa/vcf2pcf.rb:18:in each' from ./vcf2gfa/vcf2pcf.rb:18:in '

  • input=.//sporophila_output.pcf
  • reference=/workdir/mjr432/IBEB009696_masked.1.fa
  • pref=.//sporophila_output.pcf
  • sort -t, -k 7n .//sporophila_output.pcf
  • awk -F '[,:]' '{print $1,"\t",$2,"\n",$4,"\t",$5}' /dev/fd/63
  • sed -e 's/[ ]*//g'
    ++ sed 1d .//sporophila_output.pcf
  • sort -k 1,1 -k 2,2n
  • uniq
  • '[' '!' -s /workdir/mjr432/IBEB009696_masked.1.fa.fai ']'
    ++ dirname ./vcf2gfa/pcf2gfa.sh
  • ruby ./vcf2gfa/json_to_breakpoint_list.rb .//sporophila_output.pcf.bp.tsv /workdir/mjr432/IBEB009696_masked.1.fa
  • cat .//sporophila_output.pcf.bp.tsv breakpoint_list_tmp.tsv
  • sort -k 1,1 -k 2,2n
  • uniq
  • awk '{print $1 "\t" $2 "\t" $2}' .//sporophila_output.pcf.bp.merged.tsv
  • awk '{print $1 "\t0\t" $2}' /workdir/mjr432/IBEB009696_masked.1.fa.fai
  • bedtools intersect -a .//sporophila_output.pcf.bp.merged.bed -b .//sporophila_output.pcf.reference.tsv
  • cut -f 1,2
    ++ dirname ./vcf2gfa/pcf2gfa.sh
  • ruby ./vcf2gfa/gfa_generator.rb .//sporophila_output.pcf.bp.merged.filtered.tsv .//sporophila_output.pcf.output.pcf /workdir/mjr432/IBEB009696_masked.1.fa
    [faidx] Zero length sequence: chr1:0-49999
    [faidx] Zero length sequence: chr10:0-49999
    [faidx] Zero length sequence: chr100:0-49999
    [faidx] Zero length sequence: chr101:0-49999
    [..........]
    error [vg convert]: Input GFA is not acceptable.
    GFA format error: On pass 1: On line 2: At column 5: Expected nonempty value while parsing sequence
    error[VPKG::load_one]: Correct input type not found in .//sporophila_output.vg.old while loading handlegraph::MutablePathDeletableHandleGraph
    error[VPKG::load_one]: Correct input type not found while loading handlegraph::HandleGraph

I check the gfa that is generated and it just have to columns in line 2, but I have already the xg and the gfa files that were generated at the same time as the vcf file that look fine, it is possible to run it with my own gfa?

Thank you in advance for your assistance.

@asunboi
Copy link

asunboi commented Mar 7, 2024

Hi, I'm having the exact same issue. Was wondering if you or the developers have found any solution to this?

@mariarecuerda6
Copy link
Author

I haven't find any solution..

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants