Extract Dialogue output IDs are not stable: they can change even when line content is exactly the same #5373

hsandt · 2024-02-20T15:39:05Z

I was using auto-voice and noticed that one of my text blocks was not voiced. So I re-Extracted Dialogues to make sure I had the latest IDs, and noticed it had changed. Yet the VCS shows no line content change (only line number change due to offset - but if it was causing the ID change, then all other paragraphs below should have changed ID too, but they didn't).

Below, my diff: only the ID of the first sentence changed. Lines content were the same, except the line number.

I assumed that line content was used for hashing ~~(apparently ignoring tags based on my previous testing)~~, but it seems that it's not. Or maybe it somewhat hit a super rare conflict but then again I don't see any other line that hit the same ID.

EDIT: I tested changing tags again and they seem to affect the ID even when stripping tags, which seems to defeat the purpose of stripping tags in order to avoid ID changing when the write is simply changing text format and wait timing but not the text itself.

This makes me pretty worried about starting any VA until definitive text format and timing is done (even if sentences don't change), as I'd have to re-assign all IDs properly. Unless someone knows of a full pipeline to auto-assign IDs to audio files with a single script run.

renpytom · 2024-02-21T05:17:14Z

At least right now, the voice ids are the translation IDs, which change when the strings (including the text tags) change. Stripping tags is just for proofreading and so on - it doesn't change the IDs Ren'Py generates.

One thing you might want to do to support this model - I'm not sure this is a good idea, but it's possible - would be to use the translation system to generate an english transation, which defaults to the text in the game. When that exists, you can changes the text of the translation how you want, while leaving the game script intact.

Alternatively, I could define a function that takes the information that goes into generating a voice identifier, and lets you generate your own identifiers (say, by stripping tags and doing the hashing yourself).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extract Dialogue output IDs are not stable: they can change even when line content is exactly the same #5373

Extract Dialogue output IDs are not stable: they can change even when line content is exactly the same #5373

hsandt commented Feb 20, 2024 •

edited

renpytom commented Feb 21, 2024

Extract Dialogue output IDs are not stable: they can change even when line content is exactly the same #5373

Extract Dialogue output IDs are not stable: they can change even when line content is exactly the same #5373

Comments

hsandt commented Feb 20, 2024 • edited

renpytom commented Feb 21, 2024

hsandt commented Feb 20, 2024 •

edited