
Feature request: Use depth pass as input mask for new SD 2.0 depth model for more structural coherence #49

Open
marianbasti opened this issue Nov 29, 2022 · 14 comments
Assignees: benrugg
Labels: enhancement (New feature or request)

@marianbasti (Contributor) commented Nov 29, 2022

Describe the feature you'd like to see:

This is exciting! Alongside the latest release, we now have a model with an extra input for a depth map: https://huggingface.co/stabilityai/stable-diffusion-2-depth.

A1111's repo hasn't included this in the API yet, but it's definitely something to keep an eye on, as it brings us a step closer to temporal coherence.
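For anyone who wants to try the model directly in the meantime, here's a rough sketch using Hugging Face's diffusers library (untested here; the pipeline class and parameters come from the diffusers docs, and the file names and prompt are placeholders):

```python
# Minimal depth2img sketch with diffusers (not AI Render's implementation).
import torch
from diffusers import StableDiffusionDepth2ImgPipeline
from PIL import Image

pipe = StableDiffusionDepth2ImgPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-depth",
    torch_dtype=torch.float16,
).to("cuda")

init_image = Image.open("render.png")  # placeholder: a Blender render

result = pipe(
    prompt="a cozy cabin in a snowy forest",  # placeholder prompt
    image=init_image,
    strength=0.7,  # how strongly the init image gets repainted
    # depth_map=...,  # optionally supply your own depth map (e.g. from Blender)
    #                 # instead of the pipeline's internal MiDaS estimate
).images[0]
result.save("depth2img_out.png")
```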

Additional information

No response

@benrugg (Owner) commented Nov 29, 2022

Yes! I can't wait for this. I think it will be really powerful, especially for keeping animation more stable. As soon as it's implemented in DreamStudio, Stable Horde or Automatic1111, I'll try to add it quickly!

benrugg self-assigned this Nov 29, 2022
benrugg added the enhancement (New feature or request) label Nov 29, 2022
@jacbouzada

It would be great if you could implement it!
In my case, I'd be very interested in using it to control the massing of architectural images, as is done in this video: https://www.youtube.com/watch?v=CHfCT2lqNdo

@benrugg (Owner) commented Jan 25, 2023

Yes! This is going to be so useful. The Stability folks have a release planned for next week that will finally support depth2img. I'm planning to add support for it next week (to the integrations with both DreamStudio and Automatic1111).

@benrugg (Owner) commented Apr 14, 2023

This is now done with ControlNet in Automatic1111. The latest AI Render release supports it! Hopefully it will be available in DreamStudio soon.

https://github.com/benrugg/AI-Render/releases/tag/v0.7.5

(or update through the AI Render add-on preferences)

@ghost commented Apr 16, 2023

Yeah! You're awesome!

@JensSchmidt72 commented Apr 17, 2023

Can you pass the actual rendered depth values (z-buffer) into ControlNet, as opposed to a preprocessed guess?
The normals would be awesome too ;)

@benrugg (Owner) commented Apr 17, 2023

@JensSchmidt72 This is next on my list. I had assumed it would be very important, since Blender has accurate depth info compared to ControlNet's estimated depth pass. But after experimenting with it for a while in the web UI, I realized that the real depth image actually underperformed the estimated one most of the time, so I shelved this feature for later.

(As an example: with an object sitting on a table, the real depth pass blends the bottom of the object into the table, whereas the estimated pass separates the object from the table.)
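For reference, enabling the relevant render passes through Blender's Python API looks roughly like this (a sketch; not necessarily how it would be wired into AI Render):

```python
import bpy

# Enable the real depth (Z), mist, and normal passes on the active view layer
# so they can be exported and compared against ControlNet's estimated depth.
view_layer = bpy.context.view_layer
view_layer.use_pass_z = True        # raw camera-space depth (the z-buffer)
view_layer.use_pass_mist = True     # mist pass (comes up below)
view_layer.use_pass_normal = True   # surface normals, also useful for ControlNet
```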

@JensSchmidt72 commented Apr 19, 2023

Hi Ben :)
If I remember correctly, an artist friend of mine said he had better success using Blender's "Mist" pass in A1111 than the z-buffer. The Blender manual describes the Mist pass as: "Distance to visible surfaces, mapped to the 0.0 - 1.0 range." It sounds like they map the z-buffer values from the camera clip to the farthest pixel; in that case it would naturally give better separation between objects in smaller scenes, like the preprocessor does :)
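To illustrate the mapping I mean, something like this per-frame min/max remap (a sketch of the idea, not Blender's actual implementation):

```python
import numpy as np

def normalize_depth(depth: np.ndarray) -> np.ndarray:
    """Remap raw depth so the nearest pixel -> 0.0 and the farthest -> 1.0."""
    near, far = depth.min(), depth.max()
    if far - near < 1e-8:                # flat buffer: avoid division by zero
        return np.zeros_like(depth)
    return (depth - near) / (far - near)
```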

edit: grammar and clarity

@JensSchmidt72

Mist info:
I checked the Mist pass in Blender 3.5 and for me it defaults to Start = 5m and Depth = 25m. In other words, there is no auto-magic normalization going on.
Also, it's inverted in color compared to the z-buffer.
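The range can be adjusted per scene in Python, something like this (values are illustrative):

```python
import bpy

# Mist settings live on the scene's world (defaults: Start 5m, Depth 25m).
mist = bpy.context.scene.world.mist_settings
mist.start = 0.0      # distance where the mist value starts rising
mist.depth = 10.0     # distance over which it ramps from 0.0 to 1.0
mist.falloff = 'LINEAR'
```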

@benrugg (Owner) commented Apr 19, 2023

Ah, yeah, I'll have to check into the mist pass. That's a great idea. Sucks that there's no automatic normalization of some kind.

In my tests with the depth pass, I was sending it through a normalization node in Blender first, which would still make sense. But if the mist pass defaults to 5-25m, it could easily come out all white or all black, huh? I'd need to include more instructions, etc.
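For reference, that compositor wiring looks roughly like this in Python (a sketch, assuming the Z pass is enabled; the Invert node flips the depth/mist convention toward near-is-white, which is what ControlNet's depth preprocessor produces):

```python
import bpy

scene = bpy.context.scene
scene.use_nodes = True
nodes, links = scene.node_tree.nodes, scene.node_tree.links

rl = nodes.new("CompositorNodeRLayers")      # exposes the render passes
norm = nodes.new("CompositorNodeNormalize")  # per-frame min/max -> 0.0-1.0
inv = nodes.new("CompositorNodeInvert")      # flip so near = white
comp = nodes.new("CompositorNodeComposite")

links.new(rl.outputs["Depth"], norm.inputs[0])
links.new(norm.outputs[0], inv.inputs["Color"])
links.new(inv.outputs["Color"], comp.inputs["Image"])
```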

@JensSchmidt72

Yeah, it could fail to black/white. Of course, UI-wise, it would be amazing if one didn't need to adjust any settings but instead got a relevant normalization automatically :)

@benrugg (Owner) commented Apr 21, 2023

Yeah, I usually agonize over the UI to try to make it as easy as possible. (It's so difficult in Blender!)

@JensSchmidt72

I have very little knowledge about how add-ons (and the UI system) are built in Blender, but I'll take your word for it.
Would a node-based approach be easier to implement? It could be more powerful, but it would of course require more Blender knowledge to use.

@benrugg (Owner) commented Apr 21, 2023

I think there could be a good case for nodes. There are a few features I could add to AI Render where nodes would be helpful (the depth, mist, and normal passes we're talking about, and also a way to give different prompts for different areas of the image!).

At the moment, this is more than I'm planning on doing, but it could be great in the future.
