Spaces:
Running
Running
Linoy Tsaban
commited on
Commit
·
f1c4123
1
Parent(s):
6e7303e
Update index.html
Browse files- index.html +15 -12
index.html
CHANGED
|
@@ -207,7 +207,7 @@
|
|
| 207 |
</p>
|
| 208 |
<section class="section">
|
| 209 |
<div class="container is-max-desktop">
|
| 210 |
-
|
| 211 |
<div class="columns is-centered has-text-centered">
|
| 212 |
<img src="static/images/variations.png"
|
| 213 |
class="interpolation-image"
|
|
@@ -256,18 +256,16 @@
|
|
| 256 |
<div class="content">
|
| 257 |
<h2 class="title is-4">Component 1: Perfect Inversion</h2>
|
| 258 |
<p>
|
| 259 |
-
Utilizing
|
| 260 |
-
identify a noisy xT that will be denoised to the input image x0.
|
| 261 |
-
We
|
|
|
|
| 262 |
of steps while maintaining no reconstruction error.
|
| 263 |
-
|
| 264 |
-
|
| 265 |
-
equation
|
| 266 |
-
(SDE) solver when
|
| 267 |
-
formulating the reverse diffusion process as an SDE. This
|
| 268 |
SDE can be solved more efficiently—in fewer steps—
|
| 269 |
-
using a higher-order differential equation solver, hence we
|
| 270 |
-
Inversion.
|
| 271 |
|
| 272 |
</p>
|
| 273 |
|
|
@@ -300,7 +298,12 @@
|
|
| 300 |
<div class="columns is-centered">
|
| 301 |
<div class="column content">
|
| 302 |
<p>
|
| 303 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 304 |
of an image relevant to an editing concept that is not already present.
|
| 305 |
Specifically for multiple edits, calculating a
|
| 306 |
dedicated mask for each edit prompt ensures that the corresponding
|
|
|
|
| 207 |
</p>
|
| 208 |
<section class="section">
|
| 209 |
<div class="container is-max-desktop">
|
| 210 |
+
|
| 211 |
<div class="columns is-centered has-text-centered">
|
| 212 |
<img src="static/images/variations.png"
|
| 213 |
class="interpolation-image"
|
|
|
|
| 256 |
<div class="content">
|
| 257 |
<h2 class="title is-4">Component 1: Perfect Inversion</h2>
|
| 258 |
<p>
|
| 259 |
+
Utilizing T2I models for editing real images is usually done by inverting the sampling
|
| 260 |
+
process to identify a noisy xT that will be denoised to the input image x0.
|
| 261 |
+
We draw characteristics from edit friendly DDPM inversion [] and propose an efficient
|
| 262 |
+
inversion method that greatly reduces the required number
|
| 263 |
of steps while maintaining no reconstruction error.
|
| 264 |
+
DDPM can be viewed as a first-order
|
| 265 |
+
SDE solver when formulating the reverse diffusion process as an SDE. This
|
|
|
|
|
|
|
|
|
|
| 266 |
SDE can be solved more efficiently—in fewer steps—
|
| 267 |
+
using a higher-order differential equation solver, hence we derive a new, faster
|
| 268 |
+
technique - dpm-solver++ Inversion.
|
| 269 |
|
| 270 |
</p>
|
| 271 |
|
|
|
|
| 298 |
<div class="columns is-centered">
|
| 299 |
<div class="column content">
|
| 300 |
<p>
|
| 301 |
+
In our defined LEDITS++ guidance, we include a masking term composed of the
|
| 302 |
+
intersection between the mask generated from
|
| 303 |
+
the U-Net’s cross-attention layers and a mask derived from
|
| 304 |
+
the noise estimate - yielding a mask both focused on relevant image
|
| 305 |
+
regions and of fine granularity.
|
| 306 |
+
We empirically demonstrate that these maps can also capture regions 290
|
| 307 |
of an image relevant to an editing concept that is not already present.
|
| 308 |
Specifically for multiple edits, calculating a
|
| 309 |
dedicated mask for each edit prompt ensures that the corresponding
|