TokenFlow's Implementation Details: Everything That We Used | HackerNoon
Briefly

In our experiments using Stable Diffusion as the text-to-image model, runtime efficiency improved significantly, with per-frame editing time reduced by 20%.
Using DDIM inversion with 1000 steps is a bottleneck in our method. However, we discovered that a much smaller number of steps, such as 50, often suffices.
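To make the step-count trade-off concrete, here is a minimal sketch of how a 50-step schedule can be drawn from a 1000-step diffusion process. It assumes evenly strided timesteps, which is a common convention but is not stated in the summary above; the function name `ddim_timesteps` and its parameters are hypothetical and used only for illustration.

```python
def ddim_timesteps(num_train_steps: int = 1000, num_inference_steps: int = 50) -> list[int]:
    """Return an evenly strided subset of timesteps in ascending order.

    Ascending order matches the inversion direction (clean image toward noise).
    Uniform striding is an assumption, not the authors' confirmed schedule.
    """
    if not 0 < num_inference_steps <= num_train_steps:
        raise ValueError("num_inference_steps must be in (0, num_train_steps]")
    stride = num_train_steps // num_inference_steps
    return [i * stride for i in range(num_inference_steps)]


full = ddim_timesteps(1000, 1000)   # one model evaluation per training timestep
short = ddim_timesteps(1000, 50)    # the reduced schedule mentioned above

print(len(full), len(short))        # 1000 50
print(short[:4], short[-1])         # [0, 20, 40, 60] 980
print(len(full) // len(short))      # 20
```

Each inversion step costs one forward pass of the noise-prediction network, so under this assumption the 50-step schedule needs 20 times fewer passes than the 1000-step one.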