{"id":7,"title":"Characterizing Stable Diffusion's Latent Space","independent":true,"description":"Stay tuned for more details on this project as it develops. For now, I am focused on developing a more mathematically and theoretically grounded understanding of Stable Diffusion, including a deeper review of its code.\r\n\r\nSome questions this project may explore:\r\nReachability: what is the typical difference between an arbitrary image and the most similar image that Stable Diffusion can reach? Did the model lose the ability to reach some categories of images, or were only noisy images filtered out?\r\n\r\nDisentanglement: can we uncover directions or trajectories in the latent space that correspond to semantically meaningful actions like rotations, spatial movement, or temporal causality? Could this let us generate rudimentary sequences of temporally related images (that is, a video) just by exploiting properties of the latent space, rather than training a whole new architecture?","start":"2024-07-01","end":null,"img":"https://imgur.com/zOPNRjR.gif","link":"https://github.com/SevanBrodjian/sd-latent-exploration","slug":"sd-latent-space-exploration","topic":[3],"association":[]}