People&Business Lab @P_and_B_Lab - Twitter Profile

over 2 years ago

HOW DIFFUSION MODELS CREATE IMAGES: HIGH-LEVEL OVERVIEW. V2

Diffusion models are the technology behind tools like Midjourney that generate images from random noise. Here's a simple explanation of how they work:

1. Stochastic Differential Equations: At the core of diffusion models are stochastic differential equations (SDEs). These equations describe how to gradually add noise to an image, transforming it into a random pattern over time. The process is akin to adding more and more static to an image until it becomes unrecognizable.

2. Reverse Process: The true magic happens in the reverse of this process. The model is trained to estimate the noise that was added at each step, so that a reverse-time SDE can remove it gradually and turn pure noise into a coherent image. This is where the training data plays a vital role: from it, the model learns the statistical properties of images and how to form coherent structures from randomness.

Training is not meant to memorize specific images but to capture general patterns and features.

3. Navigating the Image Space: Many diffusion models work in a compressed, high-dimensional space known as latent space. Here, every point can be thought of as a potential image. The model starts with a random point (pure noise) and then refines it step by step into a detailed image that makes sense.

The model doesn't store any images. ‘Navigating the image space’ refers to a conceptual and mathematical process.

4. Interpreting Text Prompts Mathematically: When the model receives a text prompt, a text encoder converts it into numerical vectors (embeddings). These vectors then guide the image creation process in the latent space, steering the final image toward the description.

5. Generating Unique Images: The randomness of the starting noise means the images produced are different each time. The model makes calculated predictions at every step, influencing how the image evolves. It's similar to using a different set of building blocks and a new plan for each construction project.
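To make point 1 concrete, here is a minimal Python sketch of the noising step. It assumes a DDPM-style discrete noise schedule rather than a continuous SDE, and the names `make_alpha_bar` and `add_noise`, the schedule values, and the 8x8 random "image" are all illustrative assumptions, not part of any specific product.

```python
import numpy as np

def make_alpha_bar(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative signal-retention factors for a linear schedule (assumed values)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def add_noise(image, t, alpha_bar, rng):
    """Jump directly to step t of the forward (noising) process."""
    noise = rng.standard_normal(image.shape)
    noisy = np.sqrt(alpha_bar[t]) * image + np.sqrt(1.0 - alpha_bar[t]) * noise
    return noisy, noise

alpha_bar = make_alpha_bar()
rng = np.random.default_rng(seed=0)
image = rng.uniform(-1.0, 1.0, size=(8, 8))  # stand-in for a real image

for t in (0, 250, 999):
    noisy, _ = add_noise(image, t, alpha_bar, rng)
    corr = np.corrcoef(image.ravel(), noisy.ravel())[0, 1]
    print(f"step {t}: signal weight = {np.sqrt(alpha_bar[t]):.3f}, "
          f"correlation with original = {corr:.3f}")
```

The signal weight shrinks toward zero as the step number grows, which is the "transforming it into a random pattern" part of the explanation.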

I hope this was helpful and you now understand diffusion models a bit more.
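For anyone who wants to see points 2 and 5 in motion, here is a toy sketch of the reverse process. It makes strong assumptions: the "images" are single numbers drawn from one Gaussian, so the ideal noise estimate can be written in closed form and `predict_noise` stands in for a trained neural network. The schedule values and function names are hypothetical, and the update rule is the DDPM-style ancestral step, not necessarily what any given tool uses.

```python
import numpy as np

NUM_STEPS = 200
betas = np.linspace(1e-4, 0.05, NUM_STEPS)  # assumed noise schedule
alphas = 1.0 - betas
alpha_bar = np.cumprod(alphas)

DATA_MEAN, DATA_STD = 3.0, 0.5  # toy "image distribution": one Gaussian

def predict_noise(x_t, t):
    """Stand-in for the trained network: exact expected noise for Gaussian data."""
    var_t = alpha_bar[t] * DATA_STD**2 + (1.0 - alpha_bar[t])
    return np.sqrt(1.0 - alpha_bar[t]) * (x_t - np.sqrt(alpha_bar[t]) * DATA_MEAN) / var_t

def sample(seed, size=4):
    """Start from pure noise and denoise step by step."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(size)  # the random starting point
    for t in range(NUM_STEPS - 1, -1, -1):
        eps_hat = predict_noise(x, t)
        x = (x - betas[t] / np.sqrt(1.0 - alpha_bar[t]) * eps_hat) / np.sqrt(alphas[t])
        if t > 0:  # no fresh noise is added on the final step
            x = x + np.sqrt(betas[t]) * rng.standard_normal(size)
    return x

print(sample(seed=1))  # one set of generated values
print(sample(seed=2))  # a different seed gives different values
```

Different seeds give different outputs, while the same seed reproduces the same output. In a text-to-image system the prompt embedding would be an extra input to the noise predictor, which this toy leaves out.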

P_and_B_Lab retweeted

WHAT'S INSIDE?

@whatsinside

over 2 years ago · The Las Vegas Strip

What’s inside the Las Vegas Sphere?

P_and_B_Lab retweeted

AsieNews @AsiaNews_FR

almost 3 years ago

#Chine 🇨🇳 The #AsianGames opening ceremony, with no traditional fireworks. Digital fireworks, glasses-free 3D and augmented reality (#AR)

P_and_B_Lab retweeted

Sénat

@Senat

almost 3 years ago

🔴 The Sénat has unanimously adopted the bill to secure and regulate the digital space. #DirectSénat #PJLNumérique Learn more: 🔗 https://t.co/jZ5YJHgDxg

P_and_B_Lab retweeted

Barsee 🐶

@heyBarsee

about 3 years ago

Midjourney 5.1 is out and it's incredible. Text-to-image is reaching near-photorealism in 2023. Here are 7 realistic images I generated in less than 10 minutes: 👇

P_and_B_Lab retweeted

💙 #TechForGood 💙 @Shi4Tech

about 3 years ago

Neuro expert Christof Koch weighs #AI progress against its potential threats https://t.co/4LPnq7Icjr v/ @tobiaskintzel #GenerativeAI #GPT4 #AGI @sallyeaves @jblefevre60 @CurieuxExplorer @ipfconline1 @Xbond49 @FrRonconi @SpirosMargaris @Fabriziobustama @LaurentAlaus @RLDI_Lamy

P_and_B_Lab retweeted

Twitch.tv/1030 @1030

over 3 years ago

👁️👁️ I'm not looking at you. Amazing new machine-learning technology from @nvidia called Eye Contact. As an autistic guy I wish I had this in real-life. I'm testing it now LIVE on https://t.co/fladAbb1Rg Congrats @gerdelgado and team.

People&Business Lab @P_and_B_Lab

almost 4 years ago

#IA #deepfake #movie 👉 9 min 40: VFX Artist Breaks Down, via @wired https://t.co/WpyrGLbL68

P_and_B_Lab retweeted

ROBA @vjroba

about 4 years ago

Portalgraph's tabletop pseudo-hologram makes quite an impact, so I'd like to demo it too, but carrying two units along would be a bit much.

People&Business Lab @P_and_B_Lab

about 4 years ago

https://t.co/24OJ5QAdTZ

P_and_B_Lab retweeted

Prosthetic Knowledge @prostheticknowl

over 8 years ago

The Parallax View Project from @algomystic is an iPhone X visual toy using TrueDepth face tracking to produce a trompe-l'œil effect of depth from the position of your head https://t.co/sOv7pV7BpD

P_and_B_Lab retweeted

Science girl

@sciencegirl

about 4 years ago

It looks alive: ferrofluid dances in sync to music in a ferrofluid audio visualizer. This video has sound.

P_and_B_Lab retweeted

💙 #TechForGood 💙 @Shi4Tech

over 4 years ago

Until April the 3rd, Smiley is celebrating its 50th anniversary😍 By @Galeries_Laf v/ @anand_narang #AR #AugmentedReality #TechForGood @CurieuxExplorer @enricomolinari @jblefevre60 @sebbourguignon @TerenceLeungSF @Hana_ElSayyed @jeancayeux @gvalan

P_and_B_Lab retweeted

Jérémy Fa @jeremyfaivre

over 4 years ago

A few rooms later, “Effet de champ” by Stéphane Bissières. Mesmerizing

P_and_B_Lab retweeted

GiGadgets

@gigadgets_

about 5 years ago

These playful art installations use rotating bricks to interact with you. Driven and coordinated by software and hardware, these art installations are completed through concerted efforts from engineers of multiple domains. #gigadgets #visualarts #bricks #artinstallation #playful