aydao

@AydaoAI

Creator of "This Anime Does Not Exist" ( model. AI/ML Research + Art.

Joined August 2019

1K Following

2.6K Followers

761 Posts

AydaoAI retweeted

Emily @the_aiju

almost 2 years ago

apparently you can nerdsnipe an LLM by asking it if there are primes whose digits sum to 9 (the correct answer is “no”) ^^ every single one i’ve tried it on goes crazy trying every single prime it can think of, for some reason they are all convinced such primes must exist

132

4K

189

894

406K

about 2 years ago

@nearcyan been saying this

0

4

0

0

272

AydaoAI retweeted

over 3 years ago

"Waliugi" as the LLM shadow self presages adopting other literary/cultural references as we cope with increasingly alien intelligences by mapping them to familiar personas. Mythologizing AI phenomena the way the Greeks equated lightning with the weapons of Zeus.

1

35

3

3

4K

over 2 years ago

The Sora authors analyzed Square crops vs just training in native aspect ratios is giving flashbacks to TADNE. Ultimately we went with random square crops but I am pretty sure some tensorfork folks looked at rescaling images based on resolution

1

6

0

1

1K

Who to follow

Xander Steenbrugge

Verified account

AI engineer, creative tech, community, online educator. Building https://t.co/9DVk5HK1xC

Verified account

programmer, prompter, primate

Learning deeply at @Canva

over 2 years ago

Excellent points about a critically undervalued paper!

Ivan Skorokhodov

over 2 years ago

This paper has received significantly less attention than it deserves, so let me shed a bit more light on it and describe why it's so good: 1. It turns out that the classical U-Net image diffusion backbone, which the entire community has been happily building upon during the past ~3 years (including Stable Diffusion), has severe flaws in its training dynamics. If you track its weights/activations statistics during training, you will observe a steady malignant growth in their magnitudes. Turns out, it impairs convergence and "simply" re-designing the architecture to incorporate a better normalization pipeline improves the performance by a staggering ~2.5 times in terms of image quality. 2. If you've ever trained large neural networks, you might have found yourself ranting about EMA (Exponential Moving Average) parameter updates. This technique involves keeping an exponential moving average of the model weights during training and using this EMA at inference time, throwing away the original network. I think it's one of the most mysterious and unexplored hacks in modern deep learning optimization, significantly influencing final performance (EMA usually yields 2-3 times better quality than the original model itself). Selecting a proper EMA width is pure pain since we know almost no heuristics about it. Apparently, Karras et al. got fed up with this and developed a rigorous strategy on how to store checkpoints in a way that allows you to find the optimal EMA width post-hoc after training is complete. The nicest thing about this new EMA strategy is that it's applicable to any DL model (i.e., not just image diffusion) and, honestly, I would even expect it to be incorporated in some GPT-5 in the future.

14

1K

222

1K

272K

0

3

0

1

788

over 2 years ago

@TheodoreGalanos Yes I've been thinking a lot about this. The lesson in this era is that "the ux is the message" Great tooling and interfaces are game changers. Even smart people can't be bothered to use something if the experience sucks.

0

1

0

0

18

over 2 years ago

AI makes smart people smarter. Never seen someone stupid benefit or get anything out of the interaction. Competency gaps are going to widen.

3

6

0

1

620

over 2 years ago

@yacineMTB damn I keep forgetting this was the year I coulda been paid to shitpost on twitter. ok, that's it, I know what my new year's resolution is gonna be

1

2

0

0

82

over 2 years ago

@andrewitsover I use google maps for my local neighborhood 😩😩😭

1

0

0

0

57

over 2 years ago

@dokumor8 Yes, and I have a HUGE respect for their researchers, scientists, and engineers. Something has gone horribly wrong in their structure to prevent real tech progress from manifesting.

0

1

0

0

17

over 2 years ago

@proales Exactly. Huge cultural problem. Innovation is not possible with corporate thinking.

0

0

0

0

20

over 2 years ago

@dofelite There's some thought that AI will make non experts into experts

1

1

0

0

30

AydaoAI retweeted

over 2 years ago

18 months before midjourney's open beta! I made these images with a GAN - 0.1sec per image

2

99

4

9

15K

AydaoAI retweeted

over 3 years ago

Back in guided diffusion days (remember, earlier this year?) a bug in the official github broke finetuning on my GPU. Lurking the EleutherAI discord I saw someone had posted a fix. Which is great, yet so much knowledge risks being lost to time in discord's impenetrable archives

3

19

1

1

0

over 2 years ago

Two shocking facts that continually astound: 1.) Literally BILLIONs of people exist. So, so many. It's tough to fathom. 2.) Random anons change the world by posting on this site (often, with anime avis)

0

8

0

0

683

over 2 years ago

Never thought I'd see the day

AydaoAI's tweet photo. Never thought I'd see the day https://t.co/zgusjeoZjU

1

18

0

1

1K

AydaoAI retweeted

over 2 years ago

Returning to OpenAI & getting back to coding tonight.

1K

39K

3K

711

6M

over 2 years ago

enjoy being cozy in this early era of quirky AI video generation while you can soon models will be high quality, open source, and in the hands of competent prompters... that's when things are going to get really weird

0

5

0

1

676

over 2 years ago

Eternal September but it's AI orgs this time

over 2 years ago

all the people that can make eye contact at openai joined in the last 6 months and they’re making me uncomfortable with their eye contact

140

5K

150

372

1M

1

7

0

0

656

about 3 years ago

@jonathanfly Glad to see you tweeting again!

0

2

0

0

128

Last Seen Users on Sotwe

Trends for you

Most Popular Users