Max @max_pe2002 - Twitter Profile

Pinned Tweet

Max @max_pe2002

almost 3 years ago

max_pe2002's tweet photo. https://t.co/iYDBYf2Qgj

0

5

0

3K

Max @max_pe2002

4 days ago

@engnr_george @niccruzpatane for collection and logging this would be absolutely overkill

1

0

54

Max @max_pe2002

22 days ago

@DavidSHolz dont give them ideas

0

1

0

11

Max @max_pe2002

24 days ago

@teslaeurope @yunta_tsai the EU is actively killing people by now allowing Tesla FSD

0

2

0

60

Who to follow

I be rapping and stuff. Soundcloud: https://t.co/gyw1IGylpX

Jake

@IMdrJake

I'm the Mountain which the world climbs down from and I laugh because it Tickles.

Max @max_pe2002

24 days ago

Thought my ImageNet model was brokens until i finally increased the CFG. CFG=[1.0, 1.5, 2.0, 2.5, 3.0, 4.0]

0

1

0

33

Max @max_pe2002

25 days ago

@ph_reinhardt can you link the paper?

1

3

0

319

Max @max_pe2002

26 days ago

@___Harald___ @antferdom how do typical robotics models work for self driving? like simple diffusion policy which is pretrained on all data and finetuned on good data? in my mind this would already completly solve self driving. (with high context lenght)

0

75

Max @max_pe2002

27 days ago

@JDVance @elonmusk the USA saved Europe from the Nazis when will the USA save Europe from mass migration?

0

1

0

16

Max @max_pe2002

about 1 month ago

@MozarellaPesto @ludocomito so FD Loss?

1

2

0

279

Max @max_pe2002

about 1 month ago

@MozarellaPesto do you think you could get rid of the transforms if adding a small image space mse loss?

0

2

0

57

Max @max_pe2002

about 1 month ago

why arent LLM people using things like REPA? i think SRA could be usefull. aligning an early layer with a late layer

0

54

Max @max_pe2002

about 1 month ago

@tokenpilled65B @madebyollin @Shauray7 maybe its a eval issue. JiT atleast for me doesnt produce good fine details for far away objects which might not impact FID much but which might impact perceptual losses

0

1

0

74

Max @max_pe2002

about 1 month ago

@tokenpilled65B @madebyollin @Shauray7 oh okay thats very interesting isnt this a bit then what JiT says. like JiT does ps64 with dim 768 and stuff

1

0

66

Max @max_pe2002

about 1 month ago

@tokenpilled65B @madebyollin @Shauray7 im still working on the arch before scaling. i did a small test scale with 1.4b params total. with the decoder having a dim of 1920

1

0

56

Max @max_pe2002

about 1 month ago

@tokenpilled65B @madebyollin @Shauray7 yep convs seem to work better than just ViTs but i dont like convs

0

1

0

22

Max @max_pe2002

about 1 month ago

@tokenpilled65B @madebyollin @Shauray7 im also training tokenizer right now and LPIPS and Dino loss cant help my large patch size issues a little but not much

1

0

49

Max @max_pe2002

about 1 month ago

@madebyollin @Shauray7 JiT allows large patch sizes to work which doesnt mean that large patch sizes are equally good. also JiT really only works well for Imagenet as subjects are close. As soon as a face is only handled by like 1 patch it just cant model it properly

1

5

0

166

Max @max_pe2002

about 1 month ago

@ostrisai why arent you using FD loss for this?

1

0

105

Max @max_pe2002

about 1 month ago

@elonmusk @SawyerMerritt @AnthropicAI @SpaceX when can individuals or smaller labs rent them?

0

15

Max @max_pe2002

about 2 months ago

@iScienceLuvr isnt FID also measuring coverage?

0

3

0

616

Max @max_pe2002

about 2 months ago

@HyperTechInvest i remember a time i could get 8x B200 for 8$

0

78

Max

@max_pe2002

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users