kirk

@68kirk

"Nobody dies a virgin... Life f*** us all!"

Joined May 2014

438 Following

73 Followers

1.1K Posts

27 days ago

@che_shr_cat Meh, you could technically argue that but at the end of the day you still take the grad of a loss wrt to sth (input, token, etc.). What would happen if the model doesn't have refusal phrases in its training data (distribution), would the approach still work?

0

0

0

0

12

28 days ago

@che_shr_cat Looks like early days of adversarial examples all over again. Is there anything conceptually different from them?

1

0

0

0

15

about 2 months ago

@fatihdin4en @wredman4 @Xiaoxiao_Lin1 Nice work, has the effect been validated for other architectures as well? Does the high-d neural codes exist broadly in other archs or is it a byproduct of certain conditions?

1

2

0

0

43

about 2 months ago

@ShamKakade6 @Kimi_Moonshot @tri_dao @_albertgu @SonglinYang4 @srush_nlp @ZeyuanAllenZhu @HannaHajishirzi @simran_s_arora @lambdaviking @DimitrisPapail Validation CE vs. Transformer looks more like noise than clear signal, difference in the 3rd decimal digit. What are the std error bars?

0

0

0

0

115

Who to follow

Verified account

Research @Meta MSL TBD | past @GoogleDeepMind @Stanford @MSRNE @VectorInst @RIKEN_AIP_EN @Tsinghua_Uni. Building probabilistic & algorithmic models for learning

VP of Learning and Perception Research @NvidiaAI. Views and opinions are my own.

Verified account

Post-Training @Cohere 🇨🇦 Formerly @ServiceNowRSRCH, @Mila_Quebec, @GoogleDeepmind, @AWScloud, @SpotifyResearch.

about 2 months ago

@gautamcgoel @SimonsInstitute No opposite gender candidates in the group?

0

0

0

0

74

2 months ago

@fleetwood___ Are you sequentially learning the tasks one after the other (at least that's what I can infer from the plots)? Try in a multitask fashion if u haven't already.

0

0

0

0

51

2 months ago

Unpopular opinion, all LLM generated code out there should have a visible human readable disclaimer and a technical machine readable signal.

0

0

0

0

45

2 months ago

@DimitrisPapail One hypothesis is that they probably changed sth since the source code leakage, which btw revealed that underneath the hood there's a ton of prompt orchestration going on. By that logic even a small prompt change could alter LLM behavior. They also had prompts obscuring info.

0

0

0

0

293

2 months ago

@adamlsteinl Tests & verifiers should be isolated from agents and encrypted. Ideally tests should not be online. Given inputs agents provide answers, which at a 2nd step are passed to verifiers. It would be interesting to look at traces to see what percentage will try to decrypt the answers.

0

1

0

0

87

3 months ago

@che_shr_cat Didn't we already knew that transformers fail at algorithmic tasks? They mostly solve these kind of tasks by using spurious correlations https://t.co/k8jW3yEpSv

0

0

0

0

64

4 months ago

@JFPuget Welcome to the era of "we'll delete everything and you'll be happy"

0

0

0

0

35

4 months ago

@DimitrisPapail Nice write up! I lf I'm not mistaken I would say the pair encoding trick of tokens resembles a lot the RLE trick where u compress repeating values/letters in a condensed format representation. Has been used a lot in vision & I bet there's a lot of codebases on web with it.

0

0

0

0

251

4 months ago

@ZimingLiu11 @naturecomputes @SuryaGanguli @AToliasLab Nice thread! Any thoughts on how do continuous but non autoregressive models like fno and pinns compare to transformer based? Based on your findings they should be avoiding both issues present in transformers?

0

0

0

0

41

4 months ago

@giffmana @crude2refined Indeed, the optimizers we use are quite sensitive and as such lr can help when stuck in a local min to get unstuck, all other hyperparams are there to smooth the optimizer trajectory.

0

0

0

0

52

5 months ago

@JFPuget @jm_alexia 🤔 isn't that already happening? The majority of academics use overleaf and given that overleaf has integrated 3rd party AI models, those could potentially be sending data to underlying companies, no?

1

0

0

0

56

5 months ago

@branerico @adrian1977 @burkov I think it already has lost its value if u consider that on average a PhD makes what a 20 year old earns with vocational training working as an electrician in datacenters.

0

1

0

0

77

5 months ago

c.f. https://t.co/oOUUYSiJ6r

0

0

0

0

59

5 months ago

History has a funny way of repeating itself...

68kirk's tweet photo. History has a funny way of repeating itself... https://t.co/35uW23BXs4

1

0

0

0

62

6 months ago

@connordavis_ai Didn't LLM's already knew how to answer what-if questions, even at a superficial level. What was the baseline here?

0

0

0

0

14

6 months ago

@elonmusk The default should be reject all by definition. There problem solved.

0

0

0

0

9

Last Seen Users on Sotwe

Trends for you

Most Popular Users