jared tumiel @jnearestn - Twitter Profile

10 months ago

super excited to announce our collaboration to the world! here’s the backstory: we started with a shared goal of using AI to generate novel discoveries that would be non-trivial for human scientists to make

13

143

16

51

25K

jnearestn retweeted

rico meinl

@ricomnl

11 months ago

my team is hiring for a research associate! we're looking for ambitious people right out of their undergrad who want to get 1-2 years of experience in a fast moving environment before starting a PhD (at or outside of Retro) https://t.co/PLqXgVLxPW

1

19

3

6

2K

jared tumiel @jnearestn

about 1 year ago

it’s a good model sir

0

2

0

325

jnearestn retweeted

Simón Vidal @SimonVidalV

about 1 year ago

We keep building retro/altos 🚀

0

19

2

0

627

jared tumiel @jnearestn

over 1 year ago

@Andrei_Tarkhov staring at the fastq

0

1

0

51

jnearestn retweeted

John David Pressman

@jd_pressman

over 1 year ago

@Meaningness EY basically told you to predict the next token and take it as a bug report if your epistemology is not helping you predict the next token. If an agent wants to make its Brier score go down eventually it *has* to learn to balance various incomplete systems or it gets stuck.

2

23

1

741

jnearestn retweeted

roon

@tszzl

over 1 year ago

there is no way to "prepare for the future of work". you just need to survive

154

2K

156

235

345K

jnearestn retweeted

Stephen Malina

@an1lam

over 1 year ago

This is unfortunate because the ostensible benefit of aggregators is matching people with diverse preferences to the diverse menu of options available. But instead we get "cultural mode collapse".

1

3

1

0

338

jnearestn retweeted

🍓🍓🍓

@iruletheworldmo

over 1 year ago

literally stands for Oct 1 lul. agi is cool I guess.

8

149

8

3

13K

jnearestn retweeted

𒐪

@SHL0MS

over 1 year ago

the fact that Apple sets this as default-on and only allows you to turn it off manually from each app’s individual settings implies that they are collecting a ton of data from this

you don’t need dark patterns for settings that aren’t valuable to you https://t.co/JRxXOWW3nV

13

319

34

71

38K

jnearestn retweeted

Aidan McLaughlin

@aidan_mclau

over 1 year ago

chain-of-thought
tree-of-thought
monte-carlo-tree-of-thought
graph-of-thought
backtracking-tokens-of-thought
vector-space-of-thought
oh-wait-that's-just-a-model-of-thought
hilbert-space-of-thought
non-euclidean-geometry-of-thought
covariant-general-relativity-of-thought

114

1K

122

396

92K

jnearestn retweeted

Armand Domalewski

@ArmandDoma

over 1 year ago

the world has entered its spacepunk era

474

88K

7K

8K

3M

jnearestn retweeted

Aidan McLaughlin

@aidan_mclau

over 1 year ago

it's only called reasoning if it's from the brain region of homo sapiens. otherwise, it's just sparkling auto-regression

46

2K

222

180

125K

jnearestn retweeted

Sam Altman

@sama

over 1 year ago

no more patience, jimmy

709

9K

656

404

1M

jared tumiel @jnearestn

over 1 year ago

@garybasin inference compute go brr

0

1

0

101

jared tumiel @jnearestn

almost 2 years ago

@acidshill internal tiktok PM

0

17

jnearestn retweeted

Mark Goldstein

@marikgoldstein

almost 2 years ago

Probably need gradient clipping or cosine LR decay

12

1K

64

68

77K

jnearestn retweeted

Tom Tumiel @tomtumiel

almost 2 years ago

link: https://t.co/9plCQiSb4Q

0

3

1

2

339

jared tumiel @jnearestn

almost 2 years ago

discusses tricks & techniques for model training and inference at scale:
- model compilation
- kernel fusion
- KV-caching
- gradient accumulation
- low-rank finetuning
- sharding & data parallelisation
+ more, go check it out! https://t.co/3YoZQUEzBU

0

3

0

99

jared tumiel @jnearestn

almost 2 years ago

so much alpha in one blog post: practical scaling of LLMs by @tomtumiel link below

2

6

1

159
