Sam Ching @samcwl - Twitter Profile

Sam Ching @samcwl

9 months ago

@swyx @cognition Congrats sir!!!

0

59

Sam Ching @samcwl

about 1 year ago

@polynoamial @iclr_conf Welcome! Worth checking out this thread: https://t.co/GR5lAGxomq

Daniel Ching

@danielchingwq

about 1 year ago

1/n: A thread for local eats+transport in 🇸🇬 for those coming to @ICLR ! For those coming to Singapore for the first time, a huge welcome :D ICLR would be held at the Expo -- closest MRT station: CG1 (Green East-West Line, 1 stop from Changi Airport), DT35 (Blue Downtown Line)

danielchingwq's tweet photo. 1/n: A thread for local eats+transport in 🇸🇬 for those coming to @ICLR !
For those coming to Singapore for the first time, a huge welcome :D
ICLR would be held at the Expo -- closest MRT station: CG1 (Green East-West Line, 1 stop from Changi Airport), DT35 (Blue Downtown Line) https://t.co/hHPttDjvUA

1

14

4

11

3K

0

2

0

343

samcwl retweeted

Daniel Ching

@danielchingwq

about 1 year ago

1/n: A thread for local eats+transport in 🇸🇬 for those coming to @ICLR ! For those coming to Singapore for the first time, a huge welcome :D ICLR would be held at the Expo -- closest MRT station: CG1 (Green East-West Line, 1 stop from Changi Airport), DT35 (Blue Downtown Line)

1

14

4

11

3K

Sam Ching @samcwl

about 1 year ago

Worth a read! Kudos to @darshj_shah , @peter_rushton, @ashVaswani and the rest of the team for this work. Looking fwd to future releases.

Ashish Vaswani

@ashVaswani

about 1 year ago

To learn more, read our complete paper. We will be sharing additional results with the community in the coming weeks because we believe open science will accelerate the development of frontier capabilities. Paper Link: https://t.co/o53ncpQH0J [4/4]

2

116

12

68

11K

0

14

0

1

967

Who to follow

Edwin Bodge

@edwin_bodge

Founding PM: @duolingo Chess + Max • Engineering @DukeU

Ali Abouelatta

@abouelatta_ali

Tinkering with AI https://t.co/RB3vd7jhFk Prev PM @ Duolingo. Blog: https://t.co/qnAhv4dikS

about 1 year ago

@aidenybai Worth checking out @morph_labs

0

2

0

72

Sam Ching @samcwl

over 1 year ago

@justLV @DrOnwude @sesame Very cool. Kudos on the launch! Also interested if you’ll have a finetuning API or guide down the line.

0

2

0

372

Sam Ching @samcwl

over 1 year ago

@rachpradhan Nicely done Rach! Would be great to integrate with @daft_dataframe cc @JayChia5

1

3

0

72

Sam Ching @samcwl

over 1 year ago

@rosstaylor90 Congrats on the launch Ross!! Thank you for sharing this and looking fwd to what the community does. Curious what's the timeline to getting the datasets on 🤗? https://t.co/AoS4N9EZ1U

1

0

68

samcwl retweeted

Brendan (can/do)

@BrendanFoody

over 1 year ago

When we started the company at 19, we had grand ambitions, but I never imagined how fast it would happen. I'm incredibly grateful for the team we've built and everything they've accomplished. Labor allocation is the most important problem in the world and we're only cracking the surface.

26

233

17

47

41K

samcwl retweeted

Michael Poli

@MichaelPoli6

over 1 year ago

[1/7] Introducing Evo 2, a new foundation model for biology. 🚀 Evo 2 is the largest-scale, fully open-source AI model ever released: 40 billion parameters, over 9 trillion tokens, and a 1 million context length. All the details are public: weights, data, training infrastructure, and inference infrastructure. ⚡Evo 2 is built on a new model architecture: convolutional multi-hybrids (StripedHyena 2). StripedHyena 2 excels at modeling byte-tokenized data, providing faster training and lower perplexity compared to both Transformers and previous-generation hybrids based on state-space models. I am grateful for the team behind Evo 2—working with you was one of the proudest moments of my career (the core pretraining team was fewer than five people; you can just do things). 📚 Today, we release two papers (yes, plural), as well as weights, data, training, and inference codebases. Enjoy!

MichaelPoli6's tweet photo. [1/7] Introducing Evo 2, a new foundation model for biology.
🚀 Evo 2 is the largest-scale, fully open-source AI model ever released: 40 billion parameters, over 9 trillion tokens, and a 1 million context length. All the details are public: weights, data, training infrastructure, and inference infrastructure.

⚡Evo 2 is built on a new model architecture: convolutional multi-hybrids (StripedHyena 2). StripedHyena 2 excels at modeling byte-tokenized data, providing faster training and lower perplexity compared to both Transformers and previous-generation hybrids based on state-space models.

I am grateful for the team behind Evo 2—working with you was one of the proudest moments of my career (the core pretraining team was fewer than five people; you can just do things).

📚 Today, we release two papers (yes, plural), as well as weights, data, training, and inference codebases. Enjoy!

25

475

86

221

75K

Sam Ching @samcwl

over 1 year ago

@n0riskn0r3ward Great thread! Tks for sharing. Would love to hear more about 6c - what are the org incentives that shape the odds this way?

0

1

0

471

Sam Ching @samcwl

over 1 year ago

@eddy_data3 Gotcha totally makes sense! This looks promising though - excited to see more work in this area (bootstrapping environments from webscale data) Kudos on the work!

0

1

0

21

Sam Ching @samcwl

over 1 year ago

One qn: > 5.1: In contrast, for data from WebInstruct without fully reliable supervision signals but with a much larger scale, we sample one response per prompt from the teacher model without filtration. -> why not use filtered subset and do rejection sampling for SFT?

1

0

154

Sam Ching @samcwl

over 1 year ago

@zephyr_z9 Traniums though

0

78

Sam Ching @samcwl

over 1 year ago

@IvanVendrov @alexatallah have you seen any? Also curious. also cc @hyperbolic_labs

0

54

Sam Ching @samcwl

over 1 year ago

(also highly recommend the entire talk - it's a banger!) https://t.co/cka7Ouyx0R

0

1

0

197

Sam Ching @samcwl

over 1 year ago

This + @sama 's tweet (https://t.co/Wpj02oJBof) remind me of @rosstaylor90 's talk from last year. What is the most helpful style for models to express their reasoning in?

samcwl's tweet photo. This + @sama 's tweet (https://t.co/Wpj02oJBof)
remind me of @rosstaylor90 's talk from last year.

What is the most helpful style for models to express their reasoning in? https://t.co/ayBGXoqAic

1

0

1

480

Sam Ching @samcwl

over 1 year ago

SFT initialization helps, and could also contribute to style, per @rosstaylor90 https://t.co/qpZHpwxQTt

1

2

0

129

Sam Ching @samcwl

over 1 year ago

Scaling verifiable reward signals is the key: appreciate the silver signals study. Love to see more work push on mining verifiable rewards from webscale data: https://t.co/M1BTJpNpSg cf Karparthy / Shane on more rl envs -- https://t.co/3xjYvGRV4P

Shane Gu

@shaneguML

over 1 year ago

@karpathy In short, we need BIG-bench for LLM RL. https://t.co/G997YwFLGH

0

7

0

4

1K

1

2

0

198

Sam Ching @samcwl

over 1 year ago

Congrats to @eddy_data3 and @xiangyue96 and the team for the detailed study on Long CoT. Many interesting ablations and lessons, similar to parallel work coming out e.g. by @junxian_he

eddy

@eddy_data3

over 1 year ago

Such a rewarding experience (pun intended) collaborating with @tongyx361 @xiangyue96 @sirius_ctrl @gneubig! We hope our results are useful to the community 🙏

2

10

6

2

3K

1

5

0

585

Sam Ching

@samcwl

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users