Jingfeng Yang @JingfengY - Twitter Profile

Pinned Tweet

over 3 years ago

#ChatGPT and #GPT3 are hot. But let’s be practical, when we want to reproduce GPT-3 or use it in our applications. Why did all of the public reproduction of GPT-3 fail? In which tasks should we use GPT-3.5/ChatGPT? I tried to answer them in a new blog: https://t.co/KMoyiC7eh0 .

JingfengY's tweet photo. #ChatGPT and #GPT3 are hot. But let’s be practical, when we want to reproduce GPT-3 or use it in our applications. Why did all of the public reproduction of GPT-3 fail? In which tasks should we use GPT-3.5/ChatGPT? I tried to answer them in a new blog: https://t.co/KMoyiC7eh0 . https://t.co/mqYAKaOlQT

15

487

110

186

94K

Jingfeng Yang

@JingfengY

about 10 hours ago

@StevenyzZhang Congrats!

0

2

0

118

JingfengY retweeted

Jiwei Li

@JiweiLi1

7 days ago

Excited to share Ornith, our latest family of open-source models specialized for agentic coding. Ornith achieves SOTA performance among open-source models of comparable size on a variety of coding benchmarks (Terminal-Bench 2.1, SWE, NL2Repo, OpenClaw, SWE Atlas, etc) Feedback is deeply appreciated! 📖Tech Blog: https://t.co/MiaaDExj9B 🤗Huggingface: https://t.co/eDtzanc5Vp

52

564

42

194

46K

Jingfeng Yang

@JingfengY

28 days ago

This is what I suggested to @lawhy_X for agentic RL back then. He moved incredibly fast, implemented it, and clearly demonstrated its effectiveness. Check out the implementation!

Nan Jiang

@nanjiangwill

30 days ago

@justintchiu but slime has TITO example 5 months ago 👀👀 https://t.co/AmxBnvFsf2

2

12

2

3

3K

0

6

0

1

2K

Who to follow

Jie Huang

@jefffhj

Building intelligence @xAI. Grok-2🍍, 3🍫, 4🫐, Video Gen🪄. PhD from UIUC CS.

Bill Yuchen Lin

@billyuchenlin

RL for coding @xAI @SpaceX Affiliate Assistant Prof @UW. Ex: @allen_ai; Google, Meta FAIR.

Wenhu Chen

@WenhuChen

MSL FAIR@Meta. I led PoT, MMMU, MMLU-Pro, MAmmoTH, General-Reasoner, VL-Rethinker, Pixel-Reasoner. I contributed to Gemini-2.5. Prev @GoogleDeepMind.

JingfengY retweeted

Diyi Yang

@Diyi_Yang

about 1 month ago

The next frontier of AI is not only more capable model; it is an AI that *humans* can meaningfully live and work with :) With all students in my cs329x Human-Centered LLM class, we present 60+ pages of insights for developing Human-Centered LLMs (HCLLMs), from design & data sourcing to training, eval & deployment 🧵

Diyi_Yang's tweet photo. The next frontier of AI is not only more capable model; it is an AI that *humans* can meaningfully live and work with :)

With all students in my cs329x Human-Centered LLM class, we present 60+ pages of insights for developing Human-Centered LLMs (HCLLMs), from design & data sourcing to training, eval & deployment 🧵

14

290

77

183

55K

JingfengY retweeted

Sasha Rush

@srush_nlp

about 1 month ago

Been working on text feedback / OPSD in Composer. Really interesting space, and much more to be explored.

11

280

28

133

40K

Jingfeng Yang

@JingfengY

about 2 months ago

@mycharmspace 🐐

0

1

0

366

JingfengY retweeted

OpenAI

@OpenAI

2 months ago

We’re talking about Goblins. https://t.co/dqmcLGCW71

523

8K

832

2K

2M

JingfengY retweeted

DeepSeek

@deepseek_ai

2 months ago

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models. 🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice. Try it now at https://t.co/GCdiMzk1Dl via Expert Mode / Instant Mode. API is updated & available today! 📄 Tech Report: https://t.co/drlDrxkYtp 🤗 Open Weights: https://t.co/T13Y8i7SDM 1/n

deepseek_ai's tweet photo. 🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.

🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.

Try it now at https://t.co/GCdiMzk1Dl via Expert Mode / Instant Mode. API is updated & available today!

📄 Tech Report: https://t.co/drlDrxkYtp
🤗 Open Weights: https://t.co/T13Y8i7SDM

1/n

2K

46K

8K

10K

10M

JingfengY retweeted

Shangbang Long @ ICML 2026 Seoul @ShangbangLong

2 months ago

🚀 Excited to announce Vision Banana 🍌 and our new paper: “Image Generators are Generalist Vision Learners”. We turn Nano Banana Pro into a state-of-the-art visual generation and understanding model. 🖼️ Check out our gallery at https://t.co/CEQJXroPaE 🧵 (1/N) continue ⬇️

22

434

70

264

61K

JingfengY retweeted

Yu Su

@ysu_nlp

2 months ago

Introducing @NeoCognition, the agent lab for specialized intelligence. Everyone needs experts, but human expertise does not scale. Backed by $40M seed funding, we build self-learning agents that specialize across domains to make expertise abundant.

91

892

132

364

192K

JingfengY retweeted

Lucy Shi @lucy_x_shi

3 months ago

1/ We just released π0.7 — a steerable generalist robot model with emergent capabilities. I want to share a bit of the backstory, because π0.7 taught me something surprising about where robot learning is heading. A thread on bittersweet lessons 🧵

32

853

102

378

87K

JingfengY retweeted

Diyi Yang

@Diyi_Yang

3 months ago

Very excited to share that our project was selected as a @LaudeInstitute Moonshots seed grant winner on workforce upskilling @tatsu_hashimoto @erikbryn This is something I think about a lot these days. With so much uncertainty about AI and jobs today, I'm deeply motivated by this question of how can we use LLMs not to replace people, but to empower them...

8

182

25

33

38K

Jingfeng Yang

@JingfengY

3 months ago

“Automated research on outcome-gradable problems is already practical.” This work strongly validates my own experience using agents for automated research. I’m very excited to see this research come out—huge congrats to @liangqiu_1994! He’s a brilliant researcher, and I’m constantly inspired by his passion, creativity, and rigor.

Liang Qiu @liangqiu_1994

3 months ago

Something not studied here: the joy human researchers get from working on a problem purely out of curiosity, and from collaborating with others.

1

22

2

3

3K

0

6

0

532

JingfengY retweeted

Jan Leike

@janleike

3 months ago

New research result: we use Claude to make fully autonomous progress on scalable oversight research, as measured by performance gap recovered (PGR). Claude iterates on a number of different techniques and ends up significantly outperforming human researchers for $18k in credits.

janleike's tweet photo. New research result: we use Claude to make fully autonomous progress on scalable oversight research, as measured by performance gap recovered (PGR).

Claude iterates on a number of different techniques and ends up significantly outperforming human researchers for $18k in credits. https://t.co/fbVpCPPtaU

39

1K

120

608

147K

JingfengY retweeted

Zexuan Zhong @ZexuanZhong

3 months ago

Excited to share Muse Spark 🥑, a big step in MSL's journey towards personal superintelligence. Try it out on https://t.co/rgVnOxYD04 and let us know your feedback!

8

117

10

2

20K

JingfengY retweeted

Qian Huang

@qhwang3

3 months ago

It’s been amazing to move so fast with the team here 🙂 Still much more to do Give a try at https://t.co/S9JqRcaKWO and let us know your feedback!

12

128

14

6

17K

JingfengY retweeted

Zhuohan Li

@zhuohan123

3 months ago

It’s been an exciting nine months training this model from scratch. I’m especially proud of the opportunity to rebuild the foundational infrastructure alongside the strongest infra team I’ve ever worked with. The systems we’ve built will serve as a solid foundation for many more models to come. Stay tuned!

6

112

5

8

13K

JingfengY retweeted

Zhiqing Sun

@EdwardSun0909

3 months ago

Excited to share Muse Spark, the first model from whole team’s work in MSL! 🚀 It’s natively multimodal and agentic. I’ve been using it for my daily coding and research tasks. Still plenty of room to improve in agentic domains, but we’re moving with great velocity. It’s a seriously good model! Check out the full breakdown and try it out in https://t.co/Fka0wdAswy

8

205

26

10

23K

JingfengY retweeted

Hongyu Ren

@ren_hongyu

3 months ago

Check out Muse Spark, our first milestone in the quest for personal superintelligence! Scaling this with the team has been a total blast. Give it a spin and let us know what you think! 🥑