场效应猫 @meowfet - Twitter Profile

MeowFET retweeted

22 days ago

🚀Introducing UniRL, an RL infra for unified multimodal models. Together with two new RL algorithms: DRPO and Flow-DPPO. One RL loop across diffusion/flow matching models, LLMs/VLMs, and unified multimodal models👇 Code: https://t.co/fhKEqqFpc8 (yes — U(you)-ni-(need) RL ��)

TencentHunyuan's tweet photo. 🚀Introducing UniRL, an RL infra for unified multimodal models. Together with two new RL algorithms: DRPO and Flow-DPPO.

One RL loop across diffusion/flow matching models, LLMs/VLMs, and unified multimodal models👇

Code: https://t.co/fhKEqqFpc8

(yes — U(you)-ni-(need) RL ��) https://t.co/1o9Swg2biE

8

153

25

56

27K

MeowFET retweeted

Xiangxin Zhou @NickZhou523786

17 days ago

🚨 Uniform token-level trust regions are not enough for LLM RL! Our new paper: Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning. We introduce CPPO, a drop-in mask that reallocates divergence budget by position & prefix drift 👇 https://t.co/svooEqAcss

NickZhou523786's tweet photo. 🚨 Uniform token-level trust regions are not enough for LLM RL!

Our new paper: Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning.

We introduce CPPO, a drop-in mask that reallocates divergence budget by position & prefix drift 👇

https://t.co/svooEqAcss https://t.co/QNQWTXQ4Ts

3

179

29

223

37K

MeowFET retweeted

Daniel Lockyer

@DanielLockyer

7 months ago

wait, is GitHub down now? cannot fetch, push or pull atm

90

581

39

16

70K

MeowFET retweeted

Popp123 @PoppPopp123

about 1 year ago

@wangzhian8848 以后台湾过年改名“余人节”

18

1K

11

10

30K

Who to follow

almost 3 years ago

@Carlos_Gong 罗老师别这样

0

41

场效应猫 @MeowFET

almost 3 years ago

literally LOL

心臓��眞君 @xinzoruo

almost 3 years ago

谢谢你，留云借风真君 #Ganyu #Shenhe #CloudRetainer

183

88K

12K

1K

2M

0

313

场效应猫 @MeowFET

almost 3 years ago

@cnpoliwatch 行礼如仪。同理，负责更新外交部网站的工作人员想来无权传递额外的信息，暂时移除相关页面可以理解为避免展示过时内容，感觉无需作额外解读。

0

10

0

4K

场效应猫 @MeowFET

almost 3 years ago

@tankman2002 这里引用的话是秦的下属说的

0

29

0

24K

MeowFET retweeted

LuckyJ @LuckyJ1443836

about 3 years ago

LuckyJ has reached 10 dan. I would like to express my gratitude to https://t.co/Sy7EE92tEA and everyone who played with us.

LuckyJ1443836's tweet photo. LuckyJ has reached 10 dan. I would like to express my gratitude to https://t.co/Sy7EE92tEA and everyone who played with us. https://t.co/ZKu6tJh56C

8

631

237

27

204K

场效应猫 @MeowFET

about 3 years ago

现在入坑，说不准能干到退休

0

61

场效应猫 @MeowFET

about 3 years ago

感觉随着显示技术进步和 XR 落地，视频编解码作为一个特别工程的学科，还大有用武之地

1

0

79

场效应猫 @MeowFET

about 3 years ago

@Carlos_Gong 发布 Vision Pro 的 Moment 和贵前司在鸟巢掏出下一代计算设备属实有点押韵…… https://t.co/Lmu0JVfTkd

1

0

216

场效应猫 @MeowFET

over 3 years ago

@tinyfool 百度本来就有 GPT-3 规模的大模型，只不过实际性能有差距，可能存在对训练数据过拟合的问题

0

404

场效应猫 @MeowFET

over 3 years ago

@noahlin 感觉耗能很高……

0

23

场效应猫 @MeowFET

over 3 years ago

@Carlos_Gong Skin

0

37

场效应猫 @MeowFET

over 3 years ago

虽然方法非常直给 —— 标注 + RL (PPO)，但从网上各种例子看，真是把 GPT3 的能力发挥到极致了

0

场效应猫 @MeowFET

over 3 years ago

可怕捏，至少“装”这个事，ChatGPT 已经玩明白了

Jiayuan (JY) Zhang

@jiayuan_jy

over 3 years ago

1/ 在 ChatGPT 中实现了一门新的编程语言：GPTLang，并用这个语言写了一个排序算法。定义了一个新的命令 `gptlc`，用来编译 GPTLang 的代码。下图是最终的效果：让 ChatGPT 用 GPTLang 写了一个选择排序，并在命令行编译运行。这个 thread 将会详细讲述一下是如何一步步实现这门语言的。

jiayuan_jy's tweet photo. 1/ 在 ChatGPT 中实现了一门新的编程语言：GPTLang，并用这个语言写了一个排序算法。

定义了一个新的命令 `gptlc`，用来编译 GPTLang 的代码。

下图是最终的效果：让 ChatGPT 用 GPTLang 写了一个选择排序，并在命令行编译运行。

这个 thread 将会详细讲述一下是如何一步步实现这门语言的。 https://t.co/4BPQCWplC2

62

1K

412

328

0

1

0

场效应猫 @MeowFET

almost 4 years ago

真挺好喝的，无周边款 19 块，甚至显得很有性价比。

心臓弱眞君 @xinzoruo

almost 4 years ago

喝喝喝🥥

144

24K

1K

252

0

MeowFET retweeted

Duan Dang

@duandang

almost 4 years ago

China's military exercises around Taiwan in August 2022 and March 1996 (Third Taiwan Strait crisis). This time, some exercise areas overlap with Taiwan's territorial waters, an apparent escalation.