Dango233 @dango233max - Twitter Profile

10 days ago

Make your agent smarter. The II-Commons skill gives your agent reliable knowledge from arxiv, PubMed & more, plug it in Repo: https://t.co/JdV84AMPuv Add it to II-Agent: https://t.co/OWH1dFkwc4

3

55

12

37

8K

dango233max retweeted

-Zho-

@ZHO_ZHO_ZHO

22 days ago

变成喵在香港监狱喝咖啡哈哈哈哈哈 @dango233max

0

6

2

0

2K

dango233max retweeted

virushuo @virushuo

23 days ago

20 years ago, my first startup was all about enterprise search. Two decades later, we’re still building search engines. The technology has shifted from NLP to NN and the users from humans to agents. but searching is still the core. opensource the fastest bm25 engine:

5

54

8

31

20K

dango233max retweeted

Sixia "Leask" Huang

@LeaskH

25 days ago

我們開源了這顆星球🌎上速度最快的低成本 bm25 引擎。

6

232

32

186

46K

Who to follow

Alexander S

@devdef

CTO @ https://t.co/EyUQvm6IVc Beta is open! warpfusion, ArcaneGAN, face2comics. All tweets are sarcastic unless stated otherwise.

Takyon∞

@Takyon

AI ART + Strudel + Local LLM Run + Cybersecurity

David Marx (@digthatdata.bsky.social)

@DigThatData

Generative AI MLE, FOSS toolmaker, innovation catalyst @CoreWeave + @AiEleuther. https://t.co/z0fpuhlWRs

Dango233 @dango233max

27 days ago

https://t.co/4HMaHUZTT1

0

1

56

Dango233 @dango233max

27 days ago

DS4 is geart! I made a temporary fork with my weekend patches while the PRs are under review - unlock q4 on 192GB MAC - llama.cpp-style raw completions endpoint: enable Pre-filling and custom templates in SillyTaverns etc. Pre-merge convenience fork only :)

antirez @antirez

about 1 month ago

Welcome to DS4, a specialized inference engine for DeepSeek v4 Flash. https://t.co/UrUJz5I2R1 This project would have been impossible without the existence of llama.cpp and GGML and the work of @ggerganov and all the other contributors. Thanks!

47

1K

218

776

197K

1

3

0

265

Dango233 @dango233max

4 months ago

@karminski3 我召回测试用的都是苏丹的游戏的文本...

0

846

dango233max retweeted

virushuo @virushuo

4 months ago

我们始终还是相信 multi-agents 是必须的，尽管很多公司都认为它实现起来难度太大。我承认确实比预期困难一些，但是这应该是目前最“不一样”的多agent框架了。这个视频中每个节点都是agent，没有工作流，它们是自组织的，诞生，合作，互相攻击和死亡都是自主行为。

5

88

13

69

20K

dango233max retweeted

Intelligent Internet @ii_posts

4 months ago

Unstructured intelligence = chaos Most agent frameworks ship without a nervous system: deadlocks, context loss, vacuum hallucinations. We built Common Ground to fix this, agents coordinate on a shared protocol.

24

448

46

376

536K

dango233max retweeted

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

4 months ago

Chinese New Year is rapidly becoming the AI researcher's favorite holiday

40

1K

57

63

141K

dango233max retweeted

virushuo @virushuo

4 months ago

我参与了中文版翻译工作。希望把关于 AI 时代经济与治理的讨论带给更多中文读者，欢迎大家指出任何翻译/术语建议。虽然AI已经能做大部分翻译任务，但翻译过程中还是有很大量的人类对齐工作，尤其一些概念中/英差距很大，又要兼顾原作者表达的语气和方式，整个工作体验还是很有意思的。

19

398

70

297

63K

dango233max retweeted

Emad

@EMostaque

4 months ago

Our state of the art open source general purpose agent hits V1 Feature equivalent to Replit / Manus / Genspark etc, to make websites to presentations and more connected to all your other tools Readying open repo update in a week or two, give it a try and give feedback!

35

326

38

117

31K

dango233max retweeted

Intelligent Internet @ii_posts

4 months ago

II-Agent V1 is here. The AI agent built for real work is finally out of beta. Faster, smarter, and production-ready. It’s time to change how you build. 👇 Let’s see what’s new.

21

214

47

110

132K

Dango233 @dango233max

5 months ago

@bboczeng 在AI paper里面用manifold谈不上装hhhh

0

1

0

1

373

Dango233 @dango233max

5 months ago

锐评：挂流形的羊头，卖运筹学的狗肉，论文命名的反向工程，给工程解法找理论爹千万别被“流形”这个词骗了。说是从流形理论推导的，我敢打赌这绝对是从运筹学“倒着来”的，想从微分几何去理解是南辕北辙。我工业工程的DNA动了，怪不得这么多人“看不懂”。说是指派问题我的IE同学们是不是能看懂？

alphaXiv

@askalphaxiv

5 months ago

DeepSeek just dropped a banger paper to wrap up 2025 "mHC: Manifold-Constrained Hyper-Connections" Hyper-Connections turn the single residual “highway” in transformers into n parallel lanes, and each layer learns how to shuffle and share signal between lanes. But if each layer can arbitrarily amplify or shrink lanes, the product of those shuffles across depth makes signals/gradients blow up or fade out. So they force each shuffle to be mass-conserving: a doubly stochastic matrix (nonnegative, every row/column sums to 1). Each layer can only redistribute signal across lanes, not create or destroy it, so the deep skip-path stays stable while features still mix! with n=4 it adds ~6.7% training time, but cuts final loss by ~0.02, and keeps worst-case backward gain ~1.6 (vs ~3000 without the constraint), with consistent benchmark wins across the board