Joe Wu @joe623 - Twitter Profile

joe623 retweeted

cogsec

@affaan

5 months ago

https://t.co/s9QK7Xdoaz

42

2K

245

4K

505K

joe623 retweeted

Andrej Karpathy

@karpathy

8 months ago

I quite like the new DeepSeek-OCR paper. It's a good OCR model (maybe a bit worse than dots), and yes data collection etc., but anyway it doesn't matter. The more interesting part for me (esp as a computer vision at heart who is temporarily masquerading as a natural language person) is whether pixels are better inputs to LLMs than text. Whether text tokens are wasteful and just terrible, at the input. Maybe it makes more sense that all inputs to LLMs should only ever be images. Even if you happen to have pure text input, maybe you'd prefer to render it and then feed that in: - more information compression (see paper) => shorter context windows, more efficiency - significantly more general information stream => not just text, but e.g. bold text, colored text, arbitrary images. - input can now be processed with bidirectional attention easily and as default, not autoregressive attention - a lot more powerful. - delete the tokenizer (at the input)!! I already ranted about how much I dislike the tokenizer. Tokenizers are ugly, separate, not end-to-end stage. It "imports" all the ugliness of Unicode, byte encodings, it inherits a lot of historical baggage, security/jailbreak risk (e.g. continuation bytes). It makes two characters that look identical to the eye look as two completely different tokens internally in the network. A smiling emoji looks like a weird token, not an... actual smiling face, pixels and all, and all the transfer learning that brings along. The tokenizer must go. OCR is just one of many useful vision -> text tasks. And text -> text tasks can be made to be vision ->text tasks. Not vice versa. So many the User message is images, but the decoder (the Assistant response) remains text. It's a lot less obvious how to output pixels realistically... or if you'd want to. Now I have to also fight the urge to side quest an image-input-only version of nanochat...

558

13K

2K

7K

3M

joe623 retweeted

Unwind AI

@unwind_ai_

10 months ago

These 10 MCP servers are almost all you'll ever need. Curated after months of building. 1. DeepGraph MCP turns code repos into interactive knowledge graphs. Semantically search code functionalities, analyze dependencies, and explore direct relationships. 100% open-source.

13

364

67

759

37K

joe623 retweeted

Jeff Weinstein

@jeff_weinstein

about 1 year ago

Dearest MCP developers, You can now monetize your MCP in a few lines of code with @stripe, available today. 💸 - Bill subscriptions or usage-based - Works with any client - Supports @Cloudflare's Agents SDK (more to come) MCP, a new AI-native customer channel. Get started. ⤵️

jeff_weinstein's tweet photo. Dearest MCP developers,

You can now monetize your MCP in a few lines of code with @stripe, available today. 💸

- Bill subscriptions or usage-based
- Works with any client
- Supports @Cloudflare's Agents SDK (more to come)

MCP, a new AI-native customer channel. Get started. ⤵️ https://t.co/CB2BQZTSpI

39

2K

170

2K

388K

Who to follow

Laura Liu🎒

@Laura6liu

🎒｜🇹🇼 Founder of @grenadetw

Tom Chen

@chungtingc

Founder and CEO of Fugu Fish Creations, band leader of 21 樂團 (21 Band). Also known as "yychen" a long time ago.

DEFTeam Solutions

@DEFTeamSolution

Providing End-to-end Open Source Business Intelligence And Predictive Analytics Solutions On - Premise & On - Cloud Contact [email protected] for more details

joe623 retweeted

Deedy

@deedydas

about 1 year ago

🚨Viral rumors of DeepSeek R2 leaked! —1.2T param, 78B active, hybrid MoE —97.3% cheaper than GPT 4o ($0.07/M in, $0.27/M out) —5.2PB training data. 89.7% on C-Eval2.0 —Better vision. 92.4% on COCO —82% utilization in Huawei Ascend 910B Big shift away from US supply chain.

deedydas's tweet photo. 🚨Viral rumors of DeepSeek R2 leaked!

—1.2T param, 78B active, hybrid MoE
—97.3% cheaper than GPT 4o ($0.07/M in, $0.27/M out)
—5.2PB training data. 89.7% on C-Eval2.0
—Better vision. 92.4% on COCO
—82% utilization in Huawei Ascend 910B

Big shift away from US supply chain. https://t.co/Jncg0PvEYU

114

2K

270

776

658K

joe623 retweeted

Alex Xu

@alexxubyte

about 1 year ago

Top 6 Tools to Turn Code into Beautiful Diagrams

10

1K

218

1K

147K

joe623 retweeted

Aravind Srinivas

@AravSrinivas

about 1 year ago

What did we get done this week at Perplexity? 1. Fact-Check any part of the answer with sources: Pick any part of the answer you want fine-grained sources for, or think there's a potential hallucination, and fact-check further.

41

944

44

287

175K

joe623 retweeted

Lovis Odin

@OdinLovis

over 1 year ago

MCP Claude that have full control on ChatGPT 4o to generate full storyboard in Ghibli style ! All automatic I am doing nothing at all, we live a pretty crazy time @AnthropicAI @OpenAI

61

3K

445

4K

404K

joe623 retweeted

xAI

@xai

over 1 year ago

This is it: The world’s smartest AI, Grok 3, now available for free (until our servers melt). Try Grok 3 now: https://t.co/Tj0afLoxEz X Premium+ and SuperGrok users will have increased access to Grok 3, in addition to early access to advanced features like Voice Mode

4K

36K

6K

5K

43M

joe623 retweeted

David

@dzhng

over 1 year ago

Introducing deep-research - my own open source implementation of OpenAI's new Deep Research agent. Get the same capability without paying $200. You can even tweak the behavior of the agent with adjustable breadth and depth. Run it for 5 min or 5 hours, it'll auto adjust.

140

4K

435

6K

522K

joe623 retweeted

Mckay Wrigley

@mckaywrigley

over 1 year ago

A great use case for OpenAI Deep Research is a 1-stop daily news report. Prompt it with: - General rules - Personal bio - Your interests - Preferred sources It’ll generate a comprehensive news report 100% customized to you. This is how I’ll get my news now. Full prompt below.

75

2K

154

3K

256K

joe623 retweeted

elvis

@omarsar0

over 1 year ago

🎓OpenAI Deep Research Guide Just finished our live webinar on Deep Research, including examples, prompting tips, use cases, and what's missing. I am releasing the full guide I shared with our members (link in the comments).

omarsar0's tweet photo. 🎓OpenAI Deep Research Guide

Just finished our live webinar on Deep Research, including examples, prompting tips, use cases, and what's missing.

I am releasing the full guide I shared with our members (link in the comments). https://t.co/3e0OtrcL0N

17

256

51

358

34K

joe623 retweeted

Ruud van der Linden

@RuudNL

over 1 year ago

Sora v2 release is impending: * 1-minute video outputs * text-to-video * text+image-to-video * text+video-to-video OpenAI's Chad Nelson showed this at the C21Media Keynote in London. And he said we will see it very very soon, as @sama has foreshadowed.

98

2K

324

736

502K

joe623 retweeted

AI Will @FinanceYF5

almost 2 years ago

这个人帮Uber获得了最初的1亿用户。然后推动了Tinder、Dropbox和Zynga的爆炸式增长。他刚刚说：“创作者经济是创新的新前沿。” Andrew Chen 对未来财富创造的5个预测（以及你为什么应该关心）：

8

534

150

715

226K

joe623 retweeted

Star@Day1Global Podcast

@starzq

almost 2 years ago

https://t.co/D8cEcPNPDb

20

795

239

1K

229K

joe623 retweeted

Thomas Wolf

@Thom_Wolf

over 2 years ago

You likely missed it if you only follow ML Twitter but there's a series of mind-blowing tech reports and open-source models coming from China (DeepSeek, MiniCPM, UltraFeedback...) with so much lesson learned and experiments openly shared together with models, data, etc This level of candid sharing of knowledge and insights is something we've lost in most recent western tech models releases and reports (with the noticeable exception of a few places like the recent AllenAI OLMO release) Just take a look for instance at these two fresh examples published in the past few days: - the new MiniCPM blog (amazing super small model – deep dive in the experiments): https://t.co/mheQjCkiYW - the new DeepSeek Math paper archieving over 60% on MATH: https://t.co/MyVKdj9Dsw

25

531

89

384

112K

joe623 retweeted

宝玉

@dotey

over 2 years ago

年底最值得一读的 RAG 论文：《Retrieval-Augmented Generation for Large Language Models: A Survey | 面向大语言模型的检索增强生成技术：调查 [译]》摘要：在这篇调查中，我们关注的是面向大语言模型的检索增强生成技术。这项技术通过结合检索机制，增强了大语言模型在处理复杂查询和生成更准确信息方面的能力。我们从同济大学和复旦大学的相关研究团队出发，综合分析了该领域的最新进展和未来趋势。校对中难免有疏漏指出，有翻译错误请指出！ https://t.co/uMmnAbPvHx

dotey's tweet photo. 年底最值得一读的 RAG 论文：
《Retrieval-Augmented Generation for Large Language Models: A Survey | 面向大语言模型的检索增强生成技术：调查 [译]》

摘要：

在这篇调查中，我们关注的是面向大语言模型的检索增强生成技术。这项技术通过结合检索机制，增强了大语言模型在处理复杂查询和生成更准确信息方面的能力。我们从同济大学和复旦大学的相关研究团队出发，综合分析了该领域的最新进展和未来趋势。

校对中难免有疏漏指出，有翻译错误请指出！

https://t.co/uMmnAbPvHx

12

600

207

578

120K

joe623 retweeted

little shrimp🐳

@c_littleshrimp

almost 3 years ago

币安IEO的 $Arkm, 类似nansen的链上数据分析工具，看起来数据清洗和追踪都更到位，白名单邀请机制。产品体验链接https://t.co/hWwEOnlm2l 代币经济模型，算是高度控盘了。简简单单，没有套路，上交易所后的第一年因为团队和VC的币还没解锁，流通筹码不多，嗯，适合炒作。工具里第一个发币的吧