Jiahao Chen @GeorgeChen92 - Twitter Profile

about 1 hour ago

@garrytan Garry, you are too optimistic. From what I've gathered from my US and German colleagues, applying AI in healthcare will still take quite some time to get that level of boost.

0

9

Jiahao Chen @GeorgeChen92

about 1 hour ago

@jun_song Understanding the vision is one thing, and having taste is quite another. I do see much better results with a combination of certain models and the taste skill from @LexnLin. Most capable models just need a bit of a kick.

0

18

Jiahao Chen @GeorgeChen92

1 day ago

@MiaAI_lab @u1tra_instinct Cool. What's the total tok/s we can get from the 12 concurrent sessions? Can't wait to try this when the 2 Sparks arrive in a month.

1

0

221

Jiahao Chen @GeorgeChen92

2 days ago

@jerryjliu0 Is this retrieval harness also in the github repo or only on the cloud API?

0

8

Jiahao Chen @GeorgeChen92

3 days ago

@sachindetrax @MiaAI_lab @UnslothAI @NVIDIAAI Use the Qwen3.6-35B-A3B Q4 quant. Splitting it across 16GB VRAM and 16GB CPU RAM is okay with MTP enabled. Speed will take a hit: expect 20~40 tok/s generation with one session, depending on your GPU (I assume a 16GB VRAM card is an older GPU).

1

3

0

73

Jiahao Chen @GeorgeChen92

4 days ago

@quxiaoyin 🤣 I don't know about Anthropic, but Liang's company is trading in Chinese markets so... I guess less competition there.

0

1K

Jiahao Chen @GeorgeChen92

12 days ago

@TheAhmadOsman That's true. If I get an offer of similar pay and that bonus, I'd take it on the spot.

0

36

Jiahao Chen @GeorgeChen92

12 days ago

@TheAhmadOsman One sensible way to do the math is the way IT assets are depreciated in accounting: 5 years to zero, and free milk afterwards. Individuals can fire up old PCs, buy used GPUs and start small. I'm already running local models on a 5-year-old company workstation, so it's literally free.

0

33

Jiahao Chen @GeorgeChen92

12 days ago

Also, obra's superpowers skills are amazing. Learning how to develop against the Obsidian API and the publishing procedure took some time, but it was really interesting to use coding agents to format the code to avoid plugin review warnings and such.

0

7

Jiahao Chen @GeorgeChen92

12 days ago

Built my first Obsidian plugin with pi agent and Qwen3.6-35B-A3B Q4KM UD served with llama.cpp. This is also my first project developed purely with a local model & vibe coding (I don't even know the most basic java/ts). Check it out: https://t.co/h3CCmO7fHN

1

0

12

Jiahao Chen @GeorgeChen92

17 days ago

@s0me1suspicious @catalinmpit The thing is, these are company devices, so I can't take them apart and put the 2 GPUs into one build, hence the separate setup.

0

1

Jiahao Chen @GeorgeChen92

19 days ago

@hxiao Local inference is for sure not there yet, but for most users who don't run that many agents on longer tasks, local models can handle most of the work. With recent developments with Fable, I've started to shift some of my work to local models. This shift is going to start for most folks.

0

1

0

69

Jiahao Chen @GeorgeChen92

22 days ago

@TheAhmadOsman Hello Ahmad, today I used only a local model to develop a plugin for Obsidian. This is the first project I developed with only local models and I'm sure it won't be the last. It took me a couple of weeks' spare time to get it up and running. Thanks for the inspiration.

0

34

Jiahao Chen @GeorgeChen92

27 days ago

@TheAhmadOsman Felt this on Jun 1st when our company's GitHub Copilot switched to usage-based billing. Looking for an excuse to ask my boss to buy a GPU for local AI work.

0

26

Jiahao Chen @GeorgeChen92

about 1 month ago

@TheAhmadOsman Our company provides a GitHub Copilot subscription. It switched to usage-based billing, and I burnt through half of the monthly credits on the first day. I don't think it's sustainable without a local model at this point, or a cheap Chinese API.

0

26

Jiahao Chen @GeorgeChen92

about 1 month ago

@dunik_7 After two days of tinkering with Ollama, I now frown every time I see the word. It really is a beginner's dream, but only that.

0

84

Jiahao Chen @GeorgeChen92

about 1 month ago

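@mingchikuo It's a chicken-and-egg problem, I guess. Waiting for Microsoft to do it is too slow; better to max out the VRAM and let SaaS companies do the integration themselves.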

0

353

Jiahao Chen @GeorgeChen92

about 1 month ago

@ivanfioravanti It's too inconvenient to get things going. My colleagues are asking me for help with hermes installation & local model setup.

0

7

Jiahao Chen @GeorgeChen92

about 1 month ago

@natolambert @Zai_org The main issue is the lack of compute. It's hard for them to balance training new models against selling more coding plans. DeepSeek is in a much better situation because of their effort to train and build infra on Huawei chips.

0

13

