Everlier

Verified account

@Everlier

Building LLM agents & tools - Harbor / Facts / Mi / Skilled @tryjitera

Joined April 2010

456 Following

1.6K Followers

10.1K Posts

Pinned Tweet

3 months ago

You don't even need Kimi 2.5 for a decent local LLM setup. - llama.cpp - Unsloth's Qwen 3.5 35B A3B with UD Q4 K XL quants - OpenCode - av/harbor It'll take a while to download/install, but otherwise it's something that mid-range hardware (>32GB RAM, ~8GB VRAM) can run today.

Everlier's tweet photo. You don't even need Kimi 2.5 for a decent local LLM setup.

- llama.cpp
- Unsloth's Qwen 3.5 35B A3B with UD Q4 K XL quants
- OpenCode
- av/harbor

It'll take a while to download/install, but otherwise it's something that mid-range hardware (>32GB RAM, ~8GB VRAM) can run today. https://t.co/fJfLdKcVDA

3 months ago

was messing with the OpenAI base URL in Cursor and caught this accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast so composer 2 is just Kimi K2.5 with RL at least rename the model ID

fynnso's tweet photo. was messing with the OpenAI base URL in Cursor and caught this

accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast

so composer 2 is just Kimi K2.5 with RL
at least rename the model ID https://t.co/fyUWbo1InF

281

7K

470

2K

3M

14

109

6

57

16K

11 minutes ago

@thsottiaux Centroids at OpenAI are finally converging :)

Everlier's tweet photo. @thsottiaux Centroids at OpenAI are finally converging :) https://t.co/LT3GaaQozI

0

0

0

0

158

15 minutes ago

@Mayhem4Markets Agreed, I was really hoping that when they release Gemini 3.5 it'll give enough "headroom" for a larger Gemma to avoid overlap in the capability, still hoping :)

0

0

0

0

3

16 minutes ago

@Polymarket Wait, so for a company with ~10-20% profit margin, an extra ~30% productivity costs are unsustainable?

0

0

0

0

5

Who to follow

Canary Care is helping people live independently at home with smart, unobtrusive monitoring technology - delivering care when you can’t be there.

19 minutes ago

@ajaxdavis I think your understanding is spot on! Agentic CLIs should return a little extra information/instructions for the agents where it gives more context about the task at hand

0

0

0

0

6

21 minutes ago

@melvynx I think there's little reason not to optimise for the agents as the main users right away Splitting in various skills helps with progressive disclosure a bit, since showing everything all at once might be a bit too much

0

0

0

0

4

22 minutes ago

Maybe it's a mix of things: - I'm running the XL quant, it's ~10% larger - llama.cpp version differences + Vulcan vs. ROCm differences - this was in a container Anyways, I think Strix Halo's sweet spot are models with 3-4B active parameters per decoding pass, so this model is a tad outside of that. I think Qwen with MTP gives same TPS, but with benefits of a larger model. This one is great for 12GB-16GB GPUs though

0

0

0

0

3

about 11 hours ago

model: Gemma 4 12B, Q8 XL quant from Unsloth, hardware: Strix Halo APU, Ryzen AI 395+, 128GB speed: 2033 tokens in 3min 24s, 9.93 t/s, llamacpp version: 8738 (d6f303004) args: llama-server --no-mmap -dio -ngl 99 -np 1 --kv-unified neat.

Everlier's tweet photo. model:
Gemma 4 12B, Q8 XL quant from Unsloth,

hardware:
Strix Halo APU, Ryzen AI 395+, 128GB

speed:
2033 tokens in 3min 24s, 9.93 t/s,

llamacpp version:
8738 (d6f303004)

args:
llama-server --no-mmap -dio -ngl 99 -np 1 --kv-unified

neat. https://t.co/geQn57AC8u

1

2

1

1

174

27 minutes ago

@andreafspeziale @mattpocockuk Yes, remote workspaces can also solve this same problem! To be honest, I think that's how most development should happen this day, with a thin client, because of all the risks associated with agentic tooling

0

0

0

0

9

29 minutes ago

@amsalemadir @resend I think agentic use will be the main type of use for many of such tools, so optimising for it is increasingly important, especially for new tools that are not present in the training data at all

0

0

0

0

5

31 minutes ago

@crack3nnn It's not "the best coding model" or "the fastest coding model", but I found that it hits the sweet spot on cost/speed/quality, worth a try if you already have it available

1

1

0

0

4

about 17 hours ago

I've only started using it recently, but Grok Composer 2.5 took top spot as my most used model in the last few days. It's not Opus, but it's not far off. It adheres to instructions much better than Claude, more similar to GPTs in this aspect. It's quick. So, I do slow planning/exploratory sessions with Opus 4.6, and then I let Composer 2.5 execute on those plans. It's been pretty pleasant.

Everlier's tweet photo. I've only started using it recently, but Grok Composer 2.5 took top spot as my most used model in the last few days.

It's not Opus, but it's not far off.
It adheres to instructions much better than Claude, more similar to GPTs in this aspect.
It's quick.

So, I do slow planning/exploratory sessions with Opus 4.6, and then I let Composer 2.5 execute on those plans. It's been pretty pleasant.

1

0

0

0

125

32 minutes ago

@hitsmaxft Yes, absolutely! A CLI can shape and guide how agent is using it, by providing extra instructions or higlighting external state, there's a lot of extras to provide better agentic experience

0

0

0

0

6

about 15 hours ago

If you're building a CLI of any kind, add a section for agents to the help, also add a command for agents to load skill contents related to the usage of the CLI. agent-browser is a great example and I think more tools should follow the same template.

Everlier's tweet photo. If you're building a CLI of any kind, add a section for agents to the help, also add a command for agents to load skill contents related to the usage of the CLI.

agent-browser is a great example and I think more tools should follow the same template. https://t.co/vo5FYdGR2v

4

145

8

232

39K

34 minutes ago

@c0mm0n_dev_us3r Great! For agentic experience it also helps to provide some extra context alongside the command outputs, it often means a difference between a successful and a failed task

0

0

0

0

11

36 minutes ago

@dibstern @mattpocockuk Gateway means that it's a proxy, a passing point, not where the inference happens on its own. Our gateway specifically has full agentic loop, so yes, it matches a definition of a harness, it's essentially a fully featured cloud agent with OpenAI/Anthropic compatible API.

0

0

0

0

17

40 minutes ago

@9hills Arguably, Agentic Experience might become even more important soon for anything that user might not intereact with directly

0

1

0

0

9

41 minutes ago

@kaspar_winston @wesbos What do you think about this one? :) https://t.co/js7aor9KmL

about 2 months ago

I'm in awe, there's a pocket dimension of style and drip in LLMs, you just need to discover it It's so bad that it comes from the other side as being good. This over colored border-left any day of the week.

1

5

0

0

371

0

0

0

0

41

42 minutes ago

@doug__is Yes, for anything not widely present in the training data, progressive disclosure like this is the only way

0

0

0

0

4

43 minutes ago

@Michaelzy_dev @wesbos Yeah, I've seen it one too many times as well

0

0

0

0

19

43 minutes ago

@ellafella5 @wesbos There were definitely more below the fold :)

0

0

0

0

21

44 minutes ago

@wisdom_i_am @wesbos ⚡️⚡️⚡️

0

0

0

0

20

44 minutes ago

@fred_neck @USronaldcarter Everything that is not real can disappear

0

0

0

0

2

about 1 hour ago

@davidmytton Yes, as well as enriching output a bit for agents, even a single status/state line might often help agent to stay on track with what they are doing, ack/nack or explaining effects of the command. I think good AX really helps

0

0

0

0

7

Last Seen Users on Sotwe

Trends for you

Most Popular Users