MaxGraey 🌻 🇺🇦 @MaxGraey - Twitter Profile

MaxGraey 🌻 🇺🇦 @MaxGraey

31 minutes ago

@jarredsumner Check NUMA-aware actor model for runtimes: https://t.co/9os5KyX2EZ

0

42

MaxGraey 🌻 🇺🇦 @MaxGraey

1 day ago

@vldm_fake @ChShersh yeah, it's more correct what is std::unreachable in C++. It's basically a wrapper over __builtin_unreachable

0

12

MaxGraey 🌻 🇺🇦 @MaxGraey

2 days ago

@ChShersh At least it's not UB in debug

0

39

MaxGraey 🌻 🇺🇦 @MaxGraey

2 days ago

@ChShersh unreachable is not UB. It's actually opposite of UB. And many languages have this in std

2

0

153

Who to follow

Wasmer

@wasmerio

The Universal WebAssembly Runtime. We ❤️ Open Source

wasmCloud

@wasmcloud

Incubating CNCF Project. Build, manage, and scale polyglot apps across any cloud, K8s, or edge. Join us on Bluesky: https://t.co/lzXzKZYaao

Cosmonic

@cosmonic

Cosmonic Control. A powerful control plane for managing distributed apps across #cloud, #K8s, #edge Cosmonic on Bluesky: https://t.co/CSwcThtAXa

MaxGraey 🌻 🇺🇦 @MaxGraey

4 days ago

Remember every company that massively laid off their best devs for AI. When they inevitably start hiring again, just pass by. If they did it once, they'll do it again when AI becomes more profitable!

0

37

MaxGraey 🌻 🇺🇦 @MaxGraey

6 days ago

@doucommunity Uber звільнив 4000 інженерів, а потім спалив $3.5млрд за 4 місяці на токенах

1

21

0

894

MaxGraey 🌻 🇺🇦 @MaxGraey

9 days ago

@superaiwatcher I've already explained why this is all nonsense. LLMs can hallucinate and engage in reward hacking for the tests themselves. Can you share some research papers that address all these issues? Why are you so sure about all this?

0

8

MaxGraey 🌻 🇺🇦 @MaxGraey

10 days ago

Updated LLM code benchmark. For perplexity score only PPL (geometric mean branching factor per token which equal to 2^(total_bits / N)) used. #Typescript #Go #Rust #Zig #Haskell #Closure #Python

MaxGraey's tweet photo. Updated LLM code benchmark. For perplexity score only PPL (geometric mean branching factor per token which equal to 2^(total_bits / N)) used.

#Typescript #Go #Rust #Zig #Haskell #Closure #Python https://t.co/bwBlcBo0uw

3

2

0

1

230

MaxGraey 🌻 🇺🇦 @MaxGraey

10 days ago

@superaiwatcher PPL is still important. Lower PPL often correlates with a stronger underlying language model & indirectly associated with reduced compute requirements to achieve a given level of quality. But here we measure how efficiently it handles the input, not the model itself.

0

16

MaxGraey 🌻 🇺🇦 @MaxGraey

10 days ago

@superaiwatcher Check out AlphaCode 2 (or AlphaCodium). Google decided not to continue developing in this direction. It's a dead end. Maybe I'm just not aware of it? Are there any scientific papers? Or some promising research projects? But without marketing bullshit

1

0

15

MaxGraey 🌻 🇺🇦 @MaxGraey

10 days ago

@superaiwatcher Also, current LLM architectures does not possess a world model and is not capable of semantic reasoning or computation in the way SMT does this. All of this leads to reward hacking, meaning the model simply minimizes the loss function while leaving the actual objective behind.

0

23

MaxGraey 🌻 🇺🇦 @MaxGraey

10 days ago

@superaiwatcher I've been hearing about EBFC in LLMs for around 6 years already. The problem is that it covers examples rather than behavior across inputs. If a program passes a 1M tests that doesn't mean it won't fail on test 1M + 1. The same issue with fuzzing tests

3

0

36

MaxGraey 🌻 🇺🇦 @MaxGraey

10 days ago

@Shirmanov https://t.co/jKL1yaV4Is

MaxGraey 🌻 🇺🇦 @MaxGraey

10 days ago

Updated LLM code benchmark. For perplexity score only PPL (geometric mean branching factor per token which equal to 2^(total_bits / N)) used. #Typescript #Go #Rust #Zig #Haskell #Closure #Python

3

2

0

1

230

0

2

MaxGraey 🌻 🇺🇦 @MaxGraey

12 days ago

So now #Go and #Python (with types) have joined #Rust, #Zig, and #TypeScript. Also, besides counting tokens, I'm now tracking perplexity score as well Bench, sources, methodology: https://t.co/blPDz9Rlwt #LLM #CodeGen

MaxGraey's tweet photo. So now #Go and #Python (with types) have joined #Rust, #Zig, and #TypeScript. Also, besides counting tokens, I'm now tracking perplexity score as well

Bench, sources, methodology: https://t.co/blPDz9Rlwt

#LLM #CodeGen https://t.co/ZrzAA89l9E

4

7

2

3

2K

MaxGraey 🌻 🇺🇦 @MaxGraey

10 days ago

@golang, #Python (/w types), @rustlang, #Zig, @typescript. And now #Haskell and #Closure. The benchmark now measures only ppl for the perplexity score Results, sources, methodology: https://t.co/blPDz9Rlwt

MaxGraey's tweet photo. @golang, #Python (/w types), @rustlang, #Zig, @typescript. And now #Haskell and #Closure.

The benchmark now measures only ppl for the perplexity score

Results, sources, methodology: https://t.co/blPDz9Rlwt https://t.co/KqfdLRgjRV

0

82

MaxGraey 🌻 🇺🇦 @MaxGraey

10 days ago

@Shirmanov I'm thinking about to add Haskell and Clojure

0

1

0

36

MaxGraey 🌻 🇺🇦 @MaxGraey

11 days ago

@mountain_coding PRs are welcomed! Or you can add everything locally and run the tests. It’s not difficult, there’s a script to download model which will run locally in 5–7 minutes on the average machine, since it only run prefill and not full inferring.

0

1

0

65

MaxGraey 🌻 🇺🇦 @MaxGraey

11 days ago

@ctatedev Token count is far from the most important metric. Besides perplexity, it also matters how good the standard library is and whether it can cover and simplify most common user patterns. For example Go is the best in this regard https://t.co/B65EDQkm9j

MaxGraey 🌻 🇺🇦 @MaxGraey

12 days ago

So now #Go and #Python (with types) have joined #Rust, #Zig, and #TypeScript. Also, besides counting tokens, I'm now tracking perplexity score as well Bench, sources, methodology: https://t.co/blPDz9Rlwt #LLM #CodeGen

4

7

2

3

2K

1

5

0

1

2K

MaxGraey 🌻 🇺🇦 @MaxGraey

12 days ago

Perplexity was calculated using Qwen2.5-Coder-3B Q5_K_M with Q8 KV cache and ~8k context window.

0

1

0

194

MaxGraey 🌻 🇺🇦

@MaxGraey

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users