Alfonso² Peterssen

about 2 months ago

Over the Easter weekend, I got Gemma 4 running in pure Java. No Python. No JNI. No native code. Just the JVM. https://t.co/HlwpGqUzFX

about 2 months ago

@jerrinot RAM is abundant and attention is only computationally expensive at very long contexts. The speedup is minor and, considering the capability loss, it's not worth it IMHO (for small models). Currently the KV cache is stored using float16, which is already tight, memory-wise.
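
For a rough sense of scale behind the float16 KV cache mentioned in this reply, here is a minimal sketch of the usual size estimate. It assumes a plain transformer in which every layer caches keys and values for the full context; models with sliding-window layers need less. The class name, method name, and model shape below are hypothetical, chosen only for illustration, and are not taken from the repository.

```java
public final class KvCacheEstimate {

    /** Bytes needed to cache keys and values for all layers over the full context. */
    public static long kvCacheBytes(int layers, int kvHeads, int headDim,
                                    int contextLength, int bytesPerElement) {
        // Factor 2: one tensor for keys and one for values per layer.
        // Starting with a long literal keeps the whole product in 64-bit arithmetic.
        return 2L * layers * kvHeads * headDim * contextLength * bytesPerElement;
    }

    public static void main(String[] args) {
        // Hypothetical model shape (assumption, not a real model's configuration).
        int layers = 32, kvHeads = 8, headDim = 128, context = 8192;

        long f16 = kvCacheBytes(layers, kvHeads, headDim, context, 2); // float16: 2 bytes
        long f32 = kvCacheBytes(layers, kvHeads, headDim, context, 4); // float32: 4 bytes

        // Prints "float16: 1024 MiB, float32: 2048 MiB"
        System.out.printf("float16: %d MiB, float32: %d MiB%n", f16 >> 20, f32 >> 20);
    }
}
```

Under these assumptions the cache grows linearly with context length, so halving the element size halves the footprint at any given context.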

PL/VM/Compilers at Oracle. Working on Truffle/Graal. ME/CFS Caretaker and Activist (https://t.co/gWxW16A5Op). Austrian expat in Zurich. https://t.co/piKOfYIYCL

about 2 months ago

@epragt Performance is very competitive for pure Java, also with GraalVM's Native Image. The repository links to implementations of Llama 3 (original), Qwen 3.5, OpenAI's gpt-oss, and NVIDIA's Nemotron 3 models, all in pure Java.

TheMukel retweeted

Erik Pragt

@epragt

about 2 months ago

This is absolutely amazing. And decent performance as well!

TheMukel retweeted

about 2 months ago

🤩 @TheMukel has been cooking... 👇

TheMukel retweeted

Devoxx @Devoxx

about 2 months ago

🚀 Gemma4 Java runner: • Single file, no deps • E2B→31B + MoE • GGUF + quant (F16→Q8) • Vector API ⚡ • CLI + thinking modes • GraalVM native + instant TTFT Pure Java. No excuses 😏 https://t.co/Som5AnBcqh by @TheMukel ☕️🔥

Michalis Papadimitriou @mikepapadim

2 months ago

@__tinygrad__ The operators serve hyper-specialized implementations of each model. How good is tinygrad at fusing high-level ops? Even with some advanced compiler magic, the hand-tuned kernels with nit-picked fusions are hard to beat. It's a pristine model blueprint vs. a tuned Franken-model.

TheMukel retweeted

Аlina Yurenko 🇺🇦

@alina_yurenko

8 months ago

My @GraalVM Native Image deep dive recording is already up: https://t.co/DF5f9HjtVg 🐰🚀 It includes the very public first demo of project Crema, Open World for Native Image, at 2:19:54 😅 Thank you, @Devoxx! All demos and notes are here: https://t.co/PdEESuT6PP

TheMukel retweeted

about 1 year ago

https://t.co/pEHpA4oprX is out! Great effort by the @tornadovm team to bring GPU-enabled inference to the JVM

TheMukel retweeted

Josh Long

@starbuxman

about 1 year ago

run an LLM with a supercharged engine powered by Java and GraalVM (ht @alina_yurenko ) https://t.co/g1gr8nlEPL

TheMukel retweeted

Fabio Niephaus @fniephaus

over 1 year ago

Looking forward to speaking tomorrow at @VoxxedCERN together with @TheMukel followed by delivering both a keynote and a regular talk at @VoxxedTicino on Friday! 🤩🎤#FunDaysAhead #AAP #Java

TheMukel retweeted

over 1 year ago

We just merged the current status of the upcoming JDWP support for @GraalVM Native Image! 🥳 This will soon provide developers with the same debugging experience they are used to in Java, but for native images! Stay tuned for more details. https://t.co/UmNLnaLns9

over 1 year ago

https://t.co/ne9OtJjWfi Graal compiler: +10% faster inference with the latest early access build. New features: batched prompt processing & AVX512 support.

TheMukel retweeted

over 1 year ago

As a result I can now use @DevoxxGenie with a pure Java Arm Inference engine running locally on my mac using Llama 3.2 🍏🔥

TheMukel retweeted

Johan Hutting @JohanHutting

over 1 year ago

Modern @Java project: a Spring Boot wrapper for https://t.co/k6xDFNGg6W from @TheMukel supporting OpenAI Chat Completion REST requests 🔥 https://t.co/1cMic9PQpn #OpenAI #SpringBoot

TheMukel retweeted

over 1 year ago

Earlier today I was asked if Java AI integration has improved yet, or if we'd still need to rely on Python or C bindings. Was happy to share https://t.co/RuRcoxbHRm by @TheMukel from the GraalVM team, running natively in Java without any dependencies and with superior performance!

TheMukel retweeted

Jörg Wille @FilterPunk

over 1 year ago

@tjake For me your #Devoxx talk about https://t.co/fWmo5GIyBh and the one from @TheMukel about https://t.co/LXtLab5BAp were the most relevant talks. Thanks for all the background information - I have learned a lot!

TheMukel retweeted

over 1 year ago

Just made the first-ever @DevoxxGenie LLM inference using ONLY @Java, powered by the awesome #Jlama project! ☕🔥 Huge thanks to @tjake for making it happen! 💪🏼#NoPython #JavaAI
