Alex Kuleshov

Verified account

@0xAX

Software developer. Posting about things that I've met during reading source code of different systems. Author of linux-insides.

Joined October 2012

172 Following

11.3K Followers

8.2K Posts

Pinned Tweet

21 days ago

I’m considering my next career step and would be happy to hear about interesting remote backend/infra roles. My background is mostly in Elixir/Erlang/OTP, distributed backend systems, Kubernetes, observability, and telecom/AAA platforms. I’m especially interested in technically strong teams working on infrastructure, observability, databases, Kubernetes, or distributed systems. I’m based in GMT+5 and open to working with teams across compatible time zones. If your team is hiring for something similar, feel free to DM me or point me to the right person.

1

18

6

7

4K

about 3 hours ago

@damian_b Of course

0

0

0

0

6

about 3 hours ago

The first thing I will try tomorrow morning!

Damian Barabonkov

about 5 hours ago

"Introducing 𝚍𝚎𝚑𝚞𝚋, the all-in-one GitHub TUI." I was fed up with how laggy the GitHub UI was. So I decided to combine existing tools + my own changes into one unified TUI. Manage Pull Requests, Actions, Issues, view Diffs feature rich and more! https://t.co/HuTGrZG6FC

7

40

5

24

2K

1

3

0

0

402

about 5 hours ago

@josevalim Congratulations! Thank you for you work Jose

0

1

0

0

942

Who to follow

Securing every bit of your data https://t.co/hqdd8jMkYM https://t.co/GOXPtukIXE

Verified account

I'm also @[email protected]

Verified account

REcon: Annual reverse engineering and security conference held in Montreal.

0xAX retweeted

about 5 hours ago

Elixir v1.20 released! Now officially a gradually typed language: Elixir type checks every single line of code, finding bugs and dead code, without developer overhead (no typing signatures) and extremely low false positives rate. Plus a faster compiler! Links and reports below.

31

744

170

53

30K

about 11 hours ago

I finally finished the initial version of a new home for my Linux Inside series: https://t.co/IsiURZwi56 In the meantime, I will slowly continue revisiting and updating the old chapters for modern kernels

3

95

9

85

3K

1 day ago

Every time I use uv, I have the same question: how did it happen that something like this appeared so late in the Python ecosystem?

0

6

0

1

875

0xAX retweeted

Damian Barabonkov

1 day ago

My Take: it is OK to vibe production code. In my prod repos, I just have a 0-unvetted code policy. Vibed code is ok, as long as an engineer has reasoned about it. Maybe as models get stronger, this requirement will loosen. But for current SOTA coding models, not yet.

4

24

2

0

2K

1 day ago

This is a nice example of branch prediction, but be careful to apply it in practice. Sorting can make the loop faster. But if you only need to run it once, sorting may take far longer than the original loop over the pre-optimized data. Always measure the entire operation, not just the part you optimized.

1 day ago

sorting your array before the loop makes it 6x faster. same data. same algorithm. the CPU just stopped guessing wrong. a mispredicted branch flushes the entire pipeline. 15-20 wasted cycles. per wrong guess.

sudox7's tweet photo. sorting your array before the loop makes it 6x faster. same data. same algorithm. the CPU just stopped guessing wrong.

a mispredicted branch flushes the entire pipeline. 15-20 wasted cycles. per wrong guess. https://t.co/sMY8d1tAqC

18

229

9

91

32K

1

92

6

51

14K

1 day ago

@damian_b @OjasSharma276 And of course both of them so much "fun" to debug

0

1

0

0

55

1 day ago

A lesson I needed to learn again and again: If something broke after, let's say, the two latest changes in your code base, where one is large and the other tiny - do not immediately assume the large one is guilty. I know, I know, it is so tempting to narrow the debugging area by ignoring changes that look too small or irrelevant to matter. Start testing with the smallest suspicious change. It can save you a lot of time.

1

8

0

0

1K

2 days ago

@robodadg Perfect answer!

0

0

0

0

72

2 days ago

Learning ML basics and looking into the numpy source code led me into some computing archaeology today. Did you know that when you import numpy for the first time, it detects the CPU features available on your machine? Looking at how it does this, I stumbled upon an interesting detail: on x86-32 PIC builds, numpy preserves the value of the ebx register around the CPUID instruction. Without googling do you know why?

0xAX's tweet photo. Learning ML basics and looking into the numpy source code led me into some computing archaeology today.

Did you know that when you import numpy for the first time, it detects the CPU features available on your machine?

Looking at how it does this, I stumbled upon an interesting detail: on x86-32 PIC builds, numpy preserves the value of the ebx register around the CPUID instruction.

Without googling do you know why?

2

25

2

30

6K

2 days ago

@CorrodedCoder Can imagine how it was "fun" to debug it

1

1

0

0

34

0xAX retweeted

Corroded Coder @CorrodedCoder

2 days ago

@0xAX That's a great explanation. In fact I saw an occasional bug in some old code in a JNI library which turned out to be exactly this but only seemed to crop up when loaded in certain environments under Java 25. Fortunately, I was able to update the code it with a compiler intrinsic!

1

1

1

0

565

2 days ago

Returning to the Linux kernel, the reason why it does not need this workaround is trivial now, it is simply not compiled as position-independent code

0xAX's tweet photo. Returning to the Linux kernel, the reason why it does not need this workaround is trivial now, it is simply not compiled as position-independent code https://t.co/fS9Gn19iCO

0

1

0

0

620

2 days ago

CPUID returns one of its results in the ebx register. But in 32-bit position-independent code, ebx may already be used as the PIC register holding the Global Offset Table base. This is easy to reproduce in Godbolt. With GCC 4.9.4, this code compiles with -m32 -O3, but fails if we add -fPIC. So numpy avoids the problem by swapping ebx with a temporary register before and after CPUID. GCC 5 will preserve this register by itself, so the direct version compiles with newer GCC versions too https://t.co/IK6wKZeO8w

2

2

0

3

1K

3 days ago

@cs_serdar Thank you for the advice. Yes, trying to do something similar now

0

1

0

0

262

4 days ago

Question for people who learned ML after years of software engineering outside ML - what was actually worth implementing yourself? Arrays? Some linear algebra?Optimizers? CUDA kernels? Trying to find the point where learning from scratch stops being useful and quietly becomes a separate multi-year project

6

32

1

31

7K

3 days ago

@cwyangg Thank you!

0

0

0

0

280

Last Seen Users on Sotwe

Trends for you

Most Popular Users