Simon Guindon @simongui - Twitter Profile

simongui retweeted

about 1 month ago

Database transactions don't get enough love but the ability to execute a bunch of code, change a bunch of data, and only commit it when you've validated the results is going to be so critical in the AI era.

14

336

29

36

25K

Simon Guindon @simongui

about 1 month ago

I think I just experienced my first AI assisted interview candidate this week. For every question there was a sigh followed by a loooong pause. Tons of stalling. Glancing at another screen Then all the sudden answered the question Every single question was the same stalling

0

41

simongui retweeted

ClickHouse @ClickHouseDB

6 months ago

Alexey said it best: “We optimize ClickHouse… and then optimize it again… and again.” https://t.co/AO8W3A0aY7 25.11 keeps that tradition alive Parallel GROUP BY merges. Projections as true secondary indexes. Faster DISTINCT. More speed everywhere.

1

62

6

27

32K

simongui retweeted

Daniel Lemire

@lemire

about 2 years ago

From Modular's blog we learn that Mojo (the new programming language) makes SIMD instructions first class citizens: "CPUs have special registers and instructions to process multiple bits of data at the same time, known as SIMD (Single Instruction, Multiple Data). But the ergonomics of writing this code has historically been very ugly and difficult to use. These special instructions have been around for many years, but most code is still not optimized for it. When someone works through the complexities and writes a portable SIMD optimized algorithm, it blows the competition out of the water, for example simd_json. Mojo's primitives are natively designed to be SIMD-first: UInt8 is actually a SIMD[DType.uint8, 1] which is a SIMD of 1 element. There is no performance overhead to represent it this way, but it allows the programmer to easily use it for SIMD optimizations. For example, you can split up text into 64 byte blocks and represent it as SIMD[DType.uint8, 64] then compare it to a single newline character, in order to find the index for every newline. Because the SIMD registers on your machine can calculate operations on 512bits of data at the same time, this will improve the performance for those operations by 64x!"

9

145

17

84

34K

Who to follow

data, engineering, icecream. Building @StoreLocators. ex @shopify

DudeRock

@DudeRockTV

🤘Games, Dreams and Rock n' Roll, man. Making video games

simongui retweeted

Denis Magda

@denismagda

about 2 years ago

If you're not a stranger to the world of databases, then you have either read or heard about the Database Internals book by Alex Petrov (@ifesdjeen). However, what many still don't know is that Alex runs a Discord community where you can continue to advance your knowledge of database internals. Tomorrow, @FranckPachot and I are joining the group to demonstrate how MVCC (Multi-Version Concurrency Control) works in Postgres and YugabyteDB. Join us: https://t.co/AwYK68aiqB

denismagda's tweet photo. If you're not a stranger to the world of databases, then you have either read or heard about the Database Internals book by Alex Petrov (@ifesdjeen).

However, what many still don't know is that Alex runs a Discord community where you can continue to advance your knowledge of database internals.

Tomorrow, @FranckPachot and I are joining the group to demonstrate how MVCC (Multi-Version Concurrency Control) works in Postgres and YugabyteDB.

Join us: https://t.co/AwYK68aiqB

4

129

20

134

20K

simongui retweeted

Markus Eisele

@myfear

over 2 years ago

How to Generate Unique IDs in Distributed Systems: 6 Key Strategies | by Phuong Le (@func25) https://t.co/GFlaKjTLwf #distributedsystems

myfear's tweet photo. How to Generate Unique IDs in Distributed Systems: 6 Key Strategies | by Phuong Le (@func25) https://t.co/GFlaKjTLwf
#distributedsystems https://t.co/NuJeUfu6BT

1

173

53

177

13K

simongui retweeted

ESPN F1

@ESPNF1

over 2 years ago

This has got to be the best F1 setup we've ever seen 🤩📺 (via @gameroomtheater)

342

18K

2K

1K

3M

Simon Guindon @simongui

over 2 years ago

@fanatec I’ve enjoyed my DD1 so much over the past 4 years. I’m sad the power supply died. I want to get back to racing and streaming. How can I purchase a replacement power supply? Thanks so much in advance for such a great wheel base.

0

38

simongui retweeted

Phil Eaton

@eatonphil

over 2 years ago

Feels like there's room for simpler and faster Jepsen, or even just variations -- and explanations -- of it. Love to see this. https://t.co/9aom7aKGJQ

eatonphil's tweet photo. Feels like there's room for simpler and faster Jepsen, or even just variations -- and explanations -- of it.

Love to see this.

https://t.co/9aom7aKGJQ https://t.co/GIaezzbMW3

1

141

24

71

17K

Simon Guindon @simongui

over 2 years ago

Introducing pgroll: zero-downtime, reversible, schema migrations for Postgres. https://t.co/Qp0EBpXQrg

0

1

0

79

simongui retweeted

johnnysswlab.com @johnnysswlab

over 2 years ago

Faster hash maps, binary trees etc. through data layout modification We investigate how to make faster hash maps, trees, linked lists and vector of pointers by changing their data layout. https://t.co/6jYp3etxPG

3

247

59

189

40K

simongui retweeted

Saurabh Dashora

@ProgressiveCod2

over 2 years ago

This brilliant technique for handling database queries literally saved Discord. It helped them store trillions of messages and fetch them without bringing their DB cluster to its knees. The technique is called Request Coalescing. And it’s too good to ignore. But what’s so special about it? If multiple users are requesting the same row at the same time, why not query the database only once? This is exactly what Request Coalescing helps us achieve. Here’s what happens under the hood: - The first user that makes a request causes a worker task to spin up in the data service - Subsequent requests for the same data will check for the existence of that task and subscribe to it - Once the initial worker task queries the database and gets the result, it will return the row to all subscribers at the same time. There are several pros to using Request Coalescing: - Efficient utilization of database resources - Ability to handle more concurrent requests without creating hot partitions - Reduce latency But there are some cons as well: - Implementation can be complex with regards to getting a fair distributed reader-write lock. Basically, multiple readers need to access the data simultaneously while preventing conflicting writes - Overall latency may go down, but certain requests will take more time Of course, this technique is NOT needed normally. But at a certain scale, it can actually save your business. === That’s all for now! If you enjoyed this post, don’t forget to: - Destroy the LIKE button - REPOST so that everyone can try Request Coalescing wherever applicable. - BOOKMARK for future reference - Follow me for more posts like this.

33

1K

196

1K

252K

simongui retweeted

sean

@seanmw

over 2 years ago

If you are at @labscon_io and want to talk about unpacking and obfuscation come say hi 👋

0

12

3

0

3K

simongui retweeted

irfan sharif

@irfansharifm

over 2 years ago

We’ve been working on end-to-end flow control for quorum-replicated writes + LSMs in CockroachDB; it’s all very dapper. Tried writing about it here: https://t.co/NXx1v8TNEr

0

56

6

21

7K

simongui retweeted

A. Jesse Jiryu Davis @jessejiryudavis

over 2 years ago

My review of Alibaba's PolarDB-SCC, a really promising way to query database backup nodes without risking stale data https://t.co/rSwscCRY9j

0

31

7

8

4K

Simon Guindon @simongui

over 2 years ago

With today’s release of @PostgreSQL v16 I wanted share how important the logical replication changes are to change data capture (CDC) pipelines like the ones used with @debezium. Using CDC with PG16 drastically reduces risk to your primary instances! https://t.co/T0SjPPwq8d

0

12

3

1K

Simon Guindon @simongui

almost 3 years ago

As this runs longer and longer and I add more workloads the storage engines should start to show some of their tradeoffs.

0

56

Simon Guindon @simongui

almost 3 years ago

I’m excited to share I’ve launched Streetwise - a live 24/7 benchmark processing real time stock market trades that fans out to 3 flavours of Postgres and measures each. Running @PostgreSQL and the exciting new @orioledb and @hydradatabase https://t.co/p3Jm8m41Cs

3

0

224

Simon Guindon @simongui

almost 3 years ago

You can even see the rush of activity on the markets as the markets prepare to close :)

0

68

simongui retweeted

Mim @mim_djo

almost 3 years ago

so tiktok took Clickhouse engine and used a snowflake inspired architecture to build a state of the art Cloud DWH for internal use and recently made it open source !!! this is wild :) https://t.co/Kzuv46YN5Z

mim_djo's tweet photo. so tiktok took Clickhouse engine and used a snowflake inspired architecture to build a state of the art Cloud DWH for internal use and recently made it open source !!! this is wild :)

https://t.co/Kzuv46YN5Z https://t.co/arQoXYe6OR

8

340

53

291

72K

Simon Guindon

@simongui

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users