New post where I discuss some of Chujun Song's benchmarking experiments (done in my lab at UMD) showing the overhead of large JDBC reads in Trino: https://t.co/lhNX2IFaYW
I wrote 2 blog posts recently on that discuss using Trino to access data stored in SQL databases and related performance issues.
How to Avoid Data Access Bottlenecks When Using Trino https://t.co/GydAeRiQQA
The dangers of the JDBC bottleneck in Trino
https://t.co/Try2f1JEnZ
Congrats to newly minted PhDs from my lab: Cuong (Ethan) Nguyen and Chujun Song (2nd and 4th from the right). Can't wait to see the incredible things you both will accomplish in your next steps ...
@andy_pavlo@pateljm@CMUDB@sriniseshan Big pick up for CMU. And now a total transformation from where CMU was a decade ago after Natassa and Chris left.
@__pragma__@muratdemirbas@PatHelland@danrkports Agreed. Serializability scalability got a bad reputation from systems that did it wrong. Well-designed systems can scale serializability to enormous scales, even for geographically distributed transactions.
The grammar police at Starburst (I won't name names) tried to nix my phrase: "It is no longer acceptable to “code, growed, and offload” a dataset" from the post I tweeted earlier today, but I got it in via a special poetry exception.
@kartick_vad Depends on the mysql storage engine. The only difference between repeatable read and serializable in theory is phantom protection, and does not cause 10X slowdown in traditional DBMSs. Both systems that bolt on serializability after-the-fact do run into performance problems.
@johnhugg@andy_pavlo Thanks John. My 4 year old promised me that she's going to teach him how to smile, but she hasn't succeeded yet. So I think we'll work on that before full sentences :)
Very proud of my student @gangliao101 who recently graduated from my lab at @umdcs and just accepted a full time Senior Researcher position. To find out where --- you will have to follow him 😀
Maryland's primary is tomorrow. Of the three main candidates running on the democratic side, was surprised to see that only @peterfranchot seems to discuss importance of investment in *all* public universities on his Website. Maybe I'm looking in the wrong place for the others?
Agree with much of this advice. Especially that you don't need to network with famous people. Interactions with peers are more likely to lead to long term relationships and many will go on to become famous over time. Further, it's a way for your work to get a grass roots impact.
On my way to SIGMOD --- my first in-person conference since COVID. Can't wait to catch up will all the people in the database community that I haven't seen in way too long ...
Found out this week that there is a wikipedia page about me that somehow I didn't know existed until now. Thank you to whoever contributed to it. https://t.co/aK19foFu48
In light of @VoltronData's 100M series A funding last week, it might be worth revisiting my 2017 and 2018 posts on Apache Arrow: https://t.co/u58sdn3R8W and https://t.co/BxareLh3qq
Nice post on isolation levels and consistency from Ben Darnell at @CockroachDB: https://t.co/J0GmY9i2fn. The basic approach of thinking about isolation and consistency separately was also discussed in my 2019 post: https://t.co/tlxXdYyD1n.