The Mint @themintsv - Twitter Profile

The Mint @themintsv

3 days ago

@julien_c @huggingface 1 Exabyte = 1000 petabytes = 1 million terabytes

The Mint @themintsv

3 days ago

@hcompany_ai Hybrid with Internet access is probably the way to go. Simple tasks: local. Stuck on a complicated task: Internet or cloud.

The Mint @themintsv

3 days ago

@RemiFabreRobot Those antennas could be prettier @pollenrobotics

The Mint @themintsv

4 days ago

GenLIP: Generative Language-Image Pre-training https://t.co/Lp2hXldwxW https://t.co/Yu1TR7QCKd

Ross Wightman

@wightmanr

4 days ago

I had a fun coding session at the end of last week.. I implemented NaFlexGenLIP. It's an impl of GenLIP using NaFlex style image tokenization and packing instead of the full NaViT style sequence packing. Prelim CC12M sanity training on my local RTX Pro 6000s is showing some signs of life even with such a small model and dataset🥳 I was going to do this as part of a new project but w/ recent OpenCLIP refactoring it was easy enough to bolt on there initially to get something that's ready for scale experiments sooner. I threw in some utils to calculate text sequence length and batch budget params based on dataset caption dist. Also hacked together a generative 'zero-shot' image classification idea based on likelihoods.

The Mint @themintsv

4 days ago

PolarQuant: Quantizing KV Caches with Polar Transformation (Google) https://t.co/vzzoEPKCwA

The Mint @themintsv

4 days ago

RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search https://t.co/KDRn7C9CCw

turbopuffer

@turbopuffer

4 days ago

tpuf quantizes vectors to improve perf (RaBitQ) the algo randomly rotates vectors, and we were using matmul at O(d²) space & time, brutal at high dims. 10k = 400MB in RAM! we rebuilt the rotation using FWHT at O(d) space & O(d log d) time. ~no recall loss, 10k = only 5kB in RAM

https://t.co/DBSFgkGloL
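The 400MB figure in the tweet is consistent with a dense d x d float32 matrix at d = 10,000 (10^8 entries at 4 bytes each). A minimal sketch of the general idea, not turbopuffer's actual code: one common way to build an FWHT-based pseudo-random rotation is to flip signs with a random ±1 vector and then apply a normalized fast Walsh-Hadamard transform. The names `fwht`, `make_signs`, and `rotate` are hypothetical, the sketch assumes the vector length is a power of two (other lengths would need padding or a similar workaround), and a single sign-flip-plus-transform round is only an approximation of a uniformly random rotation.

```python
import math
import random


def fwht(values):
    """Unnormalized fast Walsh-Hadamard transform.

    Returns a new list; len(values) must be a power of two.
    Runs in O(d log d) time with O(d) extra space.
    """
    out = list(values)
    n = len(out)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    h = 1
    while h < n:
        # Butterfly step: combine elements that are h apart.
        for start in range(0, n, 2 * h):
            for i in range(start, start + h):
                a, b = out[i], out[i + h]
                out[i], out[i + h] = a + b, a - b
        h *= 2
    return out


def make_signs(dim, seed=0):
    """Random +/-1 diagonal: the only state the rotation stores (O(d))."""
    rng = random.Random(seed)
    return [rng.choice((-1.0, 1.0)) for _ in range(dim)]


def rotate(vector, signs):
    """Apply x -> H (D x) / sqrt(d), an orthogonal (norm-preserving) map."""
    flipped = [s * v for s, v in zip(signs, vector)]
    scale = 1.0 / math.sqrt(len(vector))
    return [scale * v for v in fwht(flipped)]
```

Because the map is orthogonal, lengths (and therefore distances between vectors rotated with the same `signs`) are preserved, which is the property a quantizer relies on; the stored state is just the sign vector rather than a d x d matrix.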

The Mint @themintsv

4 days ago

Beyond the TurboQuant-RaBitQ Debate: Why Vector Quantization Matters for AI Infrastructure Costs https://t.co/IS4DTVnjBj

The Mint @themintsv

5 days ago

@n0riskn0r3ward I agree. Based on my years of experience training and shipping embedding models, the quality of training data for embedding models is very important.

The Mint @themintsv

5 days ago

@n0riskn0r3ward They are doing it! I get a whole screen every now and then, trying to convince me to subscribe...

The Mint @themintsv

8 days ago

@julien_c @huggingface That sounds insane. Is it a real copy or a lazy copy (either you incur the cost at access time, or nothing is copied and you just access the data in place, in which case cross-region access becomes a bottleneck)?

The Mint @themintsv

8 days ago

@ndea The paper: Recursive Program Synthesis Authors: Aws Albarghouthi, Sumit Gulwani, Zachary Kincaid University of Toronto, Microsoft Research https://t.co/su2XJapLiG

https://t.co/cStAaswoc8

The Mint @themintsv

9 days ago

@poteto Feels like people have been putting too much effort into developing these (tools (including Cursor), skills, harnesses, etc.). They will become obsolete soon (they will be absorbed or fixed by the AI companies).
