@torchcompiled Coding agents are starting to get good around large codebases. I’m inclined to think it’s the processes and engineering culture around the codebase that have to change, and verification of code is now the bottleneck
Launching our new paper on arXiv: we trained the largest multilingual food model ever built.
4.1M recipes. 7 languages. 1,790 ingredients. 300 dimensions.
All of human cooking compressed into 2 megabytes.
@cargoshortdad64 school needs to reorient around teaching students to ask questions effectively. it’s a (meta)skill that’s missing from current education
@cargoshortdad64 I approached it differently by using gumbel noise, which you can formulate as the continuous analogue to the absorbing state discrete diffusion process. An argmax operation converts it back to discrete tokens from logit space, and used the gumbel max trick to train it