@xyz3c7a@__tinygrad__@tenstorrent That came with a cost of constant vllm and kernel updates for supporting new LLMs. Won't make sense at low market share. Performance being too sensitive to optimization tweaks is the issue.
@AlexanderKalian@PlasmoLab Transformer was the graph architecture. It has permutation equivariance for graph nodes and positional encoding for adjacency. Works well on knowledge graphs and even ARC-AGI.
@geohotarchive Much of the confusion is not on the pie size but on the design. There could be a self-propelled robotics industry that occupies, mines, powers, builds more robots and is well defended, but no human jobs and no human-relevant work.
@andrewgwils Did electromagnetism came out of
A. more physics experimental data,
B. more computing or
C. our genes through billions of years of evolution?