One more I forgot until just reminded:
3. Khosla Ventures wanted to invest in our Series C. Vinod took me, Michelle, and Lee out to dinner after he’d given us a term sheet. Near the end, Michelle and Lee got up to use the restroom. Vinod leaned over and said: “I’m impressed with you, not so much with them, what if you fire them and I’ll give you all their stock?” I think the charitable read was it was a test of my character. But I was so offended that we never spoke again. Literally blocked his number.
> A YC startup claimed to have built a cheating tool in 4 days, but they actually stole it from an open-source project called Cheating Daddy, which is literally a clone of Cluely, the $15 million a16z-backed startup building… a cheating tool
Today, we release QwQ-32B, our new reasoning model with only 32 billion parameters that rivals cutting-edge reasoning models, e.g., DeepSeek-R1.
Blog: https://t.co/jpNEx0Ck8p
HF: https://t.co/h91przQmoP
ModelScope: https://t.co/p0ztmZpWIZ
Demo: https://t.co/sxVVRFwunC
Qwen Chat: https://t.co/bg4tAU1p74
This time, we investigate recipes for scaling RL and have achieved some impressive results based on our Qwen2.5-32B. We find that RL training can continuously improve performance, especially in math and coding, and we observe that the continuous scaling of RL can help a medium-size model achieve competitive performance against a gigantic MoE model. Feel free to chat with our new models and provide us feedback!
@willccbb I agree for the question-answer docs, user manuals, etc.
What’s your view on numerical data from the relational DB you would usually find in a company? Does it come down to generating SQL queries from text prompts?