@rasbt Pretty satisfying to see everyone taking part in the open-source push especially Qwen. Currently playing around with 9B on my mac for reflection over toy instrtuction dataset.
I wish Meta was still there and kept focussing on mid-size open models instead of big-size leap.
@bryan_johnson An AI engineer here. Eating meet or not is a subjective truth. However, don’t go out claiming “very soon” AI will be dominant.
Firstly, we are not even remotely close. Second, we lack the most important ingredient at the moment i.e. EQ.
@fchollet Finding new algorthm is not LLMs job. That discipline is RL!
We always improve the data and detail level for our models to learn better. Explicit is better than implicit. Sure LMs learn pattern but they won’t if we feed them garbage.
@fchollet I think you are missing the point. Your model is as good as the data.
In the past we used raw text and images. Then we started preprocessing the data i.e. making data good. Then came the planning and reasoning, and now we are improving data for those.
@karpathy New chats for quick work and research. Projects for mid to long-term projects. Use hand-over prompt each time the conversation gets long.
I think works well for many cases.
@davidtsolheim@AndrewYNg Coding is not just typing syntax, or typing syntax with the help of AI. Coding is just a problem solving tool. Every engineer needs to be a good problem solver.