The open ChatGPT competitor HuggingChat can now use tools
It is fun to explore the new possibilities. Not always as good as in the example below, but the potential is high and will only get better from now on. 😍
I just trained a 124M param LLM from scratch.
It took ~26 seconds in @GoogleColab
Training details:
• 5145 tokens in training set
• 1024 tokens in context window
• 256 tokens per batch
• 10 epochs total
The model went from generating gibberish to full sentences.
Extremely cool.