OpenChat @openchatdev - Twitter Kullanıcısı

Sabitlenmiş Tweet

yaklaşık 2 yıl önce

🚀Introducing OpenChat 3.6 🌟Surpassed official Llama3-Instruct—with 1-2M synthetic data compared to ~10M human labels 🤫GPTs are close to limits—excel at generation but fall short at complex tasks 🎯We are training next gen—capable of deterministic reasoning and planning 🔗 Explore OpenChat-3.6 (20240522 Llama 3 Version): HuggingFace: https://t.co/gcvR7caFVN Live Demo: https://t.co/0ZMWnRStup GitHub: https://t.co/rFeB7voG6V

OpenChatDev's tweet photo. 🚀Introducing OpenChat 3.6

🌟Surpassed official Llama3-Instruct—with 1-2M synthetic data compared to ~10M human labels
🤫GPTs are close to limits—excel at generation but fall short at complex tasks
🎯We are training next gen—capable of deterministic reasoning and planning

🔗 Explore OpenChat-3.6 (20240522 Llama 3 Version):

HuggingFace: https://t.co/gcvR7caFVN
Live Demo: https://t.co/0ZMWnRStup
GitHub: https://t.co/rFeB7voG6V

7

289

67

128

33K

OpenChatDev retweetledi

Guan Wang

@makingAGI

11 ay önce

Will Sudoku become the MNIST for reasoning? Simple rules, clear structure, unique solutions—yet surprisingly challenging for modern LLMs, often requiring explicit trial-and-error to solve. https://t.co/BmqUaW8Qno

4

68

9

21

8K

OpenChatDev retweetledi

Guan Wang

@makingAGI

11 ay önce

🚀Introducing Hierarchical Reasoning Model🧠🤖 Inspired by brain's hierarchical processing, HRM delivers unprecedented reasoning power on complex tasks like ARC-AGI and expert-level Sudoku using just 1k examples, no pretraining or CoT! Unlock next AI breakthrough with neuroscience. 🌟 📄Paper: https://t.co/Sxprojsv0c 💻Code: https://t.co/k15cUS2wlf

makingAGI's tweet photo. 🚀Introducing Hierarchical Reasoning Model🧠🤖

Inspired by brain's hierarchical processing, HRM delivers unprecedented reasoning power on complex tasks like ARC-AGI and expert-level Sudoku using just 1k examples, no pretraining or CoT!

Unlock next AI breakthrough with neuroscience. 🌟

📄Paper: https://t.co/Sxprojsv0c
💻Code: https://t.co/k15cUS2wlf

227

4K

625

3K

1M

OpenChat @OpenChatDev

1 yıldan fazla bir süre önce

Thrilled to see RSP featured at AAAI'25! This pioneering concept was a key inspiration for developing OpenChat! 🚀 #AI #AAAI25

Haoyi Niu✈️CVPR @t641769919

1 yıldan fazla bir süre önce

🚨Recursive Skip-Step Planning (RSP) Relying on larger, expressive models for sequential decision-making has recently become a popular choice, but are they truly necessary? Can we replace these heavy models? Yes—RSP empowers shallow MLPs to excel in long-horizon tasks!🧵(1/n)

t641769919's tweet photo. 🚨Recursive Skip-Step Planning (RSP)
Relying on larger, expressive models for sequential decision-making has recently become a popular choice, but are they truly necessary? Can we replace these heavy models? Yes—RSP empowers shallow MLPs to excel in long-horizon tasks!🧵(1/n) https://t.co/0wzALRFEim

1

11

3

2

3K

1

6

0

3

787

OpenChatDev retweetledi

Alignment Lab AI

@alignment_lab

neredeyse 2 yıl önce

skronge bones in that one 🔍 excellent job! a 7b model out cracking gpt4 turbo and gpt4o and claude 3 sonnet!

0

8

3

6

2K

OpenChat @OpenChatDev

yaklaşık 2 yıl önce

4)This is why I am embarking on a journey to explore new frontiers in AI, specifically targeting the current limitations of GPTs in Planning and Reasoning.

2

22

0

1

2K

OpenChat @OpenChatDev

yaklaşık 2 yıl önce

🚀Introducing OpenChat 3.6 🌟Surpassed official Llama3-Instruct—with 1-2M synthetic data compared to ~10M human labels 🤫GPTs are close to limits—excel at generation but fall short at complex tasks 🎯We are training next gen—capable of deterministic reasoning and planning 🔗 Explore OpenChat-3.6 (20240522 Llama 3 Version): HuggingFace: https://t.co/gcvR7caFVN Live Demo: https://t.co/0ZMWnRStup GitHub: https://t.co/rFeB7voG6V

7

289

67

128

33K

OpenChat @OpenChatDev

yaklaşık 2 yıl önce

3) However, while training these new models, I can't help but realize the upper limit of what autoregressive models can do. They struggle to solve complex tasks such as software engineering, advanced mathematics, and creating super assistants. It is mathematically challenging for GPT models to efficiently and effectively decompose and plan for the multistep, deterministic actions necessary for AGI.

2

11

0

2K

OpenChatDev retweetledi

OpenChat @OpenChatDev

yaklaşık 2 yıl önce

🚀Introducing OpenChat 3.6 🌟Surpassed official Llama3-Instruct—with 1-2M synthetic data compared to ~10M human labels 🤫GPTs are close to limits—excel at generation but fall short at complex tasks 🎯We are training next gen—capable of deterministic reasoning and planning 🔗 Explore OpenChat-3.6 (20240522 Llama 3 Version): HuggingFace: https://t.co/gcvR7caFVN Live Demo: https://t.co/0ZMWnRStup GitHub: https://t.co/rFeB7voG6V

7

289

67

128

33K

OpenChat @OpenChatDev

2 yıldan fazla bir süre önce

@burkov It's an experiment. To see if 6T tokens are the secret 🤣

2

3

0

1

332

OpenChat @OpenChatDev

2 yıldan fazla bir süre önce

🚀 The World's First Gemma fine-tune based on openchat-3.5-0106 data and method (C-RLFT). Almost the same performance as the Mistral-based version. 6T tokens = secret recipe? HuggingFace: https://t.co/X5WIZyxAlr

11

177

29

72

27K

OpenChat @OpenChatDev

2 yıldan fazla bir süre önce

@ramirosalas Thank you! We're tuning MoE hyperparams and getting a GPU cluster to train 70b 🤣

1

7

0

1

306

OpenChat @OpenChatDev

2 yıldan fazla bir süre önce

@ZahirHamroune 8192 (same as Gemma base)

0

1

0

379

OpenChat @OpenChatDev

2 yıldan fazla bir süre önce

@_philschmid @teknium @Mascobot Just replaced the Gemma's tokenizer with the instruction-tuned version's. Feel free to use it https://t.co/wUNjn86t10

0

3

0

1

599

OpenChat @OpenChatDev

2 yıldan fazla bir süre önce

@o_b @LMStudioAI Thank you! Our Mixtral-based model is ongoing ⚙️

1

5

0

557

OpenChat @OpenChatDev

2 yıldan fazla bir süre önce

🚀Announcing OpenChat-3.5 Update 0106: 𝗪𝗼𝗿𝗹𝗱’𝘀 𝗕𝗲��𝘁 𝗢𝗽𝗲𝗻 𝗦𝗼𝘂𝗿𝗰𝗲 𝟳𝗕 𝗟𝗟𝗠! Experience ChatGPT & Grok-level AI locally 💿! Surpassing Grok-0 (33B) across all 4 benchmarks and Grok-1 (???B) on average and 3/4 benchmarks 🔥. 🎯 This update mainly enhanced training methodology, in-context learning & coding skills, outperforming the last 1210 release on 7 out of 8 benchmarks! 🌐 The model is available on our live demo, HuggingFace, and GitHub: HuggingFace: https://t.co/Y6XQbzB2cE Live Demo: https://t.co/0ZMWnRStup GitHub: https://t.co/rFeB7voG6V 🛠️ To deploy it yourself, visit our GitHub (https://t.co/rFeB7voG6V) for full instructions to serve OpenChat models with an accelerated vLLM backend, API key authentication, and more!