AI-Insight

over 1 year ago

🚀Top 20 Likes of Hugging Face Daily Paper @_akhaliq @huggingface 🚀Congratulattion https://t.co/oBMDtE8CgT 1、The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits 2、Qwen2.5 Technical Report 3、MiniMax-01: Scaling Foundation Models with Lightning Attention 4、LLM in a flash: Efficient Large Language Model Inference with Limited Memory 5、Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone 6、Llama 2: Open Foundation and Fine-Tuned Chat Models 7、rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking 8、CLEAR: Character Unlearning in Textual and Visual Modalities 9、EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions 10、GAIA: a benchmark for General AI Assistants 11、GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection 12、DocLLM: A layout-aware generative language model for multimodal document understanding 13、3D Gaussian Splatting for Real-Time Radiance Field Rendering 14、Retentive Network: A Successor to Transformer for Large Language Models 15、Differential Transformer 16、Qwen2 Technical Report 17、Mixtral of Experts 18、Transformer Explainer: Interactive Learning of Text-Generative Models 19、Your Transformer is Secretly Linear 20、Self-Rewarding Language Models

AI_Insight_Talk's tweet photo. 🚀Top 20 Likes of Hugging Face Daily Paper @_akhaliq @huggingface
🚀Congratulattion

https://t.co/oBMDtE8CgT

1、The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

2、Qwen2.5 Technical Report

3、MiniMax-01: Scaling Foundation Models with Lightning Attention

4、LLM in a flash: Efficient Large Language Model Inference with Limited Memory

5、Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

6、Llama 2: Open Foundation and Fine-Tuned Chat Models

7、rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

8、CLEAR: Character Unlearning in Textual and Visual Modalities

9、EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

10、GAIA: a benchmark for General AI Assistants

11、GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

12、DocLLM: A layout-aware generative language model for multimodal document understanding

13、3D Gaussian Splatting for Real-Time Radiance Field Rendering

14、Retentive Network: A Successor to Transformer for Large Language Models

15、Differential Transformer

16、Qwen2 Technical Report

17、Mixtral of Experts

18、Transformer Explainer: Interactive Learning of Text-Generative Models

19、Your Transformer is Secretly Linear

20、Self-Rewarding Language Models

AI_Insight_Talk retweeted

7 months ago

Hugging Face Weekly Paper Trends @_akhaliq (Gen by nana-banana-pro)

12K

AI_Insight_Talk retweeted

alphaXiv

@askalphaxiv

7 months ago

We just raised a $7M Seed round co-led by @MenloVentures and @haystackvc with participation from @Shakti_VC, @conviction and @upfrontvc 🚀 We're honored to have the support of incredible angels including @ericschmidt, @SebastianThrun, @sarahookr Join us: https://t.co/IKwK8KsG96

askalphaxiv's tweet photo. We just raised a $7M Seed round co-led by @MenloVentures and @haystackvc with participation from @Shakti_VC, @conviction and @upfrontvc 🚀

We're honored to have the support of incredible angels including @ericschmidt, @SebastianThrun, @sarahookr

Join us: https://t.co/IKwK8KsG96 https://t.co/tzOpr7TcAX

633

221

174K

8 months ago

NewtonBench: BENCHMARKING GENERALIZABLE SCIENTIFIC LAW DISCOVERY IN LLM AGENTS https://t.co/loNLizypPj

8 months ago

Found an interesting next model architecture exploration work from Shanghai AI Lab: SDAR, a new paradigm that converts trained AR models into blockwise diffusion models for FAST parallel decoding! ✅ AR's training efficiency ✅ Diffusion's inference speed The 30B MoE model even beats pure AR baselines on GPQA and ChemBench. HF Papers: https://t.co/YwfrWqsaSb Model（1.7B/4B/8B/30B-A3B）：https://t.co/ptU3ttWXlL

AI_Insight_Talk's tweet photo. Found an interesting next model architecture exploration work from Shanghai AI Lab: SDAR, a new paradigm that converts trained AR models into blockwise diffusion models for FAST parallel decoding!

✅ AR's training efficiency
✅ Diffusion's inference speed

The 30B MoE model even beats pure AR baselines on GPQA and ChemBench.

HF Papers: https://t.co/YwfrWqsaSb
Model（1.7B/4B/8B/30B-A3B）：https://t.co/ptU3ttWXlL

11 months ago

@Alibaba_Qwen @OpenAI When will OpenAI release an open-source large language model (LLM)?

352

AI_Insight_Talk retweeted

11 months ago

Amazing !!!! I try the anycoder to generate tech-style poster @_akhaliq https://t.co/TshVOxmRoU

12 months ago

HF Papers: Code Bench Live https://t.co/SsjkaAq2d0

AI_Insight_Talk retweeted

over 1 year ago

🔥 VLMEvalKit Support Gemma 3 🚀 https://t.co/oqDHWUoB6y https://t.co/tqvMIeeASM @OpenCompassX @GoogleAI

177

AI_Insight_Talk retweeted

over 1 year ago

🔥 BREAKTHROUGH ALERT! OpenCompass @OpenCompassX v0.4.1 is now LIVE 🚀 Our latest release brings new Omni-Math support, OlympiadBench evaluation framework, and the challenging HLE dataset! Enhanced math verification, dataset repetition, and G-Pass computation. See how we're pushing the boundaries of AI evaluation! #AIEvaluation #OpenCompass #TechInnovation https://t.co/muuN0ChMU1

vansinhu's tweet photo. 🔥 BREAKTHROUGH ALERT! OpenCompass @OpenCompassX v0.4.1 is now LIVE 🚀
Our latest release brings new Omni-Math support, OlympiadBench evaluation framework, and the challenging HLE dataset! Enhanced math verification, dataset repetition, and G-Pass computation. See how we're pushing the boundaries of AI evaluation! #AIEvaluation #OpenCompass #TechInnovation

https://t.co/muuN0ChMU1

162

AI_Insight_Talk retweeted

over 1 year ago

🚀 Just discovered #WritingBench - a game-changer for evaluating LLMs' writing capabilities! 🔥 Key highlights: • 1,239 queries across 6 domains & 100 subdomains • Dynamic criteria generation with 83% human alignment • Enables 7B models to reach SOTA performance The paper identifies that CoT prompting significantly improves creative writing tasks - something we should all implement! Their domain categorization is incredibly thorough. Check out the repo: https://t.co/gWIbHwHGSB #AI #NLP #LLM #Research

vansinhu's tweet photo. 🚀 Just discovered #WritingBench - a game-changer for evaluating LLMs' writing capabilities!

🔥 Key highlights:
• 1,239 queries across 6 domains & 100 subdomains
• Dynamic criteria generation with 83% human alignment
• Enables 7B models to reach SOTA performance

The paper identifies that CoT prompting significantly improves creative writing tasks - something we should all implement! Their domain categorization is incredibly thorough.

Check out the repo: https://t.co/gWIbHwHGSB

#AI #NLP #LLM #Research

165

AI_Insight_Talk retweeted

over 1 year ago

🔥MedAgentsBench： Amazing Work🚀 Just explored #MedAgentBench from @Yale researchers and it's mind-blowing! They've created a cutting-edge benchmark that finally exposes the true capabilities of LLMs in complex medical reasoning. ⚡ Key discoveries: DeepSeek R1 & OpenAI O3 dominate clinical reasoning tasks Agent-based frameworks deliver exceptional performance-cost balance Open-source alternatives are closing the gap at fraction of the cost This work shatters previous benchmarks that failed to challenge today's advanced models. The future of medical AI is here: https://t.co/GR0zTsBu8V #MedicalAI #MachineLearning #AIinHealthcare 🔥

$vansinhu's tweet photo. 🔥MedAgentsBench： Amazing Work🚀 Just explored #MedAgentBench from @Yale researchers and it's mind-blowing! They've created a cutting-edge benchmark that finally exposes the true capabilities of LLMs in complex medical reasoning. ⚡ Key discoveries: DeepSeek R1 & OpenAI O3 dominate clinical reasoning tasks Agent-based frameworks deliver exceptional performance-cost balance Open-source alternatives are closing the gap at fraction of the cost This work shatters previous benchmarks that failed to challenge today's advanced models. The future of medical AI is here: https://t.co/GR0zTsBu8V #MedicalAI #MachineLearning #AIinHealthcare 🔥$

441

over 1 year ago

AI_Insight_Talk's tweet photo. https://t.co/ayRxXvWuFg