Linjun Zhang

Federico Bianchi @federicobianchy

8 months ago

jaseweston's tweet photo. 🌀New Self-Driven RL Method: RESTRAIN 🌀
📝: https://t.co/VceEJ248fW
- RESTRAIN turns spurious votes → self-improving signals. No labels needed
- Does this through self-penalizing unreliable reasoning paths:
✔️ Uses all rollouts, not just the majority,
✔️ Offsets low-consistency rollout advantage,
✔️ Down-weights low-consensus prompts
📈 Results:
🔥 Beats existing techniques on both training-time (label-free) and test-time scaling — all without labels.
🔥 Nearly matches (and sometimes surpasses) gold-label RL
🧵(1/5)

linjunz_stat retweeted

Ran Xu @ritaranx

8 months ago

ritaranx's tweet photo. 🚨 Happy to share AceSearcher accepted to #NeurIPS2025 #Spotlight!

🔹 One LLM, two roles: Decomposer (split queries) + Solver (combine context)
🔹 +7.6% on QA & fact verification
🔹 32B ≈ DeepSeek-V3 on DocMath
📂 Code: https://t.co/lQU12Dm7vb
📑 arXiv: https://t.co/JI0kOh0yDk https://t.co/BIfyr1tig7

linjunz_stat retweeted

10 months ago

federicobianchy's tweet photo. 🚀 One month left to submit to Agents4Science! 🤖

AI as primary author + reviewer 🤖 Human co-authors welcome. All submissions/reviews public for transparent study.

💡We expect AI will make mistakes - and it will be instructive to study these openly! https://t.co/vNkhvYIt6O

linjunz_stat retweeted

10 months ago

jaseweston's tweet photo. 🤖Introducing: CoT-Self-Instruct 🤖
📝: https://t.co/OXFALvsDgh
- Builds high-quality synthetic data via reasoning CoT + quality filtering
- Gains on reasoning tasks: MATH500, AMC23, AIME24 & GPQA-💎
- Outperforms existing train data s1k & OpenMathReasoning
- Gains on non-reasoning tasks as well: AlpacaEval & ArenaHard
🧵1/3

linjunz_stat retweeted

James Zou @james_y_zou

11 months ago

james_y_zou's tweet photo. 📢New conference where AI is the primary author and reviewer! https://t.co/lLjAgp7Zmp

Current venues don't allow AI-written papers, so it's hard to assess the +/- of such works🤔 #Agents4Science solicits papers where AI is the main author w/ human advisors.

💡Initial reviews by LLM reviewers w/ final assessment + selection by human experts.
💡Submissions are asked to clearly document AI contribution.
💡All submissions/reviews will be public to enable transparent study of the strength and limitations of AI as researcher and reviewer.
We expect AI will make mistakes and it will be instructive to study these in the open!

Many thanks to the fantastic co-organizers and expert advisory board! Please see the website for more information.

linjunz_stat retweeted

James Zou @james_y_zou

about 1 year ago

Our new #ICML2025 paper formulates #LLM hallucination as hypothesis testing to provide statistical guarantees on factuality. #FactTest is a distribution-free and model-agnostic approach to improve LLM accuracy. Great job @FanNie1208, Xiaotian Hou, Shuhang Lin, @HuaxiuYaoML, @linjunz_stat!

linjunz_stat retweeted

over 1 year ago

jaseweston's tweet photo. 🚨 New Paper 🚨
An Overview of Large Language Models for Statisticians
📝: https://t.co/oklTYEAMvH

- Dual perspectives on Statistics ➕ LLMs: Stat for LLM & LLM for Stat
- Stat for LLM: How statistical methods can improve LLM uncertainty quantification, interpretability, trustworthiness & more.
- LLM for Stat: How LLMs can enhance statistical workflows: from data collection, synthesis, annotation to statistical modeling, with applications to medical research

Presents key LLM advances: Architecture, Training, Reasoning, and Self-Alignment:
(1) 🧠Evolution of LLM architectures with Transformers and Self-Attention
(2) LLM training pipeline from pre-training, SFT, to RLHF and Preference Optimization.
(3) 💭 System 2 Prompting and Chain-of-Thought for test-time scaling.
(4) 🚀 LLM Self-Alignment for achieving super-human intelligence

Statisticians play a key role in the development of large-scale AI models:
(1) 💡 Statistical insights improve LLM uncertainty quantification & interpretability
(2) 🤖 Watermarking for AI-generated content detection
(3) ⚖️ Privacy & algorithmic fairness to ensure responsible AI adoption

LLMs can also empower statistical science by:
(1) 📈 Scaling up data collection, synthesis, and annotation.
(2) 🖥️ Automating statistical coding & exploratory analysis
(3) 🔬 Facilitating medical research

By bridging statistics & AI, we can:
✅ Build better LLMs with statistical methodologies.
✅ Leverage LLMs for statistical applications in high-stakes domains

linjunz_stat retweeted

over 1 year ago

jaseweston's tweet photo. 💀 Introducing RIP: Rejecting Instruction Preferences💀

A method to *curate* high quality data, or *create* high quality synthetic data.

Large performance gains across benchmarks (AlpacaEval2, Arena-Hard, WildBench).

Paper 📄: https://t.co/9EKFpTsd9e https://t.co/cOvDJu5jiZ

linjunz_stat retweeted

Jing Xu @jingxu_ml

over 1 year ago

New data selection & synthetic data creation method can dramatically improve model performance by filtering out 77% of training examples!

over 1 year ago

Preferred teaching time: 6-9 PM. The day of the week can be flexible to fit your schedule!

over 1 year ago

Our department is looking for a part-time lecturer for a master's course on “Database Systems for Data Science” for the Spring 2025 semester. Interested, or know someone who is? Shoot me an email at [email protected]. Feel free to share this around! #Hiring #DataScience

over 1 year ago

Details:
• Teach once a week (in New Brunswick), 3 hours, for about 14 weeks
• All teaching materials from past semesters are provided
• We offer a competitive salary!

linjunz_stat retweeted

Weijie Su

@weijie444

almost 2 years ago

weijie444's tweet photo. Very excited to give a short course on large language models at #JSM2024 in Portland! w/ Emily Getzen and @linjunz_stat AI for Stat and Stat for AI! @AmstatNews https://t.co/L5E0gPIdxi

linjunz_stat retweeted

Huaxiu Yao

@HuaxiuYaoML

about 2 years ago

HuaxiuYaoML's tweet photo. 📢Excited to share our approach called Calibrated Self-Rewarding Vision Language Models (CSR)🌟! With no need for labeled data, a VLM can get stronger by itself with visual constraints. Discover how CSR enhances VLMs through self-improvement with visual constraints:

https://t.co/sp60C13QlG

Led by @AiYiyangZ.

Key Idea:
👉1. Each iteration sees the target VLM generating preference data and performing preference optimization.

👉2. Self-generated preferences are guided by reward scores, which are generated by the target VLM itself and calibrated based on image-response relevance.
