Joshua Kazdan @JoshuaK92829 - Twitter Profile

3 months ago

@Piotr761303Ueh @jchudnov @RylanSchaeffer @sanmikoyejo @stai_research We start to see identification of semantically equivalent pairs around 3-4B param models. We are not sure at what point it begins to have a negative impact on training.

0

40

Joshua Kazdan @JoshuaK92829

3 months ago

@AlexanderSpangh @jchudnov yes! If you take a look at Fig 2 that's exactly what it shows. The longer you train the model, the more the gradients induced by semantically identical documents align.

0

23

JoshuaK92829 retweeted

Jessica Chudnovsky

@jchudnov

3 months ago

Your deduplication pipeline was built for small models. At scale, it's broken. New preprint: "Scale Dependent Data Duplication" 1/10

jchudnov's tweet photo. Your deduplication pipeline was built for small models. At scale, it's broken.
New preprint: "Scale Dependent Data Duplication"

1/10 https://t.co/xVWu3YKGH5

6

115

27

103

27K

JoshuaK92829 retweeted

Jessica Chudnovsky

@jchudnov

3 months ago

If you train large models, curate pretraining data, or care about whether scaling laws actually hold, this is for you. Preprint: https://t.co/rjH2yPnTdv With @JoshuaK92829, Noam Levi, @RylanSchaeffer, Abhay, Bo, Mehmet, @sanmikoyejo, and David Donoho @stai_research @StanfordAILab @stanfordnlp 10/10

3

41

12

45

8K

Joshua Kazdan @JoshuaK92829

8 months ago

Here's hoping for better luck at ICLR 2026! https://t.co/rm4izHEOnV If you want to read the paper without R7Hk's endorsement: https://t.co/NmJAOnCq72 @DjDvij also made a colab where you can try the attack out for yourself: https://t.co/7U6u9lOM5D

1

2

0

1

405

Joshua Kazdan @JoshuaK92829

8 months ago

So exuberant to announce that our paper "No, of Course I Can! Deeper Fine-Tuning Attacks That Bypass Token-Level Safety Mechanisms" has been rejected from NeurIPS 2025 with an average score of 4! 💪🔥🔥💯 @DjDvij @RylanSchaeffer @sanmikoyejo @ChrisCundy @AbhayPuri98

1

10

4

2

2K

Joshua Kazdan @JoshuaK92829

8 months ago

3. Writing the majority of your review using a language model. It did such a great job! Thanks also to the AC for ignoring us when we reported this review for violating the @NeurIPSConf guidelines against LM reviewing.

JoshuaK92829's tweet photo. 3. Writing the majority of your review using a language model. It did such a great job!

Thanks also to the AC for ignoring us when we reported this review for violating the @NeurIPSConf guidelines against LM reviewing. https://t.co/VFR5Ej6VqH

1

3

0

238

JoshuaK92829 retweeted

Rylan Schaeffer @RylanSchaeffer

11 months ago

New position paper! Machine Learning Conferences Should Establish a “Refutations and Critiques” Track Joint w/ @sanmikoyejo @JoshuaK92829 @yegordb @bremen79 @koustuvsinha @in4dmatics @JesseDodge @suchenzang @BrandoHablando @MGerstgrasser @is_h_a @ObbadElyas 1/6

RylanSchaeffer's tweet photo. New position paper! Machine Learning Conferences Should Establish a “Refutations and Critiques” Track

Joint w/ @sanmikoyejo @JoshuaK92829 @yegordb @bremen79 @koustuvsinha @in4dmatics @JesseDodge @suchenzang @BrandoHablando @MGerstgrasser @is_h_a @ObbadElyas

1/6 https://t.co/PBH1IgmL2z

12

431

50

135

94K

Joshua Kazdan @JoshuaK92829

12 months ago

@casper_hansen_ @RylanSchaeffer There's no contradiction. We don't claim that min-p is better or worse than other logit processors-- we contend only that the evidence in Minh et. al. does not meet scientific standards to claim superiority.

0

3

0

154

JoshuaK92829 retweeted

Rylan Schaeffer @RylanSchaeffer

12 months ago

A bit late to the party, but our paper on predictable inference-time / test-time scaling was accepted to #icml2025 🎉🎉🎉 TLDR: Best of N was shown to exhibit power (polynomial) law scaling (left), but maths suggest one should expect exponential scaling (center). We show how to ... 1/3

RylanSchaeffer's tweet photo. A bit late to the party, but our paper on predictable inference-time / test-time scaling was accepted to #icml2025 🎉🎉🎉

TLDR: Best of N was shown to exhibit power (polynomial) law scaling (left), but maths suggest one should expect exponential scaling (center). We show how to ...

1/3

9

116

16

62

18K

JoshuaK92829 retweeted

Rylan Schaeffer @RylanSchaeffer

12 months ago

🚨New preprint 🚨 Turning Down the Heat: A Critical Analysis of Min-p Sampling in Language Models We examine min-p sampling (ICLR 2025 oral) & find significant problems in all 4 lines of evidence: human eval, NLP evals, LLM-as-judge evals, community adoption claims 1/8

RylanSchaeffer's tweet photo. 🚨New preprint 🚨

Turning Down the Heat: A Critical Analysis of Min-p Sampling in Language Models

We examine min-p sampling (ICLR 2025 oral) & find significant problems in all 4 lines of evidence: human eval, NLP evals, LLM-as-judge evals, community adoption claims

1/8 https://t.co/ZaT1nkJTg0

12

283

33

216

75K

JoshuaK92829 retweeted

Rylan Schaeffer @RylanSchaeffer

about 1 year ago

Interested in test time / inference scaling laws? Then check out our newest preprint!! 📉 How Do Large Language Monkeys Get Their Power (Laws)? 📉 https://t.co/AQzxg3rUZU w/ @JoshuaK92829 @sanmikoyejo @Azaliamirh @jplhughes @jordanjuravsky @sprice354_ @aengus_lynch1 @_robertkirk

RylanSchaeffer's tweet photo. Interested in test time / inference scaling laws?

Then check out our newest preprint!!

📉 How Do Large Language Monkeys Get Their Power (Laws)? 📉

https://t.co/AQzxg3rUZU

w/ @JoshuaK92829 @sanmikoyejo @Azaliamirh @jplhughes @jordanjuravsky @sprice354_ @aengus_lynch1 @_robertkirk

6

224

37

151

95K

JoshuaK92829 retweeted

Jason Weston

@jaseweston

over 1 year ago

🚨 New Paper 🚨 An Overview of Large Language Models for Statisticians 📝: https://t.co/oklTYEAMvH - Dual perspectives on Statistics ➕ LLMs: Stat for LLM & LLM for Stat - Stat for LLM: How statistical methods can improve LLM uncertainty quantification, interpretability, trustworthiness & more. - LLM for Stat: How LLMs can enhance statistical workflows: from data collection, synthesis, annotation to statistical modeling, with applications to medical research Presents key LLM advances: Architecture, Training, Reasoning, and Self-Alignment: (1) 🧠Evolution of LLM architectures with Transformers and Self-Attention (2) LLM training pipeline from pre-training, SFT, to RLHF and Preference Optimization. (3) 💭 System 2 Prompting and Chain-of-Thought for test-time scaling . (4) 🚀 LLM Self-Alignment for achieving super-human intelligence Statisticians play a key role in the development of large-scale AI models: (1) 💡 Statistical insights improve LLM uncertainty quantification & interpretability (2) 🤖 Watermarking for AI-generated content detection (3) ⚖️ Privacy & algorithmic fairness to ensure responsible AI adoption LLMs can also empower statistical science by: (1) 📈 Scaling up data collection, synthesis, and annotation. (2) 🖥️ Automating statistical coding & exploratory analysis (3) 🔬 Facilitating medical research By bridging statistics & AI, we can: ✅ Improve better LLMs with statistical methodologies. ✅ Leverage LLMs for statistical applications in high-stakes domains

jaseweston's tweet photo. 🚨 New Paper 🚨
An Overview of Large Language Models for Statisticians
📝: https://t.co/oklTYEAMvH

- Dual perspectives on Statistics ➕ LLMs: Stat for LLM & LLM for Stat
- Stat for LLM: How statistical methods can improve LLM uncertainty quantification, interpretability, trustworthiness & more.
- LLM for Stat: How LLMs can enhance statistical workflows: from data collection, synthesis, annotation to statistical modeling, with applications to medical research

Presents key LLM advances: Architecture, Training, Reasoning, and Self-Alignment:
(1) 🧠Evolution of LLM architectures with Transformers and Self-Attention
(2) LLM training pipeline from pre-training, SFT, to RLHF and Preference Optimization.
(3) 💭 System 2 Prompting and Chain-of-Thought for test-time scaling .
(4) 🚀 LLM Self-Alignment for achieving super-human intelligence

Statisticians play a key role in the development of large-scale AI models:
(1) 💡 Statistical insights improve LLM uncertainty quantification & interpretability
(2) 🤖 Watermarking for AI-generated content detection
(3) ⚖️ Privacy & algorithmic fairness to ensure responsible AI adoption

LLMs can also empower statistical science by:
(1) 📈 Scaling up data collection, synthesis, and annotation.
(2) 🖥️ Automating statistical coding & exploratory analysis
(3) 🔬 Facilitating medical research

By bridging statistics & AI, we can:
✅ Improve better LLMs with statistical methodologies.
✅ Leverage LLMs for statistical applications in high-stakes domains

0

222

55

125

19K

JoshuaK92829 retweeted

Krishnamurthy (Dj) Dvijotham @DjDvij

about 1 year ago

(1/n) Fine tuning APIs create significant security vulnerabilities, breaking alignment in frontier models for under $100! Introducing NOICE, a fine-tuning attack that requires just 1000 training examples to remove model safeguards. The strangest part: we use ONLY harmless data.

DjDvij's tweet photo. (1/n) Fine tuning APIs create significant security vulnerabilities, breaking alignment in frontier models for under $100!
Introducing NOICE, a fine-tuning attack that requires just 1000 training examples to remove model safeguards. The strangest part: we use ONLY harmless data. https://t.co/uR59MTg2WP

1

33

6

9

3K

Joshua Kazdan @JoshuaK92829

over 1 year ago

@arundsharma @belindmo @KyssenYu @proudmpala @sanmikoyejo @pydantic Thanks for bringing this up-- I'm surprised to hear you got such a low accuracy. We're happy to share our evals. Let's connect over email?

0

45

Joshua Kazdan

@JoshuaK92829

Last Seen Users on Sotwe

Trends for you

Most Popular Users