Nikil Selvam

3 months ago

@hbXNov @kaiwei_chang @adityagrover_ @VioletNPeng @AnthropicAI congratsss hritik! 🥳

0

1

0

99

NikilSelvam retweeted

Ph.D. student @StanfordAILab | Previous M.S. student @UCLA StarAI Lab

4 months ago

Can we build a blind, *unlinkable inference* layer where ChatGPT/Claude/Gemini can't tell which call came from which users, like a “VPN for AI inference”? Yes! Blog post below + we built it into open source infra/chat app and served >15k prompts at Stanford so far. How it helps with AI user privacy: # The AI user privacy problem If you ask AI to analyze your ChatGPT history today, it’s surprisingly easy to infer your demographics, health, immigration status, and political beliefs. Every prompt we send accumulates into an (identity-linked) profile that the AI lab controls completely and indefinitely. At a minimum this is a goldmine for ads (as we know now). A bigger issue is the concentration of power: AI labs can easily become (or asked to become) a Cambridge Analytica, whistleblow your immigration status, or work with health insurance to adjust your premium if they so choose. This is a uniquely worse problem than search engines because your average query is now more revealing (not just keywords), interactive, and intelligence is now cheap. Despite this, most of us still want these remote models; they’re just too good and convenient! (this is aka the "privacy paradox".) # Unlinkable inference as a user privacy architecture The idea of unlinkable inference is to add privacy while preserving access to the remote models controlled by someone else. A “privacy wrapper” or “VPN for AI inference”, so to speak. Concretely, it’s a blind inference middle layer that: (1) consists of decentralized proxies that anyone can operate; (2) blindly authenticates requests (via blind signatures / RFC9474,9578) so requests are provably sandboxed from each other and from user identity; (3) relays prompts over randomly chosen proxies that don’t see or log traffic (via client-side ephemeral keys or hosting in TEEs); and (4) the provider simply sees a mixed pool of anonymous prompts from the proxies. No state, pseudonyms, or linkable metadata. If you squint, an unlinkable inference layer is essentially a vendor for per-request, anonymous, ephemeral AI access credentials (for users or agents alike). It partitions your context so that user tracking is drastically harder. Obviously, unlinkability isn’t a silver bullet: the prompt itself still goes to the remote model and can leak privacy (so don't use our chat app for a therapy session!). It aims to combat *longitudinal tracking* as a major threat to user privacy, and its statistical power increases quickly by mixing more users and requests. Unlinkability can be applied at any granularity. For an AI chat app, you can unlinkably request a fresh ephemeral key for every session so tracking is virtually impossible. # The Open Anonymity Project We started this project with the belief that intelligence should be a truly public utility. Like water and electricity, providers should be compensated by usage, not who you are or what you do with it. We think unlinkable inference is a first step towards this “intelligence neutrality”. # Try it out! It’s quite practical - Chat app “oa-chat”: https://t.co/ELf8LvxFzX (<20 seconds to get going) - Blog post that should be a fun read: https://t.co/OwFmyFlZH5 - Project page: https://t.co/Swerz1xDE2 - GitHub: https://t.co/38CeKajCy2

62

833

156

792

384K

NikilSelvam retweeted

Neil Rathi

@neil_rathi

5 months ago

New paper, w/@AlecRad Models acquire a lot of capabilities during pretraining. We show that we can precisely shape what they learn simply by filtering their training data at the token level.

neil_rathi's tweet photo. New paper, w/@AlecRad

Models acquire a lot of capabilities during pretraining.

We show that we can precisely shape what they learn simply by filtering their training data at the token level. https://t.co/g0bg78mliO

26

1K

99

666

112K

Who to follow

Meihua Dang

@meihuadang

John Dang

@johnamqdang

AI Researcher, Founding Team at @adaption_ai | Prev @Cohere_Labs @Cohere | LLM Post-Training, RL, Reasoning, Multimodality, Multilinguality

Zhe Zeng

@zhezeng0908

Assist. Prof. @CS_UVA | Faculty fellow @NYU_Courant | CS Ph.D @UCLA | Neurosymbolic AI, Probabilistic ML, Constraints, AI4Science | https://t.co/pZJZxyzZ7W

NikilSelvam retweeted

CLS

@ChengleiSi

5 months ago

Can LLMs automate frontier LLM research, like pre-training and post-training? In our new paper, LLMs found post-training methods that beat GRPO (69.4% vs 48.0%), and pre-training recipes faster than nanoGPT (19.7 minutes vs 35.9 minutes). 1/

ChengleiSi's tweet photo. Can LLMs automate frontier LLM research, like pre-training and post-training?

In our new paper, LLMs found post-training methods that beat GRPO (69.4% vs 48.0%), and pre-training recipes faster than nanoGPT (19.7 minutes vs 35.9 minutes).

1/ https://t.co/k66Wr7JbY5

10

585

140

474

111K

NikilSelvam retweeted

Aryaman Arora

@aryaman2020

7 months ago

🫡 new paper neurons can be a sparse and interpretable basis for circuit tracing, once you make the right decisions about which neurons and how you circuit trace! i'm excited for how this affects future progress on circuits + automating interp

5

190

15

114

22K

10 months ago

@khoomeik no reason for this correspondence to exist, but ngl it bothers me ever so slightly that the length of the differing part in each of the graphemes doesn’t correspond to the position of the tongue 🫠

0

2

0

253

NikilSelvam retweeted

Yanzhe Zhang

@StevenyzZhang

10 months ago

Introducing Generative Interfaces - a new paradigm beyond chatbots. We generate interfaces on the fly to better facilitate LLM interaction, so no more passive reading of long text blocks. Adaptive and Interactive: creates the form that best adapts to your goals and needs!

4

151

40

102

60K

NikilSelvam retweeted

10 months ago

New paper! We explore a radical paradigm for AI evals: assessing LLMs on *unsolved* questions. Instead of contrived exams where progress ≠ value, we eval LLMs on organic, unsolved problems via reference-free LLM validation & community verification. LLMs solved ~10/500 so far:

kenziyuliu's tweet photo. New paper! We explore a radical paradigm for AI evals: assessing LLMs on *unsolved* questions.

Instead of contrived exams where progress ≠ value, we eval LLMs on organic, unsolved problems via reference-free LLM validation & community verification. LLMs solved ~10/500 so far: https://t.co/3TzD9ULEtg

15

368

75

192

67K

NikilSelvam retweeted

Yanzhe Zhang

@StevenyzZhang

11 months ago

Soon, AI agents will act for us—collaborating, negotiating, and sharing data. But can they truly protect our privacy? We simulate privacy-critical scenarios, using alternating search to evolve attacks and defenses, uncovering severe vulnerabilities and building protections.

3

82

30

39

19K

NikilSelvam retweeted

Harshit Joshi

@harshitj__

11 months ago

flying to Vienna 🇦🇹 for ACL to present Genie Worksheets (Monday 11am)! come and say hi if you want to talk about how to create controllable and reliable application layers on top of LLMs, knowledge discovery and curation, or just wanna hang

harshitj__'s tweet photo. flying to Vienna 🇦🇹 for ACL to present Genie Worksheets (Monday 11am)!

come and say hi if you want to talk about how to create controllable and reliable application layers on top of LLMs, knowledge discovery and curation, or just wanna hang

2

46

20

14

10K

NikilSelvam retweeted

Michael Ryan

@michaelryan207

about 1 year ago

New #ACL2025NLP Paper! 🎉 Curious what AI thinks about YOU? We interact with AI every day, offering all kinds of feedback, both implicit ✏️ and explicit 👍. What if we used this feedback to personalize your AI assistant to you? Introducing SynthesizeMe! An approach for creating natural language personal user models from your interactions. 🧵

7

145

39

87

47K

about 1 year ago

also working with @aryaman2020 has made me 73% more bullish on mech interp; prior not disclosed

0

3

0

201

about 1 year ago

there’s so much hiding under simple behavioral metrics 👀

Aryaman Arora

@aryaman2020

about 1 year ago

new paper! 🫡 why are state space models (SSMs) worse than Transformers at recall over their context? this is a question about the mechanisms underlying model behaviour: therefore, we propose using mechanistic evaluations to answer it!

aryaman2020's tweet photo. new paper! 🫡

why are state space models (SSMs) worse than Transformers at recall over their context? this is a question about the mechanisms underlying model behaviour: therefore, we propose using mechanistic evaluations to answer it!

12

664

88

483

81K

1

9

1

3

1K

NikilSelvam retweeted

about 1 year ago

An LLM generates an article verbatim—did it “train on” the article? It’s complicated: under n-gram definitions of train-set inclusion, LLMs can complete “unseen” texts—both after data deletion and adding “gibberish” data. Our results impact unlearning, MIAs & data transparency🧵

kenziyuliu's tweet photo. An LLM generates an article verbatim—did it “train on” the article?

It’s complicated: under n-gram definitions of train-set inclusion, LLMs can complete “unseen” texts—both after data deletion and adding “gibberish” data. Our results impact unlearning, MIAs & data transparency🧵

13

323

86

198

92K

NikilSelvam retweeted

Yangjun Ruan

@YangjunR

over 1 year ago

New paper on synthetic pretraining! We show LMs can synthesize their own thoughts for more data-efficient pretraining, bootstrapping their capabilities on limited, task-agnostic data. We call this new paradigm “reasoning to learn”. https://t.co/yxBMwccAUd Here’s how it works🧵

YangjunR's tweet photo. New paper on synthetic pretraining!

We show LMs can synthesize their own thoughts for more data-efficient pretraining, bootstrapping their capabilities on limited, task-agnostic data. We call this new paradigm “reasoning to learn”.
https://t.co/yxBMwccAUd

Here’s how it works🧵

16

486

95

389

52K

NikilSelvam retweeted

Aryaman Arora

@aryaman2020

over 1 year ago

new paper! 🫡 we introduce 🪓AxBench, a scalable benchmark that evaluates interpretability techniques on two axes: concept detection and model steering. we find that: 🥇prompting and finetuning are still best 🥈supervised interp methods are effective 😮SAEs lag behind

aryaman2020's tweet photo. new paper! 🫡

we introduce 🪓AxBench, a scalable benchmark that evaluates interpretability techniques on two axes: concept detection and model steering.

we find that:
🥇prompting and finetuning are still best
🥈supervised interp methods are effective
😮SAEs lag behind

10

415

66

243

105K

over 1 year ago

@MatternJustus congrats!!

0

2

0

138

NikilSelvam retweeted

over 1 year ago

a big collab on unlearning led by @katherine1ee and @afedercooper!! it always helps to ask *why* and *how* a specific new technology will tangibly help in practice, or if it’s really just a solution searching for a problem. this is especially true for unlearning as of today.

0

25

2

3

3K

over 1 year ago

💬 Drop by our #NeurIPS poster today to chat more! (4/4)

0

1

0

259

over 1 year ago

To what extent can we trade additional parallel compute for lower sampling latency in diffusion models? 🤔 A lot, when you resort to multigrid methods! Presenting Self-Refining Diffusion Samplers (SRDS) to accelerate diffusion sampling through Parareal iterations! 📈 (1/n)

4

19

7

2

3K