Veronica Qing Lyu

Research scientist @allen_ai | PhD @upennnlp | Vision and Language

3 months ago

Meet KARL, an RL'd model for document-centric tasks at frontier quality and open source cost/speed. Great for @databricks customers and scientists (77-page tech report!) As usual, this isn't just one model - it's an RL assembly line to churn out models for us and our customers 🧵

jefrankle's tweet photo. Meet KARL, an RL'd model for document-centric tasks at frontier quality and open source cost/speed. Great for @databricks customers and scientists (77-page tech report!) As usual, this isn't just one model - it's an RL assembly line to churn out models for us and our customers 🧵

9

244

46

182

72K

veronica3207 retweeted

Davis Blalock

@davisblalock

3 months ago

🚀 Today we’re releasing FlashOptim: better implementations of Adam, SGD, etc, that compute the same updates but save tons of memory. You can use it right now via `pip install flashoptim`. 🚀 https://t.co/nRrLSpjnwV A bunch of cool ideas make this possible: [1/n]

davisblalock's tweet photo. 🚀 Today we’re releasing FlashOptim: better implementations of Adam, SGD, etc, that compute the same updates but save tons of memory. You can use it right now via `pip install flashoptim`. 🚀

https://t.co/nRrLSpjnwV

A bunch of cool ideas make this possible: [1/n] https://t.co/xeaMyWztpv

31

2K

227

1K

219K

Who to follow

Ming Zhong

@MingZhong_

PhD student at UIUC @dmguiuc | Research Fellow @AnthropicAI | Ex. Research Intern at @GoogleDeepmind, @AIatMeta & @MSFTResearch

4 months ago

Many thanks to @KartikSreeni, Samraj Moorjani, @sam_havens, @bemikelive, @alkispolyzotis, @matei_zaharia and other collaborators

0

2

0

142

Matei Zaharia @matei_zaharia

4 months ago

💡New blog on MemAlign: a lightweight dual-memory framework for aligning LLM judges with human feedback It delivers competitive or better quality at orders-of-magnitude lower cost & latency, enabling memory scaling—quality from experience, not more per-query compute.

4 months ago

Agent memory is a simple and powerful way to do continual learning! With the new MemAlign method from Databricks Research, we can build better LLM judges from examples of human ratings, and they scale with more data. Now in Databricks and @MLflow. https://t.co/aMbc8IZ9zb

10

235

38

182

19K

1

6

0

373

veronica3207 retweeted

Andrew Drozdov

@mrdrozdov

5 months ago

Instructed Retriever is a multi-tiered declarative approach for building high quality search agents. It's an example of an "instructed system", which goes beyond prompt tuning and tool calling by passing data among modules which work together to fulfill an information need.

1

36

12

9

6K

veronica3207 retweeted

Krista Opsahl-Ong @kristahopsalong

6 months ago

Today we’re releasing OfficeQA — a new benchmark for end-to-end grounded reasoning that reflects the real work enterprises need AI agents to do. More details below 👇

4

41

18

10

9K

6 months ago

I'll be at the Databricks booth (1619) until 8pm today. Come chat!

6 months ago

I'm missing NeurIPS BUT my extraordinary @databricks colleagues will be there: 🧱 Erich Elsen (multimodal) 🧱 @abaheti95, @gupta__abhay, @jjgort (RL at scale) 🧱 @JacobianNeuro (search) 🧱 @VeronicaLyu (feedback) Hang out with them, and you won't miss me at all 🙂

4

82

7

24

29K

0

6

1

3K

8 months ago

Job posting: https://t.co/oolNQFu6dy

0

2

1

4

1K

8 months ago

🚀 Our Databricks Mosaic Research team are looking for Research Interns for Summer 2026! Our team explores exciting challenges at the intersection of AI and data, especially in how AI agents can help enterprises reason over knowledge and automate data workflows.

6

142

16

139

11K

8 months ago

It’s a place for deep thinking, fast building, and a lot of fun! If you’re a late-stage PhD student passionate about applied AI research, please email me your CV (veronica [dot] lyu [at] databricks [dot] com) with `[Research Intern]` in the title!

0

1

0

9

919

8 months ago

We work on areas like agentic systems for knowledge QA and data engineering, learning from feedback, agent memory, intelligent document parsing …… Check out some of our latest work in our blogs: https://t.co/2YMXjoOhyb

0

2

1

0

798

veronica3207 retweeted

Michael Bendersky @bemikelive

10 months ago

Not that I have a favorite recent project, but... 🧵 LLM judges are the popular way to evaluate generative models. But they have drawbacks. They're: * Generative, so slow and expensive. * Nondeterministic. * Uncalibrated. They don't know how uncertain they are. Meet PGRM!

4

77

15

43

17K

veronica3207 retweeted

10 months ago

Since joining @databricks, our research team has been hard at work on Agent Bricks, a new product that helps enterprises develop state-of-the-art domain-specific agents. We are now releasing a research blog about Agent Learning from Human Feedback (ALHF) https://t.co/2RDs3H6mkY

2

101

20

59

10K

veronica3207 retweeted

11 months ago

I'm at ICML 🇨🇦 and I'm hiring at @databricks. Visit our booth if you're interested. My scientific focus: It's 1972 in AI, there's an AI crisis, Dijkstra isn't here to save us, and maybe RL can. Why Databricks? The long road to AGI is being paved here and we have the real evals 🧵

9

223

24

95

42K

veronica3207 retweeted

Tom Zhang

@tom_jiahao

11 months ago

Introducing Muscle v0 -- infinite degrees of freedom, from @DaxoRobotics. A different mountain to climb - with a far more beautiful peak. We built this from the ground up: - Ultra-dexterous - Built for machine learning - Durable and robust More below (1/n)

34

547

89

193

273K

Matei Zaharia @matei_zaharia

12 months ago

Excited to share what we've been working on for the past few months at @DbrxMosaicAI!

12 months ago

Excited to launch Agent Bricks, a new way to build auto-optimized agents on your tasks. Agent Bricks uniquely takes a *declarative* approach to agent development: you tell us what you want, and we auto-generate evals and optimize the agent. https://t.co/EVqwq583cF

9

265

53

109

49K

0

15

0

1K

over 1 year ago

Paper: https://t.co/Vxb9sCYmz5 Full talk: https://t.co/GQDzF7edPg Collaborators: @MApidianaki , Chris Callison-Burch Presentation details: Wed 11/13 17:15 pm, Oral Session "Interpretability and Analysis of Models for NLP 2" (Brickell)

0

5

1

334

over 1 year ago

🤔What model explanation method should you use? How to ensure it reflects the model’s true reasoning? 🌟 In our CL survey, Towards Faithful Model Explanation in NLP, we review 110+ explainability methods through the lens of faithfulness. Check out my presentation at #EMNLP2024!

veronica3207's tweet photo. 🤔What model explanation method should you use? How to ensure it reflects the model’s true reasoning?

🌟 In our CL survey, Towards Faithful Model Explanation in NLP, we review 110+ explainability methods through the lens of faithfulness.

Check out my presentation at #EMNLP2024! https://t.co/KcbrmEzhqY

1

33

8

2K