Ariel Gera @ArielGera2 - Twitter Profile

Pinned Tweet

30 days ago

Zero-shot instruction-following, nuanced classification and reasoning? ➡️ LLMs! Real-world low-latency retrieval over a large-scale corpus? ➡️ Embedding models! But what if you need BOTH at once? That's what our new paper 💡 is all about... https://t.co/WmDM1JnZA1 🧵

1

11

6

1

623

Ariel Gera @ArielGera2

about 24 hours ago

Are you using Gaussian Mixture Models? Maybe now is a good time to start... The best part about Gal's GMM kernel paper is that we demonstrate that the kernel doesn't just achieve a speedup - it opens new possibilities to do things that were previously unfeasible 🔥

Gal Bloch @gal_bloch

1 day ago

Excited to share Flash-GMM, our new work from IBM Research on scaling Gaussian Mixture Models on GPUs 🚀 Paper: https://t.co/fOWZToMcX9 Code: https://t.co/1dPYk7dKuz A simple idea inspired by FlashAttention enables GMMs to scale to billions of data points on a single GPU.

1

5

2

0

87

0

2

0

60

ArielGera2 retweeted

Sumit @_reachsumit

about 1 month ago

Task-Adaptive Embedding Refinement via Test-time LLM Guidance IBM introduces a test-time query refinement method that uses LLM feedback on top-K documents to optimize query embeddings. 📝 https://t.co/L8O9d89pGk 👨🏽‍💻 https://t.co/Ujpy5yKBKH

0

18

4

14

880

Ariel Gera @ArielGera2

30 days ago

@lateinteraction Very cool and timely work! @dianetc_ We just put out a paper that looks exactly at this gap - how to extend embedding pipelines to scenarios where only LLMs are good at judging relevance. We use an LLM as a test-time teacher to adapt the query embedding: https://t.co/Jg6CEFD6rO

ArielGera2's tweet photo. @lateinteraction Very cool and timely work! @dianetc_
We just put out a paper that looks exactly at this gap - how to extend embedding pipelines to scenarios where only LLMs are good at judging relevance.
We use an LLM as a test-time teacher to adapt the query embedding:
https://t.co/Jg6CEFD6rO https://t.co/hjafZvu5iB

Ariel Gera @ArielGera2

30 days ago

Zero-shot instruction-following, nuanced classification and reasoning? ➡️ LLMs! Real-world low-latency retrieval over a large-scale corpus? ➡️ Embedding models! But what if you need BOTH at once? That's what our new paper 💡 is all about... https://t.co/WmDM1JnZA1 🧵

1

11

6

1

623

0

1

0

95

Who to follow

Asaf Yehudai

@AsafYehudai

#NLProc researcher, CS Ph.D. student at @HebrewU (@nlphuj), and a researcher at @ibmresearch.

#AI Researcher | A jumped-up pantry boy who never knew his place

Ariel Gera @ArielGera2

30 days ago

Plus it's quite fun to watch queries as they gradually walk across embedding space based on the feedback signal 🧑‍🚀 A lot of cool future directions here: like understanding how LLM feedback rearranges the embeddings, or how to wisely select the set of documents for feedback

ArielGera2's tweet photo. Plus it's quite fun to watch queries as they gradually walk across embedding space based on the feedback signal 🧑‍🚀
A lot of cool future directions here: like understanding how LLM feedback rearranges the embeddings, or how to wisely select the set of documents for feedback https://t.co/1vLLsBGXms

0

3

0

59

Ariel Gera @ArielGera2

30 days ago

Zero-shot instruction-following, nuanced classification and reasoning? ➡️ LLMs! Real-world low-latency retrieval over a large-scale corpus? ➡️ Embedding models! But what if you need BOTH at once? That's what our new paper 💡 is all about... https://t.co/WmDM1JnZA1 🧵

1

11

6

1

623

Ariel Gera @ArielGera2

30 days ago

If you are interested in more versatile embedding models, check it out! 📜 Paper: https://t.co/WmDM1JnZA1 🔗 Code: https://t.co/RYbkWbOYkc Our team @IBMResearch: @ShirAshuryTahan @gal_bloch @OhadEytan @assaftl

1

4

0

102

Ariel Gera @ArielGera2

3 months ago

What is GQR you ask? A method that uses test-time gradient updates to boost retrieval quality at a low cost And actually multimodal hybrid retrieval is just *one example* of why this is useful (more on that soon... 😉), so I highly recommend to play with this yourselves!

Omri Uzan

@omri_uzan

3 months ago

Happy to share that GQR was accepted to ICLR! 🇧🇷 We now also have a tutorial that introduces the algorithm in a friendlier way, along with new results and analysis in the paper. Check them out! See you in Rio! https://t.co/Bv4cAVes14

2

10

1

0

4K

0

5

3

2

1K

ArielGera2 retweeted

Shir Ashury-Tahan @ShirAshuryTahan

4 months ago

LLM "robustness" is often treated like a mysterious, standalone capability. But what if it’s not? 🤔 Our new research shows robustness naturally appears when models truly understand a task - competence drives stability. More details in the thread 👇 https://t.co/EQIj7ngASB

ShirAshuryTahan's tweet photo. LLM "robustness" is often treated like a mysterious, standalone capability.
But what if it’s not? 🤔
Our new research shows robustness naturally appears when models truly understand a task - competence drives stability.

More details in the thread 👇
https://t.co/EQIj7ngASB https://t.co/x5roLKHUmT

1

12

11

4

2K

Ariel Gera @ArielGera2

8 months ago

Why I really enjoyed this project: It combines a lot: multimodality + hybrid retrieval + test-time optimization 🤯 At the same time, it is actually quite simple 💡 and helps to achieve more (retrieval quality) with less (compute resources) 🦾 plus @omri_uzan is pretty great

Omri Uzan

@omri_uzan

8 months ago

🚨 NEW PAPER 🚨 Guided Query Refinement (GQR) - a hybrid vision-text retrieval method that matches SOTA performance with 54× less memory and 14× faster inference on Visual Document Retrieval 🔗 https://t.co/DW9emLNIXl 📄 https://t.co/WqssZQ6Q9Y 🧵👇

omri_uzan's tweet photo. 🚨 NEW PAPER 🚨

Guided Query Refinement (GQR) - a hybrid vision-text retrieval method that matches SOTA performance with 54× less memory and 14× faster inference on Visual Document Retrieval
🔗 https://t.co/DW9emLNIXl
📄 https://t.co/WqssZQ6Q9Y
🧵👇 https://t.co/ZkOSoMoCOA

1

26

5

4K

0

6

1

140

ArielGera2 retweeted

Ramon Astudillo @RamonAstudill12

8 months ago

The Generative Model Alignment team at IBM Research is looking for next summer interns! Two candidates for two topics 🍰Reinforcement Learning environments for LLMs 🐎Speculative and non-auto regressive generation for LLMs interested/curious? DM / email [email protected]

0

6

4

2K

Ariel Gera

@ArielGera2

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users