George Halal @halal_george - Twitter Profile

Pinned Tweet

10 months ago

Excited to share that we trained rerankers at the cost/performance frontier and are open sourcing them! Contextual AI Reranker v2 🚀 Best performing, most efficient reranker 🤗 Open weights (1B, 2B, 6B) 🫡 Instruction-following (including recency-awareness) 🌐 Multilingual 1/4

halal_george's tweet photo. Excited to share that we trained rerankers at the cost/performance frontier and are open sourcing them!

Contextual AI Reranker v2
🚀 Best performing, most efficient reranker
🤗 Open weights (1B, 2B, 6B)
🫡 Instruction-following (including recency-awareness)
🌐 Multilingual

1/4 https://t.co/8sAee1SKWw

6

168

26

124

31K

George Halal @halal_george

7 months ago

Stop building heuristic-based graphs. Start building adaptable, agentic tools. We break down the full system design in this blog post: https://t.co/ufRPQO8HH8 🧵 4/4

0

1

105

George Halal @halal_george

7 months ago

An agentic alternative to GraphRAG. We built a Metadata Search Tool to solve reference traversal without the rigid complexity of static graphs. The result? Agents resolve complex queries in fewer steps with higher accuracy. 🧵 1/4

halal_george's tweet photo. An agentic alternative to GraphRAG.

We built a Metadata Search Tool to solve reference traversal without the rigid complexity of static graphs.

The result? Agents resolve complex queries in fewer steps with higher accuracy.

🧵 1/4 https://t.co/RX8ijiV7aq

1

12

6

7

2K

George Halal @halal_george

7 months ago

Our solution: shift the traversal logic to the agent. Extract “metadata”/“aliases” for each chunk at indexing time. During retrieval, the agent dynamically chooses: Search raw text content OR search metadata to hop to a reference? 🧵 3/4

halal_george's tweet photo. Our solution: shift the traversal logic to the agent.

Extract “metadata”/“aliases” for each chunk at indexing time.

During retrieval, the agent dynamically chooses:
Search raw text content OR search metadata to hop to a reference?

🧵 3/4 https://t.co/lZ3xHSGWCu

2

0

129

Who to follow

Tobias Marriage

@TobiasMarriage

Citizen. Partner. Friend. Son. Sibling. Neighbor. Physicist. Opinions are my own.

Praween with a very long last name 

@psiritanasak

Colin Hill

@jcolinhill

Assistant professor, physics, Columbia. Opinions are my own. (Cover photo: D. Kellner)

George Halal @halal_george

8 months ago

This flexibility is the superpower. You control what to extract: section hierarchies, list of claims, questions the doc answers—whatever fits your use case. GraphRAG locks you into a static workflow. Metadata search adapts to yours. Thanks Jackie Zhang and @sheshanshag for their help on this project! This tool will be available on our platform soon, but contact @ContextualAI for early access.

0

181

George Halal @halal_george

8 months ago

Your metadata IS your graph. Giving our agents access to a metadata search tool boosted our evals by 11%, providing the flexibility of GraphRAG while avoiding all the complexity. It unlocks new capabilities, including reference traversal. Example: 1. Agent finds a doc with references. 2. Agent decides which references to traverse and searches over metadata to fetch them.

halal_george's tweet photo. Your metadata IS your graph.

Giving our agents access to a metadata search tool boosted our evals by 11%, providing the flexibility of GraphRAG while avoiding all the complexity.

It unlocks new capabilities, including reference traversal. Example:
1. Agent finds a doc with references.
2. Agent decides which references to traverse and searches over metadata to fetch them.

3

17

4

17

3K

George Halal @halal_george

8 months ago

Like GraphRAG, we extract structured info from docs at ingestion—each entry becoming a searchable node in the embedding space. Unlike GraphRAG, we skip the heuristic-based graph building and navigation methods, which are often specialized to various domains and show diminishing returns in our ablations. This keeps things fast and adaptable. Adding new docs or changing your metadata schema is trivial.

1

0

1

182

George Halal @halal_george

9 months ago

Woohoo! We made sure not to overfit to benchmarks and focused on its generalization capabilities, so glad to hear that worked :)

search founder @n0riskn0r3ward

10 months ago

.@ContextualAI 's new re-ranker ($0.05 per M tokens) is a bit better than voyage re-rank 2.5 (also $0.05 per M tokens) which is a pretty high bar IMO. ~2% better recall @ 10 in my eval. I'm also not exactly doing standard QA RAG either, so likely a bit out of domain for both.

1

19

3

1

2K

1

9

0

1

479

George Halal @halal_george

10 months ago

@ethan_kim00 and that's why all other rerankers perform poorly on the recency benchmark. Ours was specifically trained to rank retrieved documents as: more_relevant_more_recent_doc > more_relevant_less_recent_doc > less_relevant_more_recent_doc > less_relevant_less_recent_doc

0

81

George Halal @halal_george

10 months ago

Excited to share that we trained rerankers at the cost/performance frontier and are open sourcing them! Contextual AI Reranker v2 🚀 Best performing, most efficient reranker 🤗 Open weights (1B, 2B, 6B) 🫡 Instruction-following (including recency-awareness) 🌐 Multilingual 1/4

6

168

26

124

31K

halal_george retweeted

Michael

@michael_chomsky

10 months ago

Instruction following rerankers are so underrated. You can set arbitrary instructions like ‘sort by candidates that are a good fit for this role’ or ‘article mentions an early stage company’. This is the kind of thing I was hypothesizing years ago, and it’s cool to see the space catch up to theory. The next step will be small models that do binary classification based on a set of arbitrary criteria.

0

16

3

4

2K

George Halal @halal_george

10 months ago

@lgandecki "compared to the 2nd-best rerankers which are up to ~10x more expensive!” I’m now realizing that the line break makes it seem like it’s not part of the sentence above it

1

0

25

halal_george retweeted

Douwe Kiela

@douwekiela

10 months ago

We just released the latest version of our reranker: best performing, most efficient, open weights, instruction following, and multilingual. Try it out in your agentic RAG pipelines!

6

43

4

12

6K

halal_george retweeted

Sheshansh Agrawal

@sheshanshag

10 months ago

Performance on standard retrieval benchmarks like BEIR/ MMTEB hasn't correlated with performance on real world retrieval evaluation datasets for a while now. The causes are twofold: - Relevance is ill-defined and subjective. - Popular retrieval benchmarks are gameable. Here I describe how we tackled these challenges while building our second generation of rerankers. 1/N

sheshanshag's tweet photo. Performance on standard retrieval benchmarks like BEIR/ MMTEB hasn't correlated with performance on real world retrieval evaluation datasets for a while now.

The causes are twofold:
- Relevance is ill-defined and subjective.
- Popular retrieval benchmarks are gameable.

Here I describe how we tackled these challenges while building our second generation of rerankers.

1/N

1

20

6

3K

George Halal @halal_george

10 months ago

Check out our blogpost for more details: https://t.co/2bfN2qxHTi Tagging folks who might find this interesting: @bclavie, @jobergum, @mariaKhalusova, @virattt, @pavelsvitek_, @jerryjliu0, @dzhng, @dani_avila7, @daniel_mac8, @helloiamleonie, @ShengyaoZhuang, @nlpnyc, @swyx, @wowitsmrinal, @spacemanidol, @NateSesti, @n0riskn0r3ward, @michael_chomsky, @mrdbourke, @pelaseyed, @IntuitMachine, @rohanpaul_ai, @tom_doerr, @OptimiseOrDie, @johnjnay, @_akhaliq, @hwchase17, @LuizaJarovsky, @daansan_ml, @daansan_ml, @omarsar0, @_reachsumit, @Aurimas_Gr, @RichardSocher 4/4

0

8

0

2

768

George Halal @halal_george

10 months ago

You can access our rerankers now on 🤗 HuggingFace: https://t.co/kLdsRkkiCl 🌳 Google Model Garden: Available soon. 🟠 API endpoint: The first $50 (1 billion tokens) are free with a business email. Documentation: https://t.co/C0ewJV5apR 👩‍💻 Python SDK (code snippet attached) 3/4

halal_george's tweet photo. You can access our rerankers now on
🤗 HuggingFace: https://t.co/kLdsRkkiCl
🌳 Google Model Garden: Available soon.
🟠 API endpoint: The first $50 (1 billion tokens) are free with a business email. Documentation: https://t.co/C0ewJV5apR
👩‍💻 Python SDK (code snippet attached)

3/4 https://t.co/s8AzB6Mb05

1

6

0

3

1K

halal_george retweeted

Contextual AI

@ContextualAI

10 months ago

🏆 It's official - Contextual AI is now at the top of the FACTS leaderboard for groundedness, beating out strong competition from Gemini 2.5 Pro and GPT-5! Congrats to our research team @w33lliam @rajan__vivek @nandita__naik @Thienhn97 @sheshanshag @shikibmehri on this awesome achievement!

ContextualAI's tweet photo. 🏆 It's official - Contextual AI is now at the top of the FACTS leaderboard for groundedness, beating out strong competition from Gemini 2.5 Pro and GPT-5!

Congrats to our research team @w33lliam @rajan__vivek @nandita__naik @Thienhn97 @sheshanshag @shikibmehri on this awesome achievement!

0

18

6

5

6K

George Halal @halal_george

11 months ago

Another great use case for the instruction-following reranker we trained

Nina Lopatina

@NinaLopatina

11 months ago

We had an interesting meta-learning at @aiDotEngineer World’s Fair from some of the organizers of the MCP track: there has been such an explosion in MCP Server creation, that one of the emerging challenges in this space is selecting the right one for your task.

NinaLopatina's tweet photo. We had an interesting meta-learning at @aiDotEngineer World’s Fair from some of the organizers of the MCP track: there has been such an explosion in MCP Server creation, that one of the emerging challenges in this space is selecting the right one for your task. https://t.co/8cvn23kaUP

1

6

3

2

2K

1

4

1

0

157

halal_george retweeted

William Berrios

@w33lliam

11 months ago

📢 As promised ✨, we're open-sourcing LMUnit! Our SoTA generative model for fine-grained criteria evaluation of your LLM responses 🎯 ✅ SoTA on Flask & BigGbench ✅ SoTA generative reward model on RewardBench2 🤗 Models available on @huggingface: https://t.co/rHe2Xl3wHH 💻 Github repo: https://t.co/Q7vVMG8EWH 📄 Paper: https://t.co/nonydlCszX ✍️ Blog: https://t.co/epyyUyp6hd See more details in the quoted tweet👇

1

34

14

12

7K

George Halal

@halal_george

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users