Siddhant Pagariya @sidpagariya - Twitter Profile

about 13 hours ago

every company wants to be #1 in their own benchmark

worked with @micro1_ai to have an independently validated benchmark

huge s/o @ArthBohra @donaldwu_ and the rest of the team in making this happen

micro1

@micro1_ai

about 14 hours ago

Today we're publishing LongExtractBench, a benchmark commissioned by @reductoai and independently validated by micro1.

We evaluated seven production document extraction systems across the same 225 complex enterprise documents. The benchmark was intentionally difficult: documents averaged 358 pages and contained roughly 88,700 ground-truth fields each. Every system was evaluated using the configuration documented in the benchmark methodology.

Key findings:
• Reducto Deep Extract was the only system to successfully complete all 225 documents.
• Direct frontier LLM baselines achieved substantially lower completion rates on long, complex documents.
• In this benchmark, dedicated extraction platforms achieved higher completion rates than the direct frontier LLM baselines.
• Recall was the clearest differentiator. Precision remained high across systems, but recall ranged from 33.8% to 99.6%, highlighting which systems consistently captured the information contained in long, complex documents.

The full report includes the benchmark methodology, limitations, and reproducibility resources. Check out the report and results in the comments below.

sidpagariya retweeted

Adit

@aditabrm

about 13 hours ago

Many companies are #1 in a benchmark they crafted.

We worked with @micro1 to create an independently audited benchmark to measure document extraction performance with long documents.

The results of LongExtractBench show the nuances companies are likely to find in the real world. micro1 tested frontier models with max reasoning and document processing platforms with their strongest configurations, and found notable precision/recall and completion tradeoffs across most.

Reducto’s Deep Extract leads the industry by a wide margin. 🧵

sidpagariya retweeted

Reducto

@reductoai

4 days ago

We have an exciting lineup of events for the upcoming @aiDotEngineer conference! Along with some cool giveaways for everyone who finds us 👀

📅 Monday, June 29th: Workshop on how Reducto parsed the Epstein files for the viral @jmailarchive
📍 Room 2024
⏰ 1:15 - 2:15 PM

📅 Monday, June 29th: Fireside Chat with @mintlify & @cognition
🔗 Sign up: https://t.co/XdT3E5QgwS

📅 Tuesday, June 30th: Talk by our CEO @aditabrm
📍 Room 2006
⏰ 1:30 - 1:50 PM

📅 Tuesday, June 30th: Talk by @abhiarya on building for Agent Experience
📍 Expo Stage 2 NW
⏰ 3:45 - 4:05 PM

📅 Tuesday, June 30th: All Day World Cup Viewing Lounge with @baseten & @LangChain
🔗 Sign up: https://t.co/iPwDm50vBB

Siddhant Pagariya

@sidpagariya

6 days ago

it was great to see new faces and talk a bit about the work we do @reductoai and what’s coming up :)

vibha

@vibhayellamraju

6 days ago

loved hearing what everyone was building!

if docs are core to your stack, we’ve got a pretty sweet startup program to help you get started with @reductoai 🤝

sidpagariya retweeted

Adit

@aditabrm

14 days ago

You don't really need Fable. Opus with better inputs outperforms Fable on Surge's GDP.pdf benchmark. It also leads to fewer reasoning tokens, lower latency, and better cost at scale.

sidpagariya retweeted

vibha

@vibhayellamraju

22 days ago

we’re dropping off a few reducto swag boxes to founders building cool things.

including our most loved shirts!

tell us what you’re building and we’ll see you on thursday.

sidpagariya retweeted

Donald

@donaldwu_

25 days ago

CTO started looking over my shoulder when I was coding

sidpagariya retweeted

Reducto

@reductoai

27 days ago

It's Dev Day at Snowflake - we're hosting an FDE Happy Hour with @modal @Snowflake to celebrate! 🍻

3-5PM | Kona's Bar near Moscone

Make sure to RSVP - space is limited: https://t.co/upN8yrkjir

Siddhant Pagariya

@sidpagariya

26 days ago

@palak_agarwal6 @reductoai @vibhayellamraju @theroyalpalis welcome to the team Palak!!

sidpagariya retweeted

Karan Brar

@deepmatmul

27 days ago

https://t.co/Hv1ua6nKGX

Siddhant Pagariya

@sidpagariya

about 1 month ago

@raunakdoesdev do u think we can print a whole dispenser too? 🫢

Siddhant Pagariya

@sidpagariya

about 1 month ago

@adelwu_ crazy we’re also strawberry picking! which farm r u at?

Siddhant Pagariya

@sidpagariya

about 1 month ago

@lin_annabeth @joshnkeezy what about the other 45% lol

Siddhant Pagariya

@sidpagariya

about 1 month ago

@hu_yifei 3rd reason best reason 🫡🫡🚀

sidpagariya retweeted

Raunak

@raunakdoesdev

about 1 month ago

https://t.co/htilIes0SD

sidpagariya retweeted

adel 🌟

@adelwu_

about 1 month ago

doing a fun internal presentation about "why you should post more 101"

Siddhant Pagariya

@sidpagariya

about 1 month ago

@adelwu_ of all the ones this is the only one that’s going to take a lot of time getting used to 😭

Siddhant Pagariya

@sidpagariya

about 1 month ago

@joshnkeezy @donaldwu_ yo that was absolutely a wild time 🫡 i was genuinely so confused until omar dropped the “we’re gonna be parsing epstein files” in the chat 😆

Siddhant Pagariya

@sidpagariya

about 2 months ago

@adelwu_ happy to pay a 50% premium to get out of my current place pls and thanks 😭😩🤝

Siddhant Pagariya

@sidpagariya

about 2 months ago

pov: you’re at an Indian wedding but your brain is 100% in the ai stack.

aunties: can you help us type these biodatas into the computer?

me: send it to reducto for parsing and extraction then feed the JSON output into my brother’s app

aunties: …is he okay???
