Eli @ebadgio_ - Twitter Profile

Eli

@ebadgio_

about 10 hours ago

https://t.co/uKPLskaLU6

3

37

6

66

7K

Eli

@ebadgio_

1 day ago

Long array extraction is a core capability we have invested a lot of time in even since the early days at Extend. It's a very challenging problem that is far from just a model problem, you need a purpose built harness that enables foundation models to reliably extract 1,000s of data points over hundred to get the most out of any top model. Our MAX mode for extraction does exactly that through a combination of things like: - Dynamic chunking of large documents based on table sizes/density and schema complexity, with semantic preservation as much as possible - Multiple passes through the full document to make sure all split context is persisted across the extraction over a long document - Heavy usage of smaller models used to detect and fix mechanical issues around a variety of page and section boundary conditions All this together brings us closer to the end goal of *perfect* extraction over any sized document and schema complexity. The most exciting part though is this is using a system we built and launched months ago, we're now working on a v2 that will take it to another level of complexity handling, stay tuned 👀

Kushal Byatnal

@kushalbyatnal

1 day ago

we created a new, open source eval (LongArray-Extract) for one of the hardest problems in document processing: how to extract every row out of long documents some highlights: - Extend's array extraction is SOTA (99.2%) - 3x faster than the next closest competitor (5 min vs 14 min) it's based on examples we've seen in production: > bank statements with 2,000+ transactions > clinical adverse-event listings with 1,000+ events > legal filings with hundreds of numbered factual paragraphs if you've ever built a document pipeline on hundred page docs with thousands of listings, you know exactly how quickly things break we open sourced the benchmark + dataset so teams can inspect the docs, run the harness, and compare results directly

kushalbyatnal's tweet photo. we created a new, open source eval (LongArray-Extract) for one of the hardest problems in document processing: how to extract every row out of long documents

some highlights:
- Extend's array extraction is SOTA (99.2%)
- 3x faster than the next closest competitor (5 min vs 14 min)

it's based on examples we've seen in production:
> bank statements with 2,000+ transactions
> clinical adverse-event listings with 1,000+ events
> legal filings with hundreds of numbered factual paragraphs

if you've ever built a document pipeline on hundred page docs with thousands of listings, you know exactly how quickly things break

we open sourced the benchmark + dataset so teams can inspect the docs, run the harness, and compare results directly

10

50

9

18

4K

0

12

2

0

430

Eli

@ebadgio_

about 1 month ago

@PhilHedayatnia @henloitsjoyce @AirfoilStudio @ExtendHQ We got a lot of love from these ads tbh, and several new customers, appreciate you guys helping us design our new brand!

0

3

0

49

Eli

@ebadgio_

about 1 month ago

Few understand this, but there is no better time to reflect on how to improve your document ingestion pipelines than on your morning commute in nyc

joyce

@henloitsjoyce

about 1 month ago

why are there AI ads on the nyc subway now... how do i get out of this bubble designed by @AirfoilStudio ????

12

45

2

5

87K

3

17

1

0

1K

Who to follow

uncanny valley girl / @onairosapp

Rory Hughes

@rorhug

Helping people launch their vibe-coded app today.

Eli

@ebadgio_

about 2 months ago

@Andercot https://t.co/y7Oljnvp6B

Eli

@ebadgio_

3 months ago

Openclaw is just the Autogpt of 2026, and three months from now just like autogpt no one will be talking about it

0

12

0

558

0

2

0

105

Eli

@ebadgio_

3 months ago

@rabois @the_P_God It was much more @garrytan via a combination of YC and local political initiatives that saved SF. OpenAI was very impactful, but alone would not have led this turn around

0

1

0

530

Eli

@ebadgio_

3 months ago

Openclaw is just the Autogpt of 2026, and three months from now just like autogpt no one will be talking about it

0

12

0

558

Eli

@ebadgio_

3 months ago

@andrewlu0 @paper So clean

0

1

0

51

Eli

@ebadgio_

4 months ago

@calvinchen @MatternJustus @navidkpr Huge, congrats Calvin! Insane execution speed building this out

1

0

181

Eli

@ebadgio_

4 months ago

There are many things I love about our new site, but my favorite is the animation all the way at the bottom for those interested enough to scroll to the end

Kushal Byatnal

@kushalbyatnal

4 months ago

proud to share Extend's updated brand and website! we spent 100s of hours on it, and obsessed over every single detail multiple full redesigns thrown out, fonts swapped and swapped again, every animation tweaked until it felt right...we even locked ourselves in a room and white-boarded every single word on the page until it resonated why? prospects would see a demo and say something like "this is not what I expected based on your site", and they were 100% correct our product has changed so much in the past year, that we wouldn't even recognize the old version (new APIs, capabilities, entirely new categories of problems we now solve). Our old site didn't reflect any of that. we’re proud of how this turned out, and we hope it conveys the level of craft our team obsesses over in everything we ship check out the new site below and please share feedback!

10

40

3

27

4K

0

12

1

758

Eli

@ebadgio_

4 months ago

There were actually a number of internal evals we have at @ExtendHQ that specifically gpt4-0314-32k outperformed on against all subsequent gpt4 models, and claude 3x models, until gpt-4o-0806 and Opus 3.7 Truly a special model. Many did not realize how far you could push it on complex extraction tasks. I’ll also never forget my fist few times interacting with Bing Creative Mode, which to my understanding, was built on top of gpt4-0314

2

20

0

2

2K

Eli

@ebadgio_

4 months ago

if you’re watching the superbowl and love document processing, keep an eye out for the Extend logo this weekend

TBPN

@tbpn

4 months ago

In an effort to expand awareness of technology and business, we bought a Super Bowl ad. See you Sunday on NBC

181

2K

87

250

1M

1

6

0

442

Eli

@ebadgio_

4 months ago

every Capital One Cafe should open up a Brex Bakery counter

0

4

0

265

Eli

@ebadgio_

6 months ago

@jefftangx @creatine_cycle @ExtendHQ Need more bottle golf

0

1

0

62

Eli

@ebadgio_

6 months ago

@karrisaarinen @rsg @linear @tembo @Sentry @codegen @cursor_ai @FactoryAI @github @OpenAI @cognition I was too early https://t.co/cmTFbnQWAf

Eli

@ebadgio_

about 3 years ago

Assign your Linear tasks to gpt4

2

8

0

716

0

1

0

141

ebadgio_ retweeted

jason

@jxnlco

7 months ago

One takeaway that stuck with me from my session with Eli Badgio at @ExtendHQ: Document processing isn’t a prompt problem, it’s a pipeline problem. Getting to 99%+ accuracy means optimizing every step end to end, not just writing better prompts. Notes from the session below.

jxnlco's tweet photo. One takeaway that stuck with me from my session with Eli Badgio at @ExtendHQ: Document processing isn’t a prompt problem, it’s a pipeline problem.

Getting to 99%+ accuracy means optimizing every step end to end, not just writing better prompts.

Notes from the session below. https://t.co/DPUNgi5Ue3

1

19

1

22

6K

Eli

@ebadgio_

7 months ago

@justalexoki Live Player behavior

0

14

0

1K

Eli

@ebadgio_

8 months ago

@rankintweets You clearly haven’t had the pleasure of meeting @andrewlu0 yet

2

5

0

99

Eli

@ebadgio_

8 months ago

This was super fun to build, and even more fun to use

Kushal Byatnal

@kushalbyatnal

8 months ago

Introducing Composer — the first AI Agent for document processing. Get to production-grade accuracy, autonomously in minutes. In our early beta, some teams hit 99% accuracy on complex document tasks in under 10 minutes. Composer is an agent built to optimize schemas the same way a human would (but way faster). Instead of tuning prompts by hand, you point Composer at your eval set inside Extend. Composer will: - analyze where your schema falls short - propose targeted improvements - run multiple experiments in parallel - surface diffs, accuracy gains, and traces behind each change With this launch, Extend is the only product on the market that helps you reach production-grade accuracy this fast. Composer is live for all Extend customers today! Try it out at the link in comments below.

24

470

41

696

75K

2

8

0

498

Eli

@ebadgio_

8 months ago

@jxnlco @nejatian @nejatian just dm’d you!

0

2

0

65

Eli

@ebadgio_

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users