Garrett Lord @GarrettLord - Twitter Profile

hot take: most model routing today is astrology. degradation and speed, sure. but the actual "which model for which task" decision? public benchmarks measure stuff that has nothing to do with your workload. the companies getting this right built their own evals. everyone else is guessing with extra steps.

4

32

1

5

3K

Garrett Lord

@GarrettLord

2 days ago

@henrytdowling Explain how you would do this technically?

3

1

0

123

Garrett Lord

@GarrettLord

2 days ago

@mustafasuleyman @satyanadella Congrats! Incredible work!

0

3

0

451

GarrettLord retweeted

Sahil Bhaiwala

@Sbhaiwala03

14 days ago

Good framework for the hierarchy of rewards from instant and entirely digital to those that require time and engagement with the real world

1

8

2

1

3K

GarrettLord retweeted

Sahil Bhaiwala

@Sbhaiwala03

15 days ago

Everyone wants to build evals, but few people want to actually read through the data. You can’t have good model intuition without reading the tasks and traces. There is no substitute.

0

3

1

366

GarrettLord retweeted

Sahil Bhaiwala

@Sbhaiwala03

3 days ago

MiniMax performed impressively well on BankerToolBench, suggesting it is a seriously viable option for real-world investment banking workflows. We're launching BTB v2 soon, excited to see how various models perform as the tasks become more rich and complex

0

5

1

0

550

GarrettLord retweeted

Curtis G. Northcutt

@cgnorthcutt

3 days ago

Our new benchmark BankerToolBench makes its first model card!

2

18

10

0

647

Garrett Lord

@GarrettLord

3 days ago

so the bull case is every enterprise on earth has unique judgment worth codifying and someone has to do it. our whole pitch. $X00B+. one pushback: i think kirkland is smarter than the thread gives credit. going internal doesn’t lock them out of harvey later. portable judgment + ride any model curve + still partner with vertical players when ready. sounds like leverage to me. every enterprise should be building applied evals.

FleetingBits

@fleetingbits

3 days ago

some thoughts on kirkland building its own harvey 1) kirkland is spending $500m over four years in order to build its own internal ai legal tools; kirkland intends to spend $100m this year 2) i suspect that kirkland is doing this because they have told themselves that they have valuable data and because they want to appear differentiated 3) i think the first issue is that kirkland probably does not have differentiated data from other elite law firms; at least, not at the level a harvey would absorb 4) all the elite firms probably have similar internal workflow data and so long as some of them defect, that is enough to commoditize the data kirkland wants to use for its platform 5) and, to the extent that they do have different internal workflows, harvey and legora will end up representing a better version of them and this will put kirkland at a disadvantage 6) moreover, companies like kirkland will have difficulty building their internal legal platforms because they do not have experience with software development 7) and, there are both cultural and structural issues with them managing software developers, like they cannot give non-lawyers equity in the firm due to regulation 8) so, i think firms like kirkland are better off using tools like harvey and legora and then looking to focus on where their value really is now: client relationships, local knowledge (litigation, regulation) and legal r&d (novel structures, etc...) 9) anyway, this seems to me like a phenomenon that ai creates across a lot of industries, where firms that were previously vertically integrated become unbundled due to ai because part of the intelligence gets moved to the labs or otherwise gets commoditized 10) and so, a new set of companies are created whose job it is in order to provide services complementary to the labs: forward deployed like harvey and legora and data providers like mercor, surge and handshake

83

747

31

655

2M

1

35

1

18

8K

GarrettLord retweeted

Anish Athalye

@anishathalye

3 days ago

Very cool to see BankerToolBench in a model card! https://t.co/6ct4Vi2ELp

0

13

2

0

2K

Garrett Lord

@GarrettLord

3 days ago

Wonder if there is a company working on this? Bullish on data co's.

Aaron Levie

@levie

4 days ago

This is effectively the #1 problem for AI agents in the enterprise. As we go from agentic coding (where a large amount of context is in the code base, and users are technical enough to get the rest to the agent easily) to a world of knowledge work agents, the context problem becomes much more acute. We see this every day with customers at Box. For existing digital knowledge, it’s often fragmented across legacy systems or environments that don’t play nice with agents, and have access controls that don’t map to the real work that needs to be done, which become a huge hurdle for getting agents the context they need. This has to all get moved to modern, secure cloud environments. But also, companies often haven’t captured and digitized some of the critical context that agents need to work with. Decisions, processes, and workflows often live in people’s heads and tribal knowledge that need to get turned into unstructured data for agents. This is actually one of the biggest points of leverage for applied AI companies, because they can work to specialize in getting agents exactly the information and domain expertise they need. But it’s also one of the reasons why FDEs and new system integrator plays will also work so well right now. The companies that figure this out will be able to get the most out of AI going forward.

122

756

102

858

182K

1

22

0

17

16K

Garrett Lord

@GarrettLord

3 days ago

100%. And the companies that solve it are going to be huge. @joinHandshake

Tom Blomfield

@t_blom

5 days ago

Imagine replacing 90% of your employees with a team of geniuses who have no idea how your company operates. Total chaos. Nothing works. That’s what AI feels like today. The missing piece is extracting all the domain knowledge from people’s heads and providing that as structured context to the models.

464

3K

225

1K

575K

1

16

1

5

6K

Garrett Lord

@GarrettLord

4 days ago

@RaminNasibov Rollercoaster tycoon

0

4

0

319

Garrett Lord

@GarrettLord

5 days ago

@martin_casado Would be so fun to hear about the cat and mouse game! Must be wild.

0

3

0

120

Garrett Lord

@GarrettLord

5 days ago

@Jack_Raines C

0

2

0

1

549

Garrett Lord

@GarrettLord

5 days ago

data former lawyers becoming legal engineers. former bankers becoming finance engineers. the expert isn’t getting automated. the expert is getting hired to teach the model. the most advanced AI use cases are the map. code needs more data than ever. law is following. the rest of the economy is just earlier on the same curve.

GarrettLord's tweet photo. data

former lawyers becoming legal engineers. former bankers becoming finance engineers. the expert isn’t getting automated. the expert is getting hired to teach the model.

the most advanced AI use cases are the map. code needs more data than ever. law is following. the rest of the economy is just earlier on the same curve.

7

67

5

51

7K

Garrett Lord

@GarrettLord

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users