Marco Tulio Ribeiro

@marcotcr

Seattle, WA

Joined May 2009

2 Following

887 Followers

24 Posts

Marco Tulio Ribeiro @marcotcr

about 3 years ago

Testing LLMs (and prompts) like we test software: https://t.co/bZlIhgZEFh TL;DR: (1) You should, (2) How to test: specific properties, evaluate these with LLMs (perception is easier than generation), (3) What to test: get the LLM to help you figure it out.

1

53

11

30

12K

marcotcr retweeted

about 3 years ago

Microsoft open-sources a new AI library that connects to open-source GPTs, not just OpenAI. https://t.co/wxrBtHCmIC

danielgross's tweet photo. Microsoft open-sources a new AI library that connects to open-source GPTs, not just OpenAI. https://t.co/wxrBtHCmIC https://t.co/nCjdA7hj2Z

19

727

158

389

147K

Marco Tulio Ribeiro @marcotcr

about 3 years ago

@sean_lynch We're just writing stuff WE would want to use, and I guess we probably count as 'real developers' :)

1

6

0

1

440

marcotcr retweeted

Andrej Karpathy

about 3 years ago

Also highly relevant: guidance from microsoft "Guidance programs allow you to interleave generation, prompting, and logical control" Also internally handles subtle but important tokenization-related issues, e.g. "token healing". https://t.co/eEc1rywuWP

karpathy's tweet photo. Also highly relevant: guidance from microsoft
"Guidance programs allow you to interleave generation, prompting, and logical control"
Also internally handles subtle but important tokenization-related issues, e.g. "token healing".
https://t.co/eEc1rywuWP https://t.co/DudrisKuV3

3

195

18

77

62K

Who to follow

Sherry Tongshuang Wu

Assist. Prof @SCSatCMU , CS PhD @uwcse. HCI+AI, map general-purpose models to specific use cases! prev. intern @MSFTResearch @GoogleAI @Apple. She/her.

The NLP group at the University of Washington.

Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL / @NLP_MIT (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw

marcotcr retweeted

about 3 years ago

been reading the readme for https://t.co/v1qsPSXiXJ, kind of galaxy brain tl;dr they made a whole prompt engineering language

12

1K

182

840

262K

Marco Tulio Ribeiro @marcotcr

about 3 years ago

Blog post: playing with Vicuna-13B, ChatGPT (3.5), MPT-7B-Chat on harder stuff https://t.co/u2YQEuP6rV TL;DR: We think ChatGPT is still way ahead, but sometimes the extra control from open source models is worth it.

3

297

50

146

78K

Marco Tulio Ribeiro @marcotcr

over 3 years ago

My intern is close to writing a paper, so I wrote her this blog post on writing (part 1 of 2): https://t.co/AoMgl4YHLi

3

306

63

214

0

Marco Tulio Ribeiro @marcotcr

almost 4 years ago

I never tweet, but here is a blog post I wrote for an intern, may be useful for others too... Part 1: https://t.co/DhE55I7ie7 Part 2: https://t.co/ncpJTb4otF

9

548

120

399

0

Marco Tulio Ribeiro @marcotcr

over 9 years ago

@jacswork My guess is you mean LIME : ). I don't know exactly what you mean, but we have follow up work coming out soon!

0

1

0

0

0

Marco Tulio Ribeiro @marcotcr

almost 10 years ago

@BecomingDataSci Twitter is too hard : )

0

0

0

0

0

Marco Tulio Ribeiro @marcotcr

almost 10 years ago

@BecomingDataSci Would love to hear what went wrong if you are willing to share in detail. Would you email me at my handle @gmail.com?

1

0

0

0

0

Marco Tulio Ribeiro @marcotcr

almost 10 years ago

@BecomingDataSci I can share a few additional text or tabular examples if you like, but they all require specific datasets. We have a ton.

1

0

0

0

0

Marco Tulio Ribeiro @marcotcr

almost 10 years ago

@BecomingDataSci Which ones didn't work? The first three should work, so please let me know if there are bugs =]

1

0

0

0

0

Marco Tulio Ribeiro @marcotcr

almost 10 years ago

"Why Should I Trust You?" Explaining the Predictions of Any Classifier. Promo video: https://t.co/Eu4lwhFZrU #kdd2016 @guestrin @sameer_

1

52

15

1

0

Marco Tulio Ribeiro @marcotcr

about 10 years ago

@fmailhot Heh, sorry for opaqueness of my replies. Twitter is not really my thing. Feel free to email me further questions or comments: )

0

0

0

0

0

Marco Tulio Ribeiro @marcotcr

about 10 years ago

@fmailhot Good point about opaque features. If the classifier uses stopwords, LIME should reflect it, so I don't think LIME should remove it

1

0

0

0

0

Marco Tulio Ribeiro @marcotcr

about 10 years ago

@fmailhot Probably don't need LIME for that though, if it's only 5 tokens. Things change in a longer sentence (not even a long document).

1

0

0

0

0

Marco Tulio Ribeiro @marcotcr

about 10 years ago

@fmailhot That is true (only 32 data points), but you may still want to tease out the contribution of each token. e.g 'I do not like that.'

0

0

0

0

0

Marco Tulio Ribeiro @marcotcr

about 10 years ago

@fmailhot It works with documents of any size. We also just added support for tabular (numerical + categorical) data. Maybe images soon.

1

0

0

0

0

marcotcr retweeted

about 10 years ago

Great work from @marcotcr , @sameer_ on explaining any machine learning model (20 newsgroup, deep net). https://t.co/OClODgse0z @guestrin

1

64

30

0

0

Last Seen Users on Sotwe

Trends for you

Most Popular Users