Roberto Paredes @RobertoParPal - Twitter Profile

Roberto Paredes @RobertoParPal

over 2 years ago

@Glennalbert @Staphylo_ailus

0

10

Roberto Paredes @RobertoParPal

over 2 years ago

@jimkxa Yes this is why ROCm (misery) is soooo good…

0

305

Roberto Paredes @RobertoParPal

over 2 years ago

Ale, pues ya está…

OpenAI

@OpenAI

over 2 years ago

Introducing Sora, our text-to-video model. Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions. https://t.co/YYpOAcrXQ3 Prompt: “Beautiful, snowy Tokyo city is bustling. The camera moves through the bustling city street, following several people enjoying the beautiful snowy weather and shopping at nearby stalls. Gorgeous sakura petals are flying through the wind along with snowflakes.”

9K

129K

30K

34K

98M

0

6

0

229

Roberto Paredes @RobertoParPal

over 2 years ago

@antor Desiré = de ver ** 🤣

0

10

Who to follow

(Pi)ñeiro

@uxpineiro

Tech for reindustrialization and places worth building. Prev. co-founded @CitizenX

UPV Campus d'Alcoi

@UPVCampusAlcoy

Campus de Alcoy de la Universitat Politècnica de València

COIICV

@COIICV

Colegio Oficial de la Ingeniería Informática de la Comunitat Valenciana

Roberto Paredes @RobertoParPal

over 2 years ago

@antor Puede ser interesante pero Desirés de ver esto… no sé https://t.co/1UDroueIy2

Greg Kamradt

@GregKamradt

over 2 years ago

Pressure Testing GPT-4-128K With Long Context Recall 128K tokens of context is awesome - but what's performance like? I wanted to find out so I did a “needle in a haystack” analysis Some expected (and unexpected) results Here's what I found: Findings: * GPT-4’s recall performance started to degrade above 73K tokens * Low recall performance was correlated when the fact to be recalled was placed between at 7%-50% document depth * If the fact was at the beginning of the document, it was recalled regardless of context length So what: * No Guarantees - Your facts are not guaranteed to be retrieved. Don’t bake the assumption they will into your applications * Less context = more accuracy - This is well know, but when possible reduce the amount of context you send to GPT-4 to increase its ability to recall * Position matters - Also well know, but facts placed at the very beginning and 2nd half of the document seem to be recalled better Overview of the process: * Use Paul Graham essays as ‘background’ tokens. With 218 essays it’s easy to get up to 128K tokens * Place a random statement within the document at various depths. Fact used: “The best thing to do in San Francisco is eat a sandwich and sit in Dolores Park on a sunny day.” * Ask GPT-4 to answer this question only using the context provided * Evaluate GPT-4s answer with another model (gpt-4 again) using @langchain evals * Rinse and repeat for 15x document depths between 0% (top of document) and 100% (bottom of document) and 15x context lengths (1K Tokens > 128K Tokens) Next Steps To Take This Further: * Iterations of this analysis were evenly distributed, it’s been suggested that doing a sigmoid distribution would be better (it would tease out more nuanced at the start and end of the document) * For rigor, one should do a key:value retrieval step. However for relatability I did a San Francisco line within PGs essays. Notes: * While I think this will be directionally correct, more testing is needed to get a firmer grip on GPT4s abilities * Switching up prompt with vary results * 2x tests were run at large context lengths to tease out more performance * This test cost ~$200 for API calls (a single call at 128K input tokens costs $1.28) * Thank you to @charles_irl for being a sounding board and providing great next steps

GregKamradt's tweet photo. Pressure Testing GPT-4-128K With Long Context Recall

128K tokens of context is awesome - but what's performance like?

I wanted to find out so I did a “needle in a haystack” analysis

Some expected (and unexpected) results

Here's what I found:

Findings:
* GPT-4’s recall performance started to degrade above 73K tokens
* Low recall performance was correlated when the fact to be recalled was placed between at 7%-50% document depth
* If the fact was at the beginning of the document, it was recalled regardless of context length

So what:
* No Guarantees - Your facts are not guaranteed to be retrieved. Don’t bake the assumption they will into your applications
* Less context = more accuracy - This is well know, but when possible reduce the amount of context you send to GPT-4 to increase its ability to recall
* Position matters - Also well know, but facts placed at the very beginning and 2nd half of the document seem to be recalled better

Overview of the process:
* Use Paul Graham essays as ‘background’ tokens. With 218 essays it’s easy to get up to 128K tokens
* Place a random statement within the document at various depths. Fact used: “The best thing to do in San Francisco is eat a sandwich and sit in Dolores Park on a sunny day.”
* Ask GPT-4 to answer this question only using the context provided
* Evaluate GPT-4s answer with another model (gpt-4 again) using @langchain evals
* Rinse and repeat for 15x document depths between 0% (top of document) and 100% (bottom of document) and 15x context lengths (1K Tokens > 128K Tokens)

Next Steps To Take This Further:
* Iterations of this analysis were evenly distributed, it’s been suggested that doing a sigmoid distribution would be better (it would tease out more nuanced at the start and end of the document)
* For rigor, one should do a key:value retrieval step. However for relatability I did a San Francisco line within PGs essays.

Notes:
* While I think this will be directionally correct, more testing is needed to get a firmer grip on GPT4s abilities
* Switching up prompt with vary results
* 2x tests were run at large context lengths to tease out more performance
* This test cost ~$200 for API calls (a single call at 128K input tokens costs $1.28)
* Thank you to @charles_irl for being a sounding board and providing great next steps

200

4K

611

2K

1M

1

0

27

Roberto Paredes @RobertoParPal

over 2 years ago

@antor Mira pues LINCE Mistral 7B bastante más barato…

0

1

0

21

RobertoParPal retweeted

Worten

@WortenES

over 2 years ago

Pues el anuncio para el Black Friday nos ha quedado precioso. Esperamos que no te importe que hayamos retocado un poco tu vídeo @lladosfitness 😘

725

22K

4K

1K

4M

Roberto Paredes @RobertoParPal

over 2 years ago

@antor Esto de qué coche es?

0

1

0

169

Roberto Paredes @RobertoParPal

over 2 years ago

@karpathy @OwainEvans_UK Where is the problem?

0

68

RobertoParPal retweeted

Elon Musk

@elonmusk

almost 3 years ago

https://t.co/VzTxpktH1q

14K

155K

24K

4K

49M

Roberto Paredes @RobertoParPal

almost 3 years ago

Lo ha clavado https://t.co/ONdTMcYVfb

0

1

0

54

Roberto Paredes @RobertoParPal

almost 3 years ago

@saivenkataraju @abhi1thakur Yeah barely one token per second

0

48

Roberto Paredes @RobertoParPal

almost 3 years ago

@saivenkataraju @abhi1thakur It is slow but yes you can. Check for instance also the llama2.c project of Karpathy.

1

0

80

Roberto Paredes @RobertoParPal

almost 3 years ago

@harumambaru @abhi1thakur Not sure but I think the it uses int4 and llama 7b so probably could fit but not sure

0

81

Roberto Paredes @RobertoParPal

almost 3 years ago

@DesatranqueJaen @RaulHernandezL

0

2

0

200

Roberto Paredes @RobertoParPal

about 3 years ago

@francoisfleuret Temperature of the room

0

1

0

398

RobertoParPal retweeted

Solver Intelligent Analytics @iasolver

about 3 years ago

La semana pasada celebramos la 2ª edición de nuestro ‘Almuerzo con Inteligencia Artificial’, un encuentro organizado en Madrid que contó nuevamente con Jordi Mansanet, Roberto Paredes y Victoria Corral de Solver como anfitriones y que conecta con empresas que apuestan por la IA.

iasolver's tweet photo. La semana pasada celebramos la 2ª edición de nuestro ‘Almuerzo con Inteligencia Artificial’, un encuentro organizado en Madrid que contó nuevamente con Jordi Mansanet, Roberto Paredes y Victoria Corral de Solver como anfitriones y que conecta con empresas que apuestan por la IA. https://t.co/yKQrBuLHOe

1

5

1

0

311

Roberto Paredes @RobertoParPal

about 3 years ago

@elonmusk @clownworld @CommunityNotes No, it is the border between Spain and Morocco. That happens in a particular days when Spain a Morocco relationships were bad… so clearly Morocco can definitively avoid this but they use that as a weapon.

0

73

Roberto Paredes @RobertoParPal

about 3 years ago

@DrBPChamberlain @tk_rusch @mmbronstein (Pay) Attention is all you need

0

1

0

64

Roberto Paredes

@RobertoParPal

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users