Evi @geteviapp - Twitter Profile

Evi @geteviapp

about 2 hours ago

@giffmana It’s non commercial, and it is nowhere near ChatGPT images. Big question is what’s up with llama5.

0

36

Evi @geteviapp

about 11 hours ago

@zhengyaojiang Where are the prompts you used? I assume you didn’t make the animation and not sharing the prompts? Or did you just waste our time? If you don’t publish the prompts it was complete waste of time because folks at labs will just add this capability in the next model release.

0

9

Evi @geteviapp

1 day ago

@mustafasuleyman @aurko79 DeepSWE scores are what?

0

101

Evi @geteviapp

1 day ago

@dzhulgakov @FireworksAI_HQ Batch size of what?

0

223

Evi @geteviapp

1 day ago

@dosco Did you fall for “RLM” “novelty”? Be careful those folks have very specific goal and it is not for anyone (else) to succeed:)

1

0

58

Evi @geteviapp

1 day ago

@swyx The main point here is that Gemini Ultra 3.1 from 6 months ago used the same number of flops. This shows that raw flops makes no difference. It is like saying that Sun has lots of energy, yes, bit it has less intelligence than a mosquito:)

0

1K

Evi @geteviapp

2 days ago

@dosco @ekzhu So OpenAI invented that 2 years ago? Or was it Karpathy with LLM OS?

0

16

Evi @geteviapp

2 days ago

@gabriel1 Trouble is with gate keeping “researcher” and not doing bare minimum (PhD, PostDoc, few years as a Professor), but “researcher”:))

0

79

Evi @geteviapp

2 days ago

@GabLesperance @lateinteraction RLM? What’s “R”? Is this DSPy Schmidhuber psychosis?

0

115

Evi @geteviapp

3 days ago

@ajvazan @davideciffa All big businesses use cloud services, devices manufactured abroad and tap water. For LLMs both OpenAI and Anthropic have ZDR (zero data retention) on by default. OpenAI further (unlike Anthropic) does not forward data to Google Search, they have their own web index.

0

18

Evi @geteviapp

3 days ago

@ajvazan @davideciffa Local AI makes no sense anyway, so ride the wave by using OpenAI models, they are currently cheapest and most powerful, everything else is slowing you down

1

0

17

Evi @geteviapp

3 days ago

@aaron_epstein @ycombinator That is not a product, that is a single prompt codex plugin demo.

0

1

0

27

Evi @geteviapp

3 days ago

@haider1 DeepSWE is the only bench that matters

0

102

Evi @geteviapp

3 days ago

@chribjel Next 10T model will fix this, “trust me bro” (tm)

0

91

Evi @geteviapp

3 days ago

@kinglycrow @dexhorthy Just be careful with “you’re absolutely right” and “brilliant idea” vs Codex which never compliments you, but just has way more raw power:)

1

0

26

Evi @geteviapp

3 days ago

@LLMJunky @nummanali Harness is just a single prompt to Codex to implement. Kids these days cobble together hardware demos in clubs and voice prompt “also check latest OpenAI/codex architecture and implement this here, in Rust”.

0

1

0

34

Evi @geteviapp

3 days ago

@thomasrice_au What a weird statement. If model can’t do something useful the right path is to ask your coding agent to add datagen and verifier configs (and Slack human evals project manager) and ask your bot to babysit the next mini, small, mid and big runs and tell you early what’s wrong.

1

0

27

Evi @geteviapp

3 days ago

@reach_vb @Dimillian The energy drinks are just coffee and sugar, for brain function much better is very dark chocolate. Now with agents you actually don’t want to be awake, you want maximum creativity within 24h, even if it is just a single voiced in prompt.

1

0

79

Evi @geteviapp

3 days ago

@kinglycrow @dexhorthy Using dumb model for “planning” is a big mistake :)

1

0

27

Evi

@geteviapp

Last Seen Users on Sotwe

Trends for you

Most Popular Users