AT @SomeTestUser01 - Twitter Profile

AT @SomeTestUser01

8 months ago

@SunoMusic Pro Plan "Making songs is currently disabled. You can still listen to your favorite tunes." wtf?

2

0

91

AT @SomeTestUser01

10 months ago

@GregKamradt But the fourth word in the answer is fourth anyway, whatever it is, so he's right.

0

1

0

87

AT @SomeTestUser01

about 1 year ago

@itsalexvacca Why wasn't ChatGPT o3 / 4.5 in the test?

0

22

AT @SomeTestUser01

about 1 year ago

@alexalbert__ And it doesn't work properly on Windows, very funny

0

156

AT @SomeTestUser01

about 1 year ago

@flowersslop Drop both?

0

196

AT @SomeTestUser01

about 1 year ago

@sama This isn't even news, there was no reason for the insomnia, Sam.

0

10

AT @SomeTestUser01

over 1 year ago

@hume_ai Is this some kind of prehistoric product? Why does it only support English?

0

69

AT @SomeTestUser01

over 1 year ago

Although the author of the article declares Grok-3 the winner, this conclusion is debatable for several reasons: 1. Subjectivity of Evaluation The author claims that Grok-3 "won" because it made the biggest impression. However, impressions are a subjective criterion. In reality, each model has its own strengths and weaknesses, and the choice of the best model depends on specific tasks. GPT-4.5 is better suited for everyday tasks and has improved emotional intelligence. Claude 3.7 focuses on programming and logical reasoning. Grok-3 offers high request limits and strong reasoning capabilities but does not necessarily outperform competitors in all aspects. 2. Incomplete Comparative Testing The author mentions that Grok-3 "convinced the majority" but does not provide objective metrics. He refers to "feelings," but in practice, models need to be tested under different conditions: coding, math, text generation, factual accuracy, and more. Many independent tests still show OpenAI (including Deep Research) leading. Grok-3 has indeed made progress, but there is no concrete evidence that it is universally better. 3. Underestimating GPT-4.5 The author calls GPT-4.5 a "disappointment" while also admitting that it has improved in creative tasks, has better emotional intelligence, and offers a great experience for general users. GPT-4.5 is not aimed at professionals, but that does not make it bad. It is optimized for a broad audience, which aligns with OpenAI's strategy. Its improvements may not feel like a "revolution," but gradual refinement is normal—especially since GPT-4o was already strong. 4. Oversimplified Industry Analysis The author concludes that OpenAI is now "under pressure" and "losing ground." However, in reality: OpenAI remains the market leader with the largest user base. GPT-5 is already in development, and OpenAI is preparing for a significant leap forward. Grok-3 shows promising progress, but xAI is still a relatively new company that must prove the consistency of its success. Conclusion Grok-3 is an impressive contender, but it is too early to declare it the winner. OpenAI still holds the lead, while Grok-3 is catching up and demonstrating strong potential. However, without objective data and large-scale comparative testing, it is premature to claim that it is the best overall model.

0

2

0

1

116

AT @SomeTestUser01

over 1 year ago

@sama Okay, just open a pro plan for Russia.

0

2

0

172

AT @SomeTestUser01

over 1 year ago

@MatthewBerman @GregKamradt Of course not, the amount of computing we will be doing will grow even faster.

0

29

AT @SomeTestUser01

over 1 year ago

@kimmonismus I'd say answer 25 would make more sense, answer 1 is more of a fun gimmick. But if the model could see both ways, that would be great.

0

6

0

424

AT @SomeTestUser01

over 1 year ago

@kimmonismus This is a poorly formulated problem, it may have different solutions, because in classical math 1 does not equal 5, so it is not the traditional mathematical sign = and we have no reason to believe that it has transitivity.

1

60

0

1K

AT @SomeTestUser01

over 1 year ago

@sama We only need two names: AGI and ASI

0

6

AT @SomeTestUser01

over 1 year ago

@OfficialLoganK "ImageFX isn't available in your country yet" Yeah, very impressive.

0

5

0

241

AT @SomeTestUser01

over 1 year ago

@alexalbert__ Are you really wasting your time on nonsense like this instead of releasing Opus 3.5?

0

33

AT @SomeTestUser01

over 1 year ago

@MatthewBerman Glasses is the next computing platform

0

5

AT @SomeTestUser01

over 1 year ago

@MatthewBerman 1 and 2 are also a job for AGI, sorry

0

1

0

36

AT @SomeTestUser01

over 1 year ago

@kimmonismus I guess a list of 100 books is too big for one query in 4o. I asked him to do it for the first 20 and he did a great job.

SomeTestUser01's tweet photo. @kimmonismus I guess a list of 100 books is too big for one query in 4o. I asked him to do it for the first 20 and he did a great job. https://t.co/LDHzkRglsN

0

30

AT @SomeTestUser01

over 1 year ago

@felps_bra @rabrg That statement is from 2017 if I'm not mistaken and doesn't take AGI into account. With automated qualitative research, you can get interesting ideas and efficient algorithms in weeks that would take humans decades and dozens of experiments to create.

0

12

AT @SomeTestUser01

over 1 year ago

@felps_bra @rabrg Of course you're wrong. Experiments are the final stage, moreover, a number of experiments can be carried out at lower capacities. The main thing is to develop new algorithms, approaches and optimize existing ones.

1

0

16

AT

@SomeTestUser01

Last Seen Users on Sotwe

Trends for you

Most Popular Users