AIMultiple @aimultiple - Twitter Profile

AIMultiple @AIMultiple

about 1 month ago

Thank you @brave team!

0

1

39

AIMultiple retweeted

Cem Dilmegani

@dilmegani

6 months ago

We benchmarked Mistral’s new OCR across 300 documents in handwriting, printed media and printed text. OCR 3 is behind Gemini and others. With a 6% difference in a dataset of 300 documents, the difference is statistically significant.

Photo: https://t.co/bQhx0wsdxj

2

7

1

691
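Whether a 6-point gap on 300 documents clears a significance threshold depends on the underlying accuracies, which the tweet does not state. The sketch below is a minimal illustration, not the benchmark's actual method. It assumes "6%" means 6 percentage points, uses hypothetical accuracy pairs, and treats the two tools' results as independent samples. Since both tools were scored on the same documents, a paired test such as McNemar's would likely fit better, but it needs per-document results that are not available here.

```python
import math


def two_proportion_z_test(acc_a, acc_b, n_a, n_b):
    """Two-sided z-test for the difference between two accuracies."""
    # Pooled accuracy under the null hypothesis that both tools are equally accurate.
    pooled = (acc_a * n_a + acc_b * n_b) / (n_a + n_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (acc_a - acc_b) / std_err
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical accuracies: the tweet gives only the 6-point gap, not the baselines.
for acc_a, acc_b in [(0.90, 0.84), (0.56, 0.50)]:
    z, p = two_proportion_z_test(acc_a, acc_b, 300, 300)
    print(f"{acc_a:.2f} vs {acc_b:.2f}: z = {z:.2f}, p = {p:.3f}")
```

Under these assumed baselines, the first pair falls below the usual 0.05 threshold and the second does not, so the same 6-point gap can land on either side depending on where the accuracies sit.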

AIMultiple retweeted

Cem Dilmegani

@dilmegani

6 months ago

AI making our lives easier

0

3

1

0

141

AIMultiple retweeted

Cem Dilmegani

@dilmegani

11 months ago

It reminds me of Grok 4 which aced every good benchmark with a holdout dataset like LiveCodeBench. AI influencers were impressed. Then we tested it and were disappointed. In most cases, like the hallucination benchmark below, it failed to reach the top position.

Photo: https://t.co/2UmJbc8W5u

2

1

279

Cem Dilmegani

@dilmegani

Cem founded AIMultiple, the AI industry analyst. Prior to AIMultiple, he advised enterprises on tech strategy at McKinsey and Altman Solon.

AIMultiple retweeted

Cem Dilmegani

@dilmegani

11 months ago

ChatGPT's new agent almost broke our benchmark. We'll soon need a harder test. This benchmark is not based on a public dataset that is included in OpenAI's models. While we explain the task clearly, the data is not public, therefore models do not have access to it.

Photo: https://t.co/lMeuWSl1Zh

1

347

AIMultiple @AIMultiple

over 1 year ago

For details: https://t.co/JsNKQZHcuY

0

184

AIMultiple @AIMultiple

over 1 year ago

We are introducing AI LMC-Eval, a coding benchmark with 100 questions, tested on 7 leading LLMs. LMC stands for Logic / Math Coding. We presented the LLM with high school level logic and math problems and instructed it to write Python to solve them.

Photo: https://t.co/TGZPGqYekx

2

0

347
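A benchmark of this shape can be scored by running each generated program and comparing its result with a reference answer. The sketch below is only an illustration of that idea, not AIMultiple's harness. The `solve()` convention, the sample question and the answers are all hypothetical, and a real harness would run untrusted model output in a sandbox with a timeout rather than calling `exec` directly.

```python
def score_solution(generated_code, expected):
    """Return True if the generated program's solve() returns the expected answer."""
    namespace = {}
    try:
        # Hypothetical convention: the model is asked to define solve().
        exec(generated_code, namespace)  # unsafe outside a sandbox
        return namespace["solve"]() == expected
    except Exception:
        # Syntax errors, runtime errors and a missing solve() all count as failures.
        return False


# Hypothetical question: "What is the sum of the integers from 1 to 100?"
submissions = [
    "def solve():\n    return sum(range(1, 101))",  # correct
    "def solve():\n    return sum(range(1, 100))",  # off-by-one error
    "def solve(:\n    return 5050",                 # does not parse
]
results = [score_solution(code, 5050) for code in submissions]
accuracy = sum(results) / len(results)
print(f"{sum(results)} of {len(results)} correct, accuracy = {accuracy:.2f}")
```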

AIMultiple @AIMultiple

over 1 year ago

This is a benchmark performed on the holdout set. We published 1 example question but the rest of the 100 questions are not public. Therefore, models can't just respond with the answers in their training set.

1

0

202

AIMultiple @AIMultiple

over 1 year ago

Agentic AI is still mostly hype. We asked 5 AI agents to fetch the prices of a specific product from original sources and got only 20% of the results. Should we try this with other agents? https://t.co/z8lPoL1c59

Photo: https://t.co/XRPu9xKahB

0

5

1

2

407

AIMultiple @AIMultiple

over 3 years ago

#ArtificialIntelligence is a game-changing technology for businesses, but there are still many myths and unclear points about AI. Take a look at our article to learn about them. https://t.co/WMuWEEMyvc #MachineLearning

1

11

2

3K

AIMultiple retweeted

Calsoft Inc. @CalsoftInc

over 3 years ago

@AIMultiple says most companies today allocate nearly 50% of their QA budgets to test automation! Why is automating QA so important? Learn about its impact on rapid product release cycles in our blog: https://t.co/9FagXvR0mc #Calsoft #QA #QAAutomation #ProductEngineering

Photo: https://t.co/Q8xfL9HJAE

0

2

1

0

532

AIMultiple @AIMultiple

over 3 years ago

Digital transformation means integrating digital technologies into all aspects of a business to meet changing market and business requirements. Learn about why digital transformation matters and some use cases: https://t.co/FbtVMbBvmM #digitaltransformation

0

6

0

AIMultiple @AIMultiple

over 3 years ago

Web scraping enables businesses to get a bulk list of their target audience’s email addresses. It reduces human errors in manually entering email addresses into a database and accelerates marketing processes. To learn more, read our comprehensive article. https://t.co/ASAIPokO28

0

2

0

AIMultiple @AIMultiple

over 3 years ago

Psychological factors such as users’ sentiments regarding policy changes or new investments greatly influence how stock prices change. In this article, we’ll explore how sentiment analysis can be applied to stock market forecasts. #StockMarkets https://t.co/2cm9b3XxKt

0

1

0

AIMultiple @AIMultiple

over 3 years ago

@sridharseshadri @dilmegani @SpirosMargaris @JoannMoretti @AkwyZ @DeepLearn007 @Nicochan33 @jblefevre60 Thank you for sharing!

0

1

0

AIMultiple @AIMultiple

over 3 years ago

IoT enables a myriad of different business applications. Knowing those IoT use cases can help businesses integrate IoT technologies into their investment decisions. That is why we created the most comprehensive list of IoT use cases in industries. #IoT https://t.co/XW45jKVlzD

0

2

0

AIMultiple @AIMultiple

over 3 years ago

AI presents opportunities for cybersecurity professionals to improve their cyber defenses, but it also brings new threats as cyber attackers leverage modern, publicly available machine learning algorithms. Check our comprehensive article on AI security. #cyberattack https://t.co/0asjTXNBQJ

0

2

0

AIMultiple @AIMultiple

over 3 years ago

Why can edge computing be a better alternative to the cloud for IoT devices? Here are some examples of IoT devices using edge computing for storage and data processing. https://t.co/wk7vnbfRE7 #edgecomputing

Photo: https://t.co/23kFzAPdbT

0

2

0

AIMultiple @AIMultiple

over 3 years ago

Annotated data is integral to many machine learning and artificial intelligence applications. At the same time, it is one of the most time-consuming and labor-intensive parts of ML projects. Here, we explore what data annotation is and why it matters. https://t.co/gkYQeXLliM

0

2

0
