Souvik Mandal @mandalsouvik4 - Twitter Profile

mandalsouvik4 retweeted

Jun Song

@jun_song

3 months ago

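Opus 4.7 token test: because of tokenizer differences, it uses twice as many tokens as Gemini. It also uses 50% more than Opus 4.6. This effectively means the model has become 50% more expensive under the same limit.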

88

3K

232

378

251K

Souvik Mandal @mandalsouvik4

3 months ago

@giffmana 🤔🤔

1

6

0

3

445

Souvik Mandal @mandalsouvik4

3 months ago

@MaziyarPanahi @Yuchenj_UW @sama Whoever forked the repo should now apply for Claude for Open Source, more than 5k stars 😂

1

0

82

Souvik Mandal @mandalsouvik4

3 months ago

Maybe Claude code open sourced its codebase because it cannot figure out all the bugs related to high token usage and open sourcing it is more efficient. 🚀

0

1

0

81

mandalsouvik4 retweeted

Rushabh Nagda @rushabh_nagda

3 months ago

"When a metric becomes the target, it stops being a good metric" - Goodhart's Law. For the last few days, GLM-OCR has been trending after it claimed 95% on OmniDocBench, which is higher than Gemini-3-pro. In reality, GLM-OCR is way worse than the story these benchmarks paint. Let's see how. Full disclaimer: I've been working in this space for the last 7 years with @nanonets

1

12

10

2

478

mandalsouvik4 retweeted

Y Combinator

@ycombinator

3 months ago

Every student accepted into Startup School India now gets $25k+ in AI and cloud credits. Apply, get in, and start building: https://t.co/gncXSJGhdb

140

2K

146

937

388K

Souvik Mandal @mandalsouvik4

3 months ago

@vanstriendaniel Nanonets OCR 3!! Preview model is available for test at https://t.co/oQfNvwx1WT

1

9

0

4

1K

Souvik Mandal @mandalsouvik4

4 months ago

@techNmak Model is not available on hf. Some issue with model configuration https://t.co/JyDMO7CYdF

0

1

0

252

Souvik Mandal @mandalsouvik4

4 months ago

@vanstriendaniel @huggingface Results from the Nanonets OCR 3 intermediate checkpoint look fine: https://t.co/gIettdM1Aj. We will open-source it soon!!

3

25

1

12

852

Souvik Mandal @mandalsouvik4

4 months ago

@ihailmyindia Models are not released yet.

0

54

mandalsouvik4 retweeted

Rann

@rstone_9

8 months ago

A tiny phishing trick broke 5 out of 8 popular OCR models (Deepseek-OCR, Gemini-2.5Pro, PaddleOCR, etc) Can you spot the phish in the screenshot? If you use OCR/Markdown pipelines in prod, check out the results in the thread. 1/n

rstone_9's tweet photo: https://t.co/PWoUutTHcq

12

147

20

122

43K

mandalsouvik4 retweeted

Maziyar PANAHI

@MaziyarPanahi

9 months ago

> RTX Pro 6000 96GB @PrimeIntellect
> Nanonets-OCR @nanonets
> vLLM @vllm_project
> 2000 images from cifar10 @huggingface
one word, damn!

18

340

25

273

230K

Souvik Mandal @mandalsouvik4

9 months ago

@HKydlicek Feel free to try the newer model. We have fixed some of the issues of the last model (Nanonets-OCR-s) that people mentioned. Also, added some of the requested functionalities.

0

1

0

30

Souvik Mandal @mandalsouvik4

9 months ago

Introducing Nanonets-OCR2: a lightweight 3B VLM that transforms documents into clean, structured Markdown and is capable of VQA. We have trained the model on close to 3 million documents. It is multi-lingual and can handle handwritten documents.

mandalsouvik4's tweet photo: https://t.co/9uwhckiCJJ

9

26

7

6

1K

mandalsouvik4 retweeted

Vaibhav (VB) Srivastav

@reach_vb

9 months ago

Nanonets released a new version of their SoTA OCR model 🔥 Supports LaTeX, Multilingual, Complex tables and much more Works out of the box with transformers, vllm and all major runners! 🤗 https://t.co/fgvdp4QIC9

6

471

61

378

30K

Souvik Mandal @mandalsouvik4

9 months ago

@BramVanroy @vanstriendaniel @nanonets @huggingface We have compared against Gemini-Flash, which is generally the SOTA for document understanding tasks. https://t.co/bqZLJVGxt9 We will add more evals over time.

0

50