YuWd @yuigawada - Twitter Profile

Pinned Tweet

YuWd @YuigaWada

about 1 year ago

未踏スーパークリエータに認定されましたこれからも頑張ります〜

IPA（情報処理推進機構）

@IPAjp

about 1 year ago

本日、2024年度「未踏IT人材発掘・育成事業」のスーパークリエータ 19名を公表しました。おめでとうございます！🎊 詳細はこちらからご覧��ださい👉https://t.co/SD7pQBheL2 #未踏事業

0

251

50

44

187K

6

180

5

30K

YuWd @YuigaWada

about 11 hours ago

@iorin__io おめでとうございます！

0

1

0

66

YuWd @YuigaWada

5 days ago

[New paper on arXiv!] 🚨 LLM-as-a-Judge suffers from a mismatch between large-vocabulary LM and evaluation over a small label set. 🛠️ We propose Rigel, an LLM-as-a-Judge for image/video captioning, based on a self-distilled score adaptation framework! https://t.co/U1B3Gm2Ima

YuigaWada's tweet photo. [New paper on arXiv!]
🚨 LLM-as-a-Judge suffers from a mismatch between large-vocabulary LM and evaluation over a small label set.

🛠️ We propose Rigel, an LLM-as-a-Judge for image/video captioning, based on a self-distilled score adaptation framework!

https://t.co/U1B3Gm2Ima https://t.co/42JRv85YSc

0

46

9

23

3K

YuWd @YuigaWada

8 days ago

🚀 The LIMIT Workshop (#ECCV2026) is now accepting submissions! ✨Submit your work on representation learning when data, labels, modalities, compute, or supervision are limited. Website: https://t.co/dsFpwCDmgG OpenReview: https://t.co/5gtxFFq6oZ Deadline: July 6, 23:59 AoE

YuigaWada's tweet photo. 🚀 The LIMIT Workshop (#ECCV2026) is now accepting submissions!

✨Submit your work on representation learning when data, labels, modalities, compute, or supervision are limited.

Website: https://t.co/dsFpwCDmgG
OpenReview: https://t.co/5gtxFFq6oZ

Deadline: July 6, 23:59 AoE https://t.co/qdjxQWkBgc

1

13

1

2

2K

Who to follow

hori

@horiy_dev

https://t.co/3GgB57opCP

milkcoffee

@milkcoffeen

アルゴ黄/ヒュ橙/ARC writer/ Engineer @PreferredNetJP

Ryosuke Korekata

@kkrr10_

Ph.D. student in CS @keio_smilab | JSPS DC1 | ex-Visiting scholar @LTIatCMU | ex-Intern @ MERL, TMC, SONY | Interests: Embodied AI, V&L

YuigaWada retweeted

管四

@guansi

13 days ago

最近发现了一个特别��意思的用法。我把 GPT 和 DeepSeek V4 Pro 当高级工程师用，负责写 SOP、写 CI/CD 流程、设计方案。然后把这些东西一股脑丢给 Hermes Agent 里的 Minimax M3 去执行。正常情况下，ChatGPT 一个小时能搞定，DeepSeek 两个小时也差不多了。M3 呢？已经干了一上午，四个小时过去了，还在和 CI 测试死��。日志刷得飞快，错误改了一轮又一轮，就是过不了。但神奇的是，我居然没有生气，反而越来越期待。因为我突然意识到，M3 最有价值的地方可能不是干活，而是验收文档。如果 GPT 写的 SOP，DeepSeek 写的流程，最后连 M3 这种级别的“小白员工”都能一步一步照着执行成功，那说明这个文档是真的写明白了。反过来，如果 M3 卡住了，大概率不是它一个人的问题，而是流程里还有默认知识、隐藏前提和作者自己都没意识到的经验依赖。以前测试用户手册，要找新人。现在测试用户手册，直接找最笨的 Agent。某种意义上说，M3 已经从生产工具变成了��检工具。它就像公司刚入职的新同事。能力可能一般，效率可能不高，但凡是文档里写得不清楚的地方，它一定能精准踩坑给你看。 AI 时代最好的测试工程师，未必是最聪明的模型。有时候恰恰是最笨的那个。

106

3K

555

2K

946K

YuWd @YuigaWada

19 days ago

We created CaptionEvalKit-for-VLMs, a reproducible, all-in-one image captioning evaluation toolkit for VLMs. ✅ One command to use CLIPScore, Polos, CIDEr, LLM-as-a-Judge & more. No dependency hell, one env for all ✅ Reproduce Kendall's τ for any metric https://t.co/Px6iEfVeMJ

0

25

3

8

2K

YuWd @YuigaWada

27 days ago

デンバーからピッツバーグに到着

0

3

0

424

YuWd @YuigaWada

27 days ago

@so_kariyama 出してたんだ！！おめでとうございます🎉🎉

0

1

0

351

YuWd @YuigaWada

30 days ago

@HiroyaforInfo @DreamDive2025 おめでとうございます！！

0

1

0

356

YuWd @YuigaWada

about 1 month ago

[#CVPR2026 Main] 👀 Where & how do MLLMs hallucinate? Now we can tell! 🤖 Meet ZINA: it performs fine-grained hallucination detection and editing 📍 ExHall F, #364 🗓 Jun 7, 11:45–13:45 📄 https://t.co/2C223Rc9hC Work done during my visit to NeuLab (@gneubig) at CMU!

YuigaWada's tweet photo. [#CVPR2026 Main]
👀 Where & how do MLLMs hallucinate? Now we can tell!

🤖 Meet ZINA: it performs fine-grained hallucination detection and editing

📍 ExHall F, #364
🗓 Jun 7, 11:45–13:45
📄 https://t.co/2C223Rc9hC

Work done during my visit to NeuLab (@gneubig) at CMU! https://t.co/OCGaBvCBDc

1

49

6

8

6K

YuWd @YuigaWada

about 1 month ago

CVPRなう

0

8

0

436

YuWd @YuigaWada

about 1 month ago

CVPR行くンゴ‼️ デンバーでお会いしましょう

0

23

0

1K

YuWd @YuigaWada

3 months ago

論文一件をarXivに公開しました！ MLLM-as-a-Judgeのモデル間選好バイアスに関する研究です🤖🤖 S. Koyama*, Y. Wada*, D. Yashima, and K. Sugiura, MLLM-as-a-Judge Exhibits Model Preference Bias https://t.co/ZoDVMXiaRE

0

67

5

26

6K

YuigaWada retweeted

Shivam Duggal @ShivamDuggal4

3 months ago

Tokenization & Generation power Large Models. But are they really separate? Tokenization=Generation under strong observability UNITE: An end-to-end training framework where one shared Generative Encoder (GE) performs both token. & latent denoising Paper: https://t.co/8idMdy123h