Daiki Shiono @onely7_deep - Twitter Profile

4 days ago

🎉 Thrilled to announce that our paper "Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models" received the CVPR Compute Gold Star at #CVPR2026! Congrats to our co-authors!!🙌 https://t.co/mCGCXz2xrg

0

38

4

7

5K

onely7_deep retweeted

Shojiro Yamabe @shojiro_yamabe

6 days ago

Turingのインターンシップで行った研究はCVPR2026 6/6 Poster Session 3 で発表予定です！自分は明日から本会議に参加します、ぜひご飯など行きましょう！

0

20

2

0

2K

onely7_deep retweeted

Shuai Bai

@shuai_bai_

13 days ago

Excited to share Qwen-VLA paper, our exploration of generalist Vision-Language-Action models. It extends Qwen’s multimodal backbone from visual understanding and reasoning to continuous action generation and trajectory prediction. Paper: https://t.co/9jvRW0nI8B

shuai_bai_'s tweet photo. Excited to share Qwen-VLA paper, our exploration of generalist Vision-Language-Action models.
It extends Qwen’s multimodal backbone from visual understanding and reasoning to continuous action generation and trajectory prediction.
Paper:
https://t.co/9jvRW0nI8B https://t.co/4Q7PKA3W7Y

7

585

114

324

76K

onely7_deep retweeted

Ziwei Liu

@liuziwei7

16 days ago

🔥LLaVA-OneVision-2.0 Open Sourced🔥 LLaVA-OneVision series @lmmslab now upgrades to 2.0 with its key advance on *codec-stream tokenization*, which treats highly dynamic video as a continuous bit-cost stream - Tech Report: https://t.co/pFo2fGYj2M - Code: https://t.co/JvRzu96rJ1

liuziwei7's tweet photo. 🔥LLaVA-OneVision-2.0 Open Sourced🔥

LLaVA-OneVision series @lmmslab now upgrades to 2.0 with its key advance on *codec-stream tokenization*, which treats highly dynamic video as a continuous bit-cost stream

- Tech Report: https://t.co/pFo2fGYj2M
- Code: https://t.co/JvRzu96rJ1 https://t.co/d1BgKzQo8I

3

236

45

101

19K

Who to follow

Hiroto Kurita

@hiroto_kurita

machine learning engineer @apple ex: founding ml eng. @kotoba_tech, JSPS DC1

亀井遼平/Ryohei Kamei

@ka_me1024

東北大学 Tohoku NLP Group(鈴木潤研究室) 博士2年

Youmi Ma

@Youmima1015

Assistant Professor, School of Computing, Science Tokyo (TokyoTech). PhD (Engineering). Natural Language Processing and Knowledge Acquisition.

onely7_deep retweeted

Masanari Oi @stjohn2007

about 1 month ago

We propose HATCH🐣, a human-inspired training framework for multi-image spatial reasoning in VLMs 🐤 HATCH improves multi-image spatial reasoning ability while preserving single-image reasoning capabilities 🐓 📚️https://t.co/02Ry5iGmn3

stjohn2007's tweet photo. We propose HATCH🐣, a human-inspired training framework for multi-image spatial reasoning in VLMs 🐤

HATCH improves multi-image spatial reasoning ability while preserving single-image reasoning capabilities 🐓

📚️https://t.co/02Ry5iGmn3 https://t.co/qNCZ8sgbRd

0

23

6

2

2K

onely7_deep retweeted

Yu Yamaguchi | チューリング CTO

@ymg_aq

about 2 months ago

Rust製の軽量ジョブスケジューラ「slotd」を公開しました。 HPC環境でよく使われるジョブスケジューラですが、個人の開発環境でもあったら便利だな…と思い開発しました。コマンドインターフェースや操作感はSlurmに寄せています（完全に互換…とはいきませんが） https://t.co/5HCmsj6ZeL

1

183

38

81

23K

onely7_deep retweeted

Tsubasa Takahashi @tsubasashi

about 2 months ago

東北大学自然言語処理グループのみちのく情報伝達学セミナーで「安全で秘密を守れるAIの実現を目指して」と題して、Confidential AIやAcompanyの研究ビジョンについて講演させていただきました！熱量の高い議論ができて、私もとても楽しかったです。 https://t.co/aGzamj0fEz

tsubasashi's tweet photo. 東北大学自然言語処理グループのみちのく情報伝達学セミナーで

「安全で秘密を守れるAIの実現を目指して」

と題して、Confidential AIやAcompanyの研究ビジョンについて講演させていただきました！
熱量の高い議論ができて、私もとても楽しかったです。

https://t.co/aGzamj0fEz https://t.co/kVOmwFPCdq

0

44

9

3

3K

onely7_deep retweeted

Reina Akama @reinaakama

2 months ago

本日付で、准教授に昇任いたしました。所属は全て変わらずです。センター長兼ボスから辞令をいただき、研究室の皆さまには美しい花束と大きなケーキで盛大すぎるお祝いをしていただきました。嬉しくも身の引き締まる思いでいっぱいです。組織・分野に貢献できるようより一層精進して参ります。

reinaakama's tweet photo. 本日付で、准教授に昇任いたしました。所属は全て変わらずです。センター長兼ボスから辞令をいただき、研究室の皆さまには美しい花束と大きなケーキで盛大すぎるお祝いをしていただきました。嬉しくも身の引き締まる思いでいっぱいです。組織・分野に貢献できるようより一層精進して参ります。 https://t.co/B8zZL8XSUG

9

200

20

5

10K

onely7_deep retweeted

Ryosuke Matsuda @VolumeisRyo

2 months ago

🎉 Excited to share that our paper has been accepted to CVPR 2026 and is now available on arXiv! SLVMEval: Synthetic Meta Evaluation Benchmark for Text-to-Long Video Generation 🔗 https://t.co/eCxMdCKHLn #CVPR2026 #arXiv [1/N]

VolumeisRyo's tweet photo. 🎉 Excited to share that our paper has been accepted to CVPR 2026 and is now available on arXiv!

SLVMEval: Synthetic Meta Evaluation Benchmark for Text-to-Long Video Generation
🔗 https://t.co/eCxMdCKHLn
#CVPR2026 #arXiv [1/N] https://t.co/46TpyiFHqn

1

39

6

8

3K

onely7_deep retweeted

Yu Yamaguchi | チューリング CTO

@ymg_aq

3 months ago

チューリング、自動運転の研究開発を支援する約111時間分のデータセットを公開しました。 End-to-End自動運転システムおよびVision-Language-Action（VLA）モデルの学習・評価への活用を想定しています https://t.co/qLgoBIasqu

0

90

20

30

7K

onely7_deep retweeted

Yu Yamaguchi | チューリング CTO

@ymg_aq

3 months ago

チューリング、VLAモデルによる自動運転走行を実現しました。公道での事例は国内初になるかと思います。 20億パラメータのVLMを追加学習し、運転行動をリアルタイム推論する技術を開発しました。完全自動運転の実現に向け、今後もフィジカルAI開発を加速していきます https://t.co/PV2D7IMmsP

4

326

76

72

56K

onely7_deep retweeted

Baifeng

@baifeng_shi

3 months ago

Humans can see in high-res, high-FPS in real-time. Why can't VLMs? Introducing AutoGaze: ViTs/VLMs "gaze" only at key video regions! Up to 4-100x token savings, 19x speedup, and enables scaling to 4K-res 1K-frame videos. 📄 https://t.co/GhbWZwMAg7 🌐 https://t.co/mEJ991MAIR 🤗 https://t.co/FOfc2QRThi (1/n)🧵

47

2K

202

1K

158K

onely7_deep retweeted

Astral @astral_sh

3 months ago

Astral has entered into an agreement to join OpenAI as part of the Codex team. https://t.co/UkFATfFpop

69

1K

149

111

269K

onely7_deep retweeted

Ai2 @allen_ai

3 months ago

Grounding lets vision-language models do more than describe—they can point to where a robot should grasp, which button to click, or which object to track across video frames. Today we're releasing MolmoPoint, a better way for models to point. 🧵

allen_ai's tweet photo. Grounding lets vision-language models do more than describe—they can point to where a robot should grasp, which button to click, or which object to track across video frames.

Today we're releasing MolmoPoint, a better way for models to point. 🧵 https://t.co/g7fYEOjOpQ

4

235

38

131

60K

onely7_deep retweeted

Weights & Biases Japan

@wandbjapan

3 months ago

W&Bモバイルアプリが iOSでついに公開されました 🚀 どこからでもトレーニングの実行状況をモニタリング。何かが壊れた瞬間にクラッシュアラートを受信。スマートフォンでリアルタイムのメトリクスを確認できます！これは W&Bで数多くリクエストされてきた機能で、ついに実現しました！ https://t.co/Dk9AUR1sFI

1

110

31

36

49K

onely7_deep retweeted

Keito Kudo @k8kudo

3 months ago

数学を解くLLM構築コンペ FT-LLM2026で，オープン部門1位，総合部門でも2位となりました! Tohoku NLP＋αで実現しうる最強メンバー(@mhida90, @onely7_deep @go2oo2 @muyo8692 @r_takahashi_h12 @y_aoneko @kyano__nlp @ma38taniguchi @t_ito0516 @KeisukeS_ @drJunSuzuki)による賜物です! @tohoku_nlp

k8kudo's tweet photo. 数学を解くLLM構築コンペ FT-LLM2026で，オープン部門1位，総合部門でも2位となりました!
Tohoku NLP＋αで実現しうる最強メンバー(@mhida90, @onely7_deep @go2oo2 @muyo8692 @r_takahashi_h12 @y_aoneko @kyano__nlp @ma38taniguchi @t_ito0516 @KeisukeS_ @drJunSuzuki)による賜物です!
@tohoku_nlp https://t.co/sez3cbmEI9

0

41

11

1

5K

onely7_deep retweeted

Keito Kudo @k8kudo

3 months ago

#NLP2026 にて，主著論文「多段算術推論タスクにおける思考の連鎖の忠実性」が委員特別賞を受賞しました! 共著者は@y_aoneko , @ttk_kuribayashi , shusaku sone, @ma38taniguchi , @ana_brrr, @KeisukeS_, @inuikentaro さんです．共著者の皆様の多くのサポートに感謝申し上げます! @tohoku_nlp

1

42

6

2

1K

onely7_deep retweeted

Ryota Tanaka @rtanaka_lab

3 months ago

#NLP2026 にて、優秀賞頂きました！関係者の皆さん、ありがとうございます！

1

54

6

1

5K

Daiki Shiono @onely7_deep

3 months ago

#NLP2026 では、主著１本、共著３本の発表があります。主著は、日本語大規模視覚言語インターリーブデータセット構築とLVLMの性能に対する効果検証に関するお話です。現地参加の方は、・3/10 (火) 11:15-12:45 C会場(2F 大会議室202) にぜひお越しください！お待ちしてます！ turing / @tohoku_nlp