はまち @hama_jp - Twitter Profile

Introducing TabFM, a foundation model designed specifically for tabular data classification & regression. This approach allows generation of high-quality predictions on previously unseen tables in a single forward pass. Learn more and try out the model →https://t.co/OTbVQ8oUQs

GoogleResearch's tweet photo. Introducing TabFM, a foundation model designed specifically for tabular data classification & regression. This approach allows generation of high-quality predictions on previously unseen tables in a single forward pass.

Learn more and try out the model →https://t.co/OTbVQ8oUQs https://t.co/XTD1RCgGjE

46

5K

550

4K

373K

Who to follow

Element 店長

@element5429

香川県高松市北浜町の雑貨屋Elementの店長です！

あおちゃん

@nyacchichan

働けど働けど… Wワークで見も心もぼろぼろです… （今は訪問看護師してます) でも、いつも元気❗️をモットーに今日も頑張ります☺️ 詐欺紹介、撲滅運動します🙋 おこづかい情報や美味しいお店、観光情報などなど教えてくださいね。趣味:プチ旅行、美味しいお店に食べに行く、そのお店の口コミをGoogleに投稿する

yakigac

@yakigac

やきがしと読みます。便利なしくみづくりが好きです。関連: 機械学習画像処理データサイエンスロボコン高専沖縄 @Microsoft

はまち @hama_jp

2 days ago

ただ推論するだけだったらわざわざ買わなくてもいいのにね。

うみゆき@AI研究

@umiyuki_ai

2 days ago

その通り。ここまで頑張って（280万円）マトモに使えない。だからローカルLLMなんかに夢見んなって100回言ってる。それでもローカルLLMを愛好するシネベンチマニアみたいなAIオタク向けにツイートする時もある。ワケ分かってない人がそれ見て「ローカルLLMならタダでAI使い放題や！」とか誤解を招く。一生この悪循環が続く

0

318

39

81

46K

0

1

286

hama_jp retweeted

おざけん

@ozaken_AI

3 days ago

https://t.co/jlBP6PLL0W

5

541

79

719

597K

はまち @hama_jp

2 days ago

@nikkei 船頭が多すぎる気もする

0

31

はまち @hama_jp

2 days ago

@47news_official すごっ!

0

1K

hama_jp retweeted

hata

@hatappo

3 days ago

今日上司に WebMCP を教えてもらったんだけど名前も知らなかった。色んなオンラインの手続きとか、各社がこのインターフェイスを持てば楽になりそうな未来でいいなあ。 https://t.co/Bi6ZZQnZN7

0

29

5

21

3K

はまち @hama_jp

2 days ago

@ebiebi_pg その構文知ってそうな人に、言えなくて困るよねー

0

1

0

499

hama_jp retweeted

DAIR.AI

@dair_ai

2 days ago

Cool new paper from NVIDIA. Looks like agentic coding is moving into hardware design. HORIZON treats hardware design as repository-level code evolution. A Markdown harness becomes a project pack with domain knowledge, an executable evaluator, an acceptance predicate, and a git policy. The agent then evolves an isolated worktree. That is a strong pattern because hardware design needs executable checks. The verifier harness becomes the real interface between the agent and the design task. The paper reports 100% benchmark completion across several hardware design suites, which makes this one worth tracking even if you do not work on EDA. Paper: https://t.co/zoUSIPhYGt Learn to build effective AI agents in our academy: https://t.co/LRnpZN7L4c

dair_ai's tweet photo. Cool new paper from NVIDIA.

Looks like agentic coding is moving into hardware design.

HORIZON treats hardware design as repository-level code evolution. A Markdown harness becomes a project pack with domain knowledge, an executable evaluator, an acceptance predicate, and a git policy.

The agent then evolves an isolated worktree.

That is a strong pattern because hardware design needs executable checks. The verifier harness becomes the real interface between the agent and the design task.

The paper reports 100% benchmark completion across several hardware design suites, which makes this one worth tracking even if you do not work on EDA.

Paper: https://t.co/zoUSIPhYGt

Learn to build effective AI agents in our academy: https://t.co/LRnpZN7L4c

10

153

18

114

15K

はまち @hama_jp

3 days ago

@mikumo_hk ほんとこれ

0

1

0

931

hama_jp retweeted

Dan Kornas

@DanKornas

4 days ago

Transformers are easier to learn when you can poke the model directly. Transformer Explainer is an interactive visualization tool for learning how Transformer-based text-generation models like GPT work. It helps you connect the architecture to real behavior by running a live GPT-2 model in the browser, letting you enter your own text, and showing how internal components work together to predict the next tokens. Key features: • Live GPT-2 in the browser – experiment without setting up a separate model server first • Custom text input – try your own prompts and watch how the model handles them • Internal component views – observe the operations that work together inside the Transformer • Next-token prediction focus – connect each visual step to the model’s token predictions • Local development path – clone the repo, install dependencies, and run it with npm for deeper inspection It’s open-source (MIT license). Link in the reply 👇

DanKornas's tweet photo. Transformers are easier to learn when you can poke the model directly.

Transformer Explainer is an interactive visualization tool for learning how Transformer-based text-generation models like GPT work.

It helps you connect the architecture to real behavior by running a live GPT-2 model in the browser, letting you enter your own text, and showing how internal components work together to predict the next tokens.

Key features:

• Live GPT-2 in the browser – experiment without setting up a separate model server first
• Custom text input – try your own prompts and watch how the model handles them
• Internal component views – observe the operations that work together inside the Transformer
• Next-token prediction focus – connect each visual step to the model’s token predictions
• Local development path – clone the repo, install dependencies, and run it with npm for deeper inspection

It’s open-source (MIT license).

Link in the reply 👇

9

876

144

1K

31K

はまち @hama_jp

3 days ago

@gsgenjitsu 手厳しい

0

1

0

1K

はまち @hama_jp

4 days ago

@shinjukuacc ありがたい

0

26

hama_jp retweeted

elvis

@omarsar0

5 days ago

If you use LLM-as-judge, this one is worth reading. (bookmark it) It's actually one of the most effective ways to use LLM-as-a-Judge for evals. Holistic judge scores hide both their reasoning and their ceiling effects. BINEVAL decomposes each evaluation criterion into atomic yes-or-no questions, answers each independently per output, then aggregates the verdicts into calibrated multi-dimensional scores. Every question-level verdict is inspectable, so you can diagnose exactly why an output scored low, and the same verdicts feed straight back as targeted prompt-improvement signal. Across SummEval, Topical-Chat, and QAGS, it matches or beats UniEval and G-Eval, training-free, with especially strong results on factual consistency. Paper: https://t.co/oar6BZcasm Learn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX