pseudotensor @pseudotensor - Twitter Profile

10 months ago

@an_vo12 Acts like a feature not a bug. The anomalous legs are properly discounted as not real enough for that particular animal. Do it for an unknown animal for which leg counting is not a strongly known prior.

1

0

120

pseudotensor

@pseudotensor

about 1 year ago

For GAIA, the dataset https://t.co/LT2hNgdnvo used is called a "validation" set, and it is leaked all over the internet. They should submit to the official test set. They also excluded results from Trase and https://t.co/PHQmCxqIFh, referring to a 4-month-old result as "previous SOTA": https://t.co/zFKx7Lu7kI and https://t.co/ysQbJ7b028

0

288

pseudotensor

@pseudotensor

over 1 year ago

@manusai You need to submit to the GAIA test set. Your agent may be finding the many copies of the validation dataset online, which allow the agent to cheat and get a high validation score.

0

1

0

418

pseudotensor

@pseudotensor

over 1 year ago

@ylecun 1 year later, LLMs get 65% on the test set vs. degree-holding humans at 92%. Not a bad year, I think. https://t.co/WCmHkkZTfR

0

18

pseudotensor

@pseudotensor

over 1 year ago

@MFarajtabar E.g. a paper you didn't cite uses GSM-Hard and shows a major drop in performance: https://t.co/ZaxwtYpjpX

0

1

0

645

pseudotensor

@pseudotensor

over 1 year ago

@MFarajtabar Similar results for GSM8K hard (https://t.co/PPhialLB8t), which changes all the numbers. If LLMs were just applying some basic math, that shouldn't matter much, but it matters a lot.

0

4

0

680

pseudotensor

@pseudotensor

over 1 year ago

@MFarajtabar Adding irrelevant items and seeing performance drop isn't new. I remember the AI Explained channel talking about this year(s) ago, and it is relevant to his Simple Bench.

0

1

0

491

pseudotensor

@pseudotensor

over 1 year ago

@shishirpatil_ Why is LMSYS in the name and blog? Seems unrelated to them: https://t.co/m05l2AlGdz

0

140

pseudotensor

@pseudotensor

over 1 year ago

🚨 BREAKING: Open-Strawberry aims to recreate OpenAI's o1 as open-source! 🍓 🔓 Democratizing AI 🚀 Accelerating innovation 🌐 Community-driven development Join the revolution: https://t.co/2OcLwKArsi RT to support open AI! 🔄 #OpenSourceAI #AIRevolution

0

122

pseudotensor

@pseudotensor

almost 2 years ago

@MatthewBerman A simple coding agent can do these kinds of things:

0

171

pseudotensor

@pseudotensor

almost 2 years ago

@mattshumer_ @GlaiveAI Very first try failed. Its thinking is poor.

0

31

pseudotensor

@pseudotensor

about 2 years ago

@clefourrier This was done here among other papers: https://t.co/PWtKuVSwCK

0

67

pseudotensor

@pseudotensor

over 2 years ago

Try https://t.co/ypvJCkDacD Mixtral 8x7B at https://t.co/kreoWczRQn

0

1

0

151

pseudotensor

@pseudotensor

about 3 years ago

Massive thanks to @ykilcher and the Open Assistant team for open-sourcing their data. We released fully Apache v2 models and projects, some using their amazing data. This includes fully open 20B models. See: https://t.co/zu8mgfACMc

0

1

0

170

pseudotensor

@pseudotensor

about 3 years ago

Largest fully Apache 2.0 20B-parameter LLM with a free chatbot: https://t.co/1xpqLjUT6b Check out the project at: https://t.co/zu8mgfACMc

0

181

pseudotensor

@pseudotensor

about 3 years ago

20B parameter chatbot speaks on the coming of AGI

Arno Candel

@ArnoCandel

about 3 years ago

https://t.co/nuHzOqRHmc #h2oGPT #GPT #OSS #OpenSource ^ 20B parameters Apache 2.0!

1

15

7

3

2K

1

0

86

pseudotensor

@pseudotensor

about 4 years ago

@epic4kids @andreakhaid That's not what was asked. They asked how to prevent a child from accessing it, like the "learning videos" option but also for "read to me", since it prevents learning to read. I canceled my subscription due to this. On Amazon Kids, you can't even turn off videos.

1

0

pseudotensor

@pseudotensor

over 9 years ago

Sleepy on playground

0

1

0
