#LLMs can chat & spin yarns. But can they plan to realize their yarns?
The answer: they suck even at stacking blocks!
But how can this be? Come & check out our work at #NeurIPS2022 FMDM on benchmarks for LLMs on planning. Joint work with @_aolmo_, @sarath_ssreedh and @rao2z.
🥳🎉Dr!! 🎓 Alberto Olmo Hernandez
@_aolmo_ 🎉🥳
Analyzing Failure Modes of Inscrutable Machine Learning Models (Defense Video 👉https://t.co/yrUrvUymYz )
🙏 to the Committee: @liuhuan, Baoxin Li & @sailiks
Intrigued by the profusion of 'em "#LLM's are Zero-shot <XXX>'s" papers, we set out to see how good LLMs are at planning and reasoning about change.
tldr; off-the-shelf #GPT3 is pretty bad at these..
👉https://t.co/JuSjU9xSRY
(w/ @karthikv792, @sarath_ssreedh & @_aolmo_) 1/
#Dallemini's response to the prompt
"Engineering Professor"
*Finally* #AI #Bias is working in my favor y'all! 😂
(No, didn't do anything nefarious; @_aolmo_ ran the prompt; do see his and @niharikajain_az, @maidylm & @sailiks work that presaged this 👉https://t.co/fdjXhsdGCv)
Alberto @_aolmo_ is presenting his PhD proposal; the first post-pandemic in-person (hybrid) Yochan proposal! (The audience are not social distancing from the speaker as much as congregating nearer to the fresh #Samosas--that very Barcelonan delicacy that Alberto got..😋)
📢 Imperfect ImaGANation: Our comprehensive study of how #GAN-based data augmentation techniques can exacerbate biases--is now online at Artificial Intelligence Journal 🍾 (w/ @niharikajain_az, @_aolmo_, @sailiks & @maidylm) #AI #AIEthics #AIbias #AIJ
https://t.co/mUSEfYy8JU