This is a great opportunity to work with world-class faculty. The 16 PhD students who have already joined are excellent and fun. I hope to hear from you! #NLP #nlproc
Final opportunity: Multiple PhD students to start in Fall 2025. Combine neural and symbolic/interpretable models of language, vision, and action, and work with world-class advisors at @Saar_Uni, MPI Informatics, @mpi_sws, @CISPA, @DFKI. Details: https://t.co/CwukTNFYlS
Now hiring: Multiple PhD students to start in fall 2025, for research on combining neural and symbolic/interpretable models of language, vision, and action. Work with world-class advisors at @Saar_Uni, MPI Informatics, @mpi_sws, @CISPA, @DFKI. Details: https://t.co/FWLVjCowHc
I am following my university in leaving Twitter. I would be very pleased if you chose to reconnect with me at https://t.co/A6x29Ulsxs
See you there!
https://t.co/fojL1RL8uT
AutoPlanBench 2.0 now evaluates LLMs as planners on more than 50 domains. ReAct (with GPT-4o) is often worse than symbolic planners, but sometimes better. https://t.co/gvOSkG5yIX #nlproc
Now hiring: Twelve (!) PhD students to start in fall 2025, for research on combining neural and symbolic/interpretable models of language, vision, and action. Work with world-class advisors at @Saar_Uni, MPI Informatics, @mpi_sws, @CISPA, @DFKI. Details: https://t.co/ZjzNZN7DKu
🏆 ACL Best Resource Paper Award:
AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents by Trivedi et al.
#NLProc #ACL2024NLP
@tobycmurray Transformers run in quadratic time, so of course this model can’t solve problems of high computational complexity with perfect accuracy. We discuss this in the paper.
It was fun to apply #NLProc methods to software engineering with my brilliant colleague @AndreasZeller and his student @TurikMammadov. The coolest part, to me, is that you can backtranslate program outputs into program inputs. Let's see where this will go!
Learning models from programs! Given a program P, our MODELIZER learns a model M that mocks P's behavior, producing P's output for a given input. But M is also reversible, predicting inputs for which P produces a given output, with up to 95.4% accuracy: https://t.co/5YKbUkccpD 🧵
Can you use LLMs to replace crowdworkers in NLP evaluations? My amazing collaborators and I analyzed this broadly. Answer: Sometimes LLMs correlate very well with human judgments, but you can't rely on it.
1/5 📣 Excited to share “LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks”! https://t.co/BIqZCmToz1 🚀 We introduce JUDGE-BENCH, a benchmark to investigate to what extent LLM-generated judgements align with human evaluations. #NLProc
Ellie Pavlick @Brown_NLP is visiting us for three months as a @dfg_public Mercator Fellow. We are thrilled to have her with us and look forward to many fruitful research collaborations.
This is a decent summary of the octopus thought experiment from Bender & @alkoller 2020, with two glaring exceptions, right at the start:
https://t.co/zvj3w26jVr
>>
@emilymbender I was inclined to overlook the AI/LLM distinction; they do talk about it a paragraph later.
Of course we didn't argue that the octopus can't "know" anything - but perhaps the distinction between meaning and knowledge is a bit subtle. Nice to see the paper still being cited!
Come work with my amazing colleagues and me on neurosymbolic models for #NLProc and related fields! You'll join our really excellent first nine PhD students and one of the largest research centers for neurosymbolic models in the world. @LstSaar @SIC_Saar
Now hiring: Three PhD students and one postdoc, as part of a research team of ~30, combining neural and symbolic/interpretable models of language, vision, action, and ML. Work with advisors at @Saar_Uni, MPI Informatics, @mpi_sws_, @CISPA, @DFKI. Details: https://t.co/FWLVjCowHc
We have formed a new global partnership with @AxelSpringer and its news products.
Real-time information from @politico, @BusinessInsider, European properties @BILD and @welt, and other publications will soon be available to ChatGPT users.
ChatGPT’s answers to user queries will include attribution and links to full articles for transparency and further information. https://t.co/MVxjDrY73h
Come work with my amazing colleagues and me on neurosymbolic models! You'll join our really excellent first six PhD students and one of the largest research centers for neurosymbolic models in the world. #NLProc @LstSaar @SIC_Saar
Now hiring: Six PhD students and one postdoc, for research on combining neural and symbolic/interpretable models of language, vision, action, and ML. Work with world-class advisors at @Saar_Uni, MPI Informatics, @mpi_sws_, @CISPA, @DFKI. Details: https://t.co/FWLVjCowHc