Markus Hofmarcher

Sepp Hochreiter @HochreiterSepp

about 2 years ago

Excited to present our work “Large Language Models Can Self-Improve At Web Agent Tasks”. We show that synthetic data self-improvement boosts task completion by 31% on WebArena and introduce quality metrics for measuring autonomous agent workflows. #AI #MachineLearning #LLMs [1/n]

DinuMariusC's tweet photo: https://t.co/qBIV6lfw6h

5

69

19

39

14K

mrkhof retweeted

about 2 years ago

I am so excited that xLSTM is out. LSTM is close to my heart - for more than 30 years now. With xLSTM we close the gap to existing state-of-the-art LLMs. With NXAI we have started to build our own European LLMs. I am very proud of my team. https://t.co/IH7giCe3gd

46

2K

359

739

277K

mrkhof retweeted

over 2 years ago

🚀 SymbolicAI – a framework for logic-based approaches combining generative models and solvers. Alongside, we introduce a benchmark and empirical measure to evaluate SOTA LLMs in AI-centric workflows. Read more in our paper https://t.co/H49rfzf8tv #MachineLearning 🧠💡[1/n]

2

236

62

179

56K

mrkhof retweeted

Elisabeth Rumetshofer @LizRumetshofer

over 2 years ago

Interested in a semantic memory for reinforcement learning? I was recently invited to a podcast talking about our #NeurIPS2023 paper: Semantic HELM (https://t.co/eRSbVWkFaz). In case you are interested, you can stream the episode here: https://t.co/NxCQCx6mIS

1

23

15

1

2K

mrkhof retweeted

over 2 years ago

🎉 Exciting news! Our latest work has been published in Nature Communications. 🎉 CLOOME utilizes contrastive learning to connect microscopy images and chemical structures, paving the way for major advancements in drug discovery and beyond.🌟🔬💊 📜https://t.co/fH0wunVFLH

0

26

8

2

3K

mrkhof retweeted

over 2 years ago

Personal update: last month, I re-joined the group of my mentor @HochreiterSepp and my amazing colleague @gklambauer in Linz, opening my own group "AI for data-driven simulations". We all share the vision to create a large-scale AI ecosystem in Linz. Big news to come soon 🚀

jo_brandstetter's tweet photo: https://t.co/sK4mcoGMp9

18

244

24

10

41K

mrkhof retweeted

Kajetan Schweighofer @kschweig_

almost 3 years ago

Thanks @_akhaliq for sharing! SITTA unlocks zero-shot image captioning via a generative language model by aligning its embedding space with that of a pretrained vision encoder without any access to gradient information. 1/6

PaischerFabian's tweet photo: https://t.co/LD6Jo3dZLu

1

74

37

33

50K

mrkhof retweeted

AK

@_akhaliq

almost 3 years ago

SITTA: A Semantic Image-Text Alignment for Image Captioning

paper page: https://t.co/cUYfM0UrJK

Textual and semantic comprehension of images is essential for generating proper captions. The comprehension requires detection of objects, modeling of relations between them, an assessment of the semantics of the scene and, finally, representing the extracted knowledge in a language space. To achieve rich language capabilities while ensuring good image-language mappings, pretrained language models (LMs) were conditioned on pretrained multi-modal (image-text) models that allow for image inputs. This requires an alignment of the image representation of the multi-modal model with the language representations of a generative LM. However, it is not clear how to best transfer semantics detected by the vision encoder of the multi-modal model to the LM. We introduce two novel ways of constructing a linear mapping that successfully transfers semantics between the embedding spaces of the two pretrained models. The first aligns the embedding space of the multi-modal language encoder with the embedding space of the pretrained LM via token correspondences. The latter leverages additional data that consists of image-text pairs to construct the mapping directly from vision to language space. Using our semantic mappings, we unlock image captioning for LMs without access to gradient information. By using different sources of data we achieve strong captioning performance on MS-COCO and Flickr30k datasets. Even in the face of limited data, our method partly exceeds the performance of other zero-shot and even finetuned competitors. Our ablation studies show that even LMs at a scale of merely 250M parameters can generate decent captions employing our semantic mappings. Our approach makes image captioning more accessible for institutions with restricted computational resources.

0

66

15

24

54K

mrkhof retweeted

almost 3 years ago

🚀 Excited to share our latest research on quantifying the predictive uncertainty of machine learning models. QUAM searches for adversarial models (not adversarial examples!) to better estimate the epistemic uncertainty, the uncertainty about chosen model parameters. 1/5

kschweig_'s tweet photo: https://t.co/bOif6d1f1F

4

247

65

143

58K

mrkhof retweeted

Thomas Schmied @thsschmied

almost 3 years ago

Excited to share our recent work on parameter-efficient fine-tuning in RL. We pre-train a Decision Transformer (DT) on 50 tasks from two domains, and subsequently fine-tune on various down-stream tasks. Joint work with @mrkhof, @PaischerFabian, Razvan, and @HochreiterSepp. 1/n

thsschmied's tweet photo: https://t.co/mmsofob7kh

1

43

19

9

7K

mrkhof retweeted

Johannes Schimunek @JSchimunek

about 3 years ago

Excited to share our latest work on a semantic and interpretable memory module for RL! Complementary to recent developments in the realm of explainable AI, we focus on interpretability w.r.t. the memory of an agent. 1/n

PaischerFabian's tweet photo: https://t.co/7DJEOZdDev

1

101

44

25

33K

mrkhof retweeted

about 3 years ago

🚀 Excited to share our #ICLR2023 work on
🚨 context-enriched molecule representations🚦 improve few-shot drug discovery 💊 🚨
Paper: https://t.co/uW6Ft9zZj2
App: HuggingFace 🤗 under prep!
#ICLR2023 🧑‍💼 poster 🗨: https://t.co/d89129uwdv
⏰ Wed 3 May 4:30 pm - 6:30 pm CAT

JSchimunek's tweet photo: https://t.co/0rWDYv51id

2

41

21

5

6K

mrkhof retweeted

over 3 years ago

We are excited to present our work, combining the power of a symbolic approach and Large Language Models (LLMs). Our Symbolic API bridges the gap between classical programming (Software 1.0) and differentiable programming (Software 2.0). GitHub: https://t.co/eYmfKFOWBz [1/n]

DinuMariusC's tweet photo: https://t.co/PHoRaey253

22

583

123

296

220K

mrkhof retweeted

over 3 years ago

This includes fact-based generation of text, flow control of a generative process towards a desired outcome, and interpretability within generative processes. GitHub: https://t.co/eYmfKFOWBz [5/n]

4

55

12

21

5K

mrkhof retweeted

about 4 years ago

Excited to share our work on history compression via language models in RL, presented at #ICML2022🤩🤩. Our novel framework HELM⎈ augments an agent with a history compression module which leverages a pretrained language Transformer without any training or finetuning 🤯🤯 1/5

3

74

22

23

0

mrkhof retweeted

over 4 years ago

Wow, wanna see how to beat CLIP with the new CLOOB? Fantastic work led by my colleagues @fuerst_andreas and @LizRumetshofer (Sepp Hochreiter's group) applying modern Hopfield networks to image-text data. Paper: https://t.co/uATlB6nr9D Blogpost: https://t.co/RLKgNYtL8M

jo_brandstetter's tweet photo: https://t.co/XOUTOOvPcq

2

87

39

19

0

mrkhof retweeted

over 5 years ago

Our paper "Hopfield Networks is All You Need" is accepted at #ICLR2021. Time to give some talks :) I am very honored to present our research today at the great platform of @ml_collective @savvyRL (https://t.co/vb2n5cMjcL).

jo_brandstetter's tweet photo: https://t.co/mmpCKom7x4

6

177

42

30

0

mrkhof retweeted

Forest @forestapp_cc

over 5 years ago

【Final Sprint: #1MTreeChallenge】 Forest has in total planted 980 thousand trees and is about to hit 1 million now! Let’s cross this milestone together: we will donate 1 tree for every 10 Likes or 1 Retweet of this tweet.🌲 Save the Earth at your fingertips🌏!

forestapp_cc's tweet photo: https://t.co/DWH0GtvwM7

62

8K

6K

19

0

mrkhof retweeted