Ajay Patel @ajayp95 - Twitter Profile

ajayp95 retweeted

over 1 year ago

We share Code-Guided Synthetic Data Generation: using LLM-generated code to create multimodal datasets for text-rich images, such as charts📊, documents📄, etc., to enhance Vision-Language Models. Website: https://t.co/U2y96rxMzS Dataset: https://t.co/AT4QmiYwdp Paper: https://t.co/mZFpN7kYoP Code: https://t.co/HyDdcuwjsn

YueYangAI's tweet photo. We share Code-Guided Synthetic Data Generation: using LLM-generated code to create multimodal datasets for text-rich images, such as charts📊, documents📄, etc., to enhance Vision-Language Models.

Website: https://t.co/U2y96rxMzS
Dataset: https://t.co/AT4QmiYwdp
Paper: https://t.co/mZFpN7kYoP
Code: https://t.co/HyDdcuwjsn

6

194

46

129

23K

ajayp95 retweeted

AK

@_akhaliq

over 1 year ago

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

3

97

14

45

15K

ajayp95 retweeted

Andrea Soria Jimenez @andrejanysa

over 1 year ago

🚀 Synthetic data is revolutionizing AI & ML! DataDreamer, an open-source Python library, makes generating synthetic data seamless & integrates effortlessly with @huggingface . Easily push datasets to the Hub and share them with the community 🔍 Learn how: https://t.co/oyZ6bpqXU1

andrejanysa's tweet photo. 🚀 Synthetic data is revolutionizing AI & ML!
DataDreamer, an open-source Python library, makes generating synthetic data seamless & integrates effortlessly with @huggingface . Easily push datasets to the Hub and share them with the community
🔍 Learn how: https://t.co/oyZ6bpqXU1 https://t.co/GTslG0M4EP

1

28

9

24

2K

ajayp95 retweeted

Zachary Horvitz

@zachary_horvitz

over 1 year ago

I'm at #EMNLP2024 presenting ✨TinyStyler✨, an efficient, effective, and fast method for few-shot text style transfer! Paper: https://t.co/xQmMHvy8tk Demo: https://t.co/DQ3OEZrC67 Code: https://t.co/aYS5lqwqAj

zachary_horvitz's tweet photo. I'm at #EMNLP2024 presenting ✨TinyStyler✨, an efficient, effective, and fast method for few-shot text style transfer!

Paper: https://t.co/xQmMHvy8tk
Demo: https://t.co/DQ3OEZrC67
Code: https://t.co/aYS5lqwqAj https://t.co/IYNdVz1NYq

1

17

5

3

2K

Who to follow

Anka Reuel | @ankareuel.bsky.social

@AnkaReuel

Computer Science PhD Student @ Stanford | Former Fellow @ Harvard Kennedy School | Former Vice Chair EU AI Code of Practice | Views are my own

Yi R. (May) Fung

@May_F1_

Assistant Professor, Hong Kong University of Science and Technology CSE 💻 Multimodal LLM Reasoning and Agents 🚀

Adam Stein

@adamlsteinl

PhD student @ UPenn. Working on reliability and safety of AI.

ajayp95 retweeted

Luca Soldaini 🎀

@soldni

over 1 year ago

Olmo goes multimodal! We are launching Molmo, a open family of multimodal models that rival the best closed VLMs out there 🤯 We spent the last 9 months meticulously curating PixMo, a dataset of (a) high-quality image-caption pairs and (b) multimodal instruction data.

soldni's tweet photo. Olmo goes multimodal!

We are launching Molmo, a open family of multimodal models that rival the best closed VLMs out there 🤯

We spent the last 9 months meticulously curating PixMo, a dataset of (a) high-quality image-caption pairs and (b) multimodal instruction data.

21

988

165

463

90K

ajayp95 retweeted

Duncan Watts @duncanjwatts

almost 2 years ago

Very nice coverage of @csspenn's just-launched Media Bias Detector. I'm very excited about this project, which has been a Herculean team effort! https://t.co/FlCSIfBp5L https://t.co/j5L4aRBzDN

0

49

14

7

6K

ajayp95 retweeted

Sepp Hochreiter @HochreiterSepp

about 2 years ago

New exciting research by @DinuMariusC with @ajayp95 (U of Pennsylvania) and @ExtensityAI. We show LLM self-improvement with synthetic data for web agent tasks on WebArena, and introduce an extended VERTEX score for measuring the trajectory quality of agent workflows.

1

47

11

18

7K

ajayp95 retweeted

Marius-Constantin Dinu @DinuMariusC

about 2 years ago

Excited to present our work “Large Language Models Can Self-Improve At Web Agent Tasks”. We show that synthetic data self-improvement boosts task completion by 31% on WebArena and introduce quality metrics for measuring autonomous agent workflows. #AI #MachineLearning #LLMs [1/n]

DinuMariusC's tweet photo. Excited to present our work “Large Language Models Can Self-Improve At Web Agent Tasks”. We show that synthetic data self-improvement boosts task completion by 31% on WebArena and introduce quality metrics for measuring autonomous agent workflows. #AI #MachineLearning #LLMs [1/n] https://t.co/qBIV6lfw6h

5

69

19

39

14K

ajayp95 retweeted

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

about 2 years ago

Large Language Models Can Self-Improve At Web Agent Tasks abs: https://t.co/G84JKYmK49 "We explore fine-tuning on three distinct synthetic training data mixtures and achieve a 31% improvement in task completion rate over the base model on the WebArena benchmark through a self-improvement procedure."

iScienceLuvr's tweet photo. Large Language Models Can Self-Improve At Web Agent Tasks

abs: https://t.co/G84JKYmK49

"We explore fine-tuning on three distinct synthetic training data mixtures and achieve a 31% improvement in task completion rate over the base model on the WebArena benchmark through a self-improvement procedure."

6

236

54

162

21K

ajayp95 retweeted

AK

@_akhaliq

over 2 years ago

paper page: https://t.co/D7TKlmtwQ6

1

10

4

7

8K

ajayp95 retweeted

AK

@_akhaliq

over 2 years ago

DataDreamer A Tool for Synthetic Data Generation and Reproducible LLM Workflows Large language models (LLMs) have become a dominant and important tool for NLP researchers in a wide range of tasks. Today, many researchers use LLMs in synthetic data generation, task evaluation, fine-tuning, distillation, and other model-in-the-loop research workflows. However, challenges arise when using these models that stem from their scale, their closed source nature, and the lack of standardized tooling for these new and emerging workflows. The rapid rise to prominence of these models and these unique challenges has had immediate adverse impacts on open science and on the reproducibility of work that uses them. In this paper, we introduce DataDreamer, an open source Python library that allows researchers to write simple code to implement powerful LLM workflows. DataDreamer also helps researchers adhere to best practices that we propose to encourage open science and reproducibility.

_akhaliq's tweet photo. DataDreamer

A Tool for Synthetic Data Generation and Reproducible LLM Workflows

Large language models (LLMs) have become a dominant and important tool for NLP researchers in a wide range of tasks. Today, many researchers use LLMs in synthetic data generation, task evaluation, fine-tuning, distillation, and other model-in-the-loop research workflows. However, challenges arise when using these models that stem from their scale, their closed source nature, and the lack of standardized tooling for these new and emerging workflows. The rapid rise to prominence of these models and these unique challenges has had immediate adverse impacts on open science and on the reproducibility of work that uses them. In this paper, we introduce DataDreamer, an open source Python library that allows researchers to write simple code to implement powerful LLM workflows. DataDreamer also helps researchers adhere to best practices that we propose to encourage open science and reproducibility.

4

150

33

106

23K

ajayp95 retweeted

Aran Komatsuzaki

@arankomatsuzaki

over 2 years ago

DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows repo: https://t.co/TPartpdNml abs: https://t.co/RBwQwf2DGQ

arankomatsuzaki's tweet photo. DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows

repo: https://t.co/TPartpdNml
abs: https://t.co/RBwQwf2DGQ https://t.co/3wOJiEkCJ5

1

254

43

165

20K

ajayp95 retweeted

Bryan Li @bryanlics

about 3 years ago

Are GPT-style LMs best for prompting🤔? Our work shows maybe not! Catch us at the poster for "Bidirectional Language Models are Also Few-Shot Learners" (joint w/ @ajayp95, @colinraffel ) in person at #ICLR2023 in Kigali May 3, 11:30-1:30 PM #162 or https://t.co/p8TZHJBstE

0

13

3

1

649

ajayp95 retweeted

UPenn NLP @upennnlp

over 3 years ago

Work done by Ajay Patel, @bryanlics, and @ccb from @upennnlp with collaborators @colinraffel @noahconst and @rasoolims

0

2

1

0

1K

ajayp95 retweeted

UPenn NLP @upennnlp

over 3 years ago

Bidirectional LMs like T5 learn superior representations, but the field mostly trains unidirectional LMs like GPT-3 since the "emergent" property of prompting was never seen in T5. We show that T5 can be prompted, outperforming GPT-3 with 50% fewer params. https://t.co/WNUa3HvnVk

2

278

36

89

36K

Ajay Patel

@ajayp95

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users