Papers with Datasets

@paperswithdata

Keep up with the latest machine learning datasets from @paperswithcode. Follow for daily updates.

Joined April 2021

1 Following

16.5K Followers

284 Posts

paperswithdata retweeted

Papers with Code

@paperswithcode

over 3 years ago

🪐 Introducing Galactica. A large language model for science. Can summarize academic literature, solve math problems, generate Wiki articles, write scientific code, annotate molecules and proteins, and more. Explore and get weights: https://t.co/jKEP8S7Yfl

209

Papers with Datasets @paperswithdata

over 3 years ago

🦿QDax: a Quality-Diversity benchmark suite for Deep Neuroevolution in Reinforcement Learning domain for robot control. It specifies different benchmarks based on the complexity of both the task and the agent controlled by a deep neural network. https://t.co/6hkXkeLxRK

paperswithdata's tweet photo. 🦿QDax: a Quality-Diversity benchmark suite for Deep Neuroevolution in Reinforcement Learning domain for robot control.

It specifies different benchmarks based on the complexity of both the task and the agent controlled by a deep neural network.

https://t.co/6hkXkeLxRK https://t.co/m8lw2brQOa

Papers with Datasets @paperswithdata

over 3 years ago

🏞ImageNet-X: a set of annotations of factors like pose, background, or lighting of the ImageNet validation set and 12K training images for studying the types of mistakes as a function of a model's architecture, learning paradigm, and training procedures. https://t.co/hPvEG5DRUV

paperswithdata's tweet photo. 🏞ImageNet-X: a set of annotations of factors like pose, background, or lighting of the ImageNet validation set and 12K training images for studying the types of mistakes as a function of a model's architecture, learning paradigm, and training procedures.

https://t.co/hPvEG5DRUV https://t.co/mEUCOvNA1t

Papers with Datasets @paperswithdata

over 3 years ago

For more details about the dataset and full evaluation of current MT metrics. Do check out the link below! Paper, Code & Results: https://t.co/12sQvmNzNQ 4/4

Who to follow

Hugging Face

@huggingface

The AI community building the future. https://t.co/TpiXQMQ9rZ

Sydney Dec 6-12, 26, Paris and Atlanta. Tweets to this account are not monitored. Please send feedback to [email protected].

Papers with Datasets @paperswithdata

over 3 years ago

🎯ACES: a Translation Accuracy Challenge Set, consisting of 36,476 examples covering 146 language pairs and representing challenges from 68 phenomena, designed to evaluate a wide range of machine translation metrics. More details below… Dataset: https://t.co/JmMjhNUYJl 1/4

paperswithdata's tweet photo. 🎯ACES: a Translation Accuracy Challenge Set, consisting of 36,476 examples covering 146 language pairs and representing challenges from 68 phenomena, designed to evaluate a wide range of machine translation metrics. More details below…

Dataset: https://t.co/JmMjhNUYJl

1/4 https://t.co/j183vUMQPT

Papers with Datasets @paperswithdata

over 3 years ago

Results analysis suggests three potential improvements: 1) Combine metrics of various strengths 2) Develop metrics that put more weight to the source instead of surface-level overlap with the reference 3) Include strategies to explicitly model more language-specific info 3/4

Papers with Datasets @paperswithdata

over 3 years ago

We've explored the latest progress and ideas in the world of math world problem solving. We’ll continue to keep track of new and interesting developments in ML for math and the MATH benchmark. For more threads like this, be sure to follow @paperswithcode and @paperswithdata! 7/7

Papers with Datasets @paperswithdata

over 3 years ago

🤔How good are language models at solving math problems? Let’s review the progress of LMs on the MATH benchmark in this thread. 1/7

paperswithdata's tweet photo. 🤔How good are language models at solving math problems?

Let’s review the progress of LMs on the MATH benchmark in this thread.

1/7 https://t.co/hXtpjTJnFR

238

Papers with Datasets @paperswithdata

over 3 years ago

Minerva’s results have shown that the key in improving ML models’ problem solving ability does not solely depend on scaling the model but also novel ideas like majority voting. This encourages new algorithm alteration ideas to improve performance. 6/7

Papers with Datasets @paperswithdata

over 3 years ago

➗Lila: a mathematical reasoning benchmark consisting 23 tasks covering mathematical abilities, language format, language diversity and external knowledge. It's an extension of 20 datasets by collecting Python programs’ task instructions and solutions. https://t.co/DbaL9KSsNB

paperswithdata's tweet photo. ➗Lila: a mathematical reasoning benchmark consisting 23 tasks covering mathematical abilities, language format, language diversity and external knowledge.

It's an extension of 20 datasets by collecting Python programs’ task instructions and solutions.

https://t.co/DbaL9KSsNB https://t.co/A46re8Bm9g

Papers with Datasets @paperswithdata

over 3 years ago

{}Code Syntax: a dataset of programs annotated with the syntactic relationships in their syntax trees designed for code syntax understanding tasks. It contains 30K+ code samples annotated with 2M+ relation edges in 43 relation types for Python and Java. https://t.co/gTLMuKh99Y

paperswithdata's tweet photo. {}Code Syntax: a dataset of programs annotated with the syntactic relationships in their syntax trees designed for code syntax understanding tasks.

It contains 30K+ code samples annotated with 2M+ relation edges in 43 relation types for Python and Java.

https://t.co/gTLMuKh99Y https://t.co/gr3vJmFmA6

Papers with Datasets @paperswithdata

over 3 years ago

💻The Stack: a dataset for pre-training Code LLMs. It contains 3TB of permissively-licensed code in 30 programming languages. It’s created as part of the BigCode Project, an open scientific collaboration working on responsible development of Code LLMs. https://t.co/2zHTyCztBZ

paperswithdata's tweet photo. 💻The Stack: a dataset for pre-training Code LLMs. It contains 3TB of permissively-licensed code in 30 programming languages.

It’s created as part of the BigCode Project, an open scientific collaboration working on responsible development of Code LLMs.

https://t.co/2zHTyCztBZ https://t.co/j9Z48PLFTM

Papers with Datasets @paperswithdata

over 3 years ago

🏺Breaking Bad: a large-scale dataset of fractured objects. It has 10k meshes, each with 100 fractures mode, totalling 1M breakdown patterns. It serves as a benchmark for fractured object reassembly and new challenges for geometric shape understanding. https://t.co/OVPesA1R3r

$paperswithdata's tweet photo. 🏺Breaking Bad: a large-scale dataset of fractured objects. It has 10k meshes, each with 100 fractures mode, totalling 1M breakdown patterns. It serves as a benchmark for fractured object reassembly and new challenges for geometric shape understanding. https://t.co/OVPesA1R3r https://t.co/XaqVOqwGvG$

Papers with Datasets @paperswithdata

over 3 years ago

🌄Diffusion DB: a large text-to-image prompt dataset containing 2M images generated by Stable Diffusion using user’s prompts and hyperparameters. It opens up research in prompt engineering, deepfakes detection, and designing public AI tools. https://t.co/h1121ZArEs

paperswithdata's tweet photo. 🌄Diffusion DB: a large text-to-image prompt dataset containing 2M images generated by Stable Diffusion using user’s prompts and hyperparameters.

It opens up research in prompt engineering, deepfakes detection, and designing public AI tools.

https://t.co/h1121ZArEs https://t.co/NWLQwi4som

Papers with Datasets @paperswithdata

over 3 years ago

👀Perception Test: a benchmark for evaluating the perception and reasoning skills of multimodal models. It uses real-world videos to define tasks that require understanding of memory, patterns, physics, and semantics across visual, audio, and text. https://t.co/rZ9VFkjquM

Papers with Datasets

@paperswithdata

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users