Algorithmic Research Group @algoresearch_ - Twitter Profile

Algorithmic Research Group @algoresearch_

4 days ago

We have been building the "virtual lab" mentioned in (https://t.co/n1wGYsiNGL) and letting agents iterate on it. It will be important for measuring RSI progress and for accelerating automated safety research

algoresearch_'s tweet photo. We have been building the "virtual lab" mentioned in (https://t.co/n1wGYsiNGL) and letting agents iterate on it. It will be important for measuring RSI progress and for accelerating automated safety research https://t.co/16n3U0CiRp

1

4

1

1K

algoresearch_ retweeted

Anthropic

@AnthropicAI

6 days ago

Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. https://t.co/OVVPJO7VQx

2K

28K

5K

15K

18M

Algorithmic Research Group @algoresearch_

about 2 months ago

Link: https://t.co/gGgpPZ5c1R

0

62

Algorithmic Research Group @algoresearch_

about 2 months ago

We've set up @AISecurityInst's Inspect platform as a lightweight remote eval service: Postgres-backed job queue, Dockerized API + worker pool, remote submission, and the existing Inspect web UI for viewing runs. It keeps the core Inspect workflow intact, but makes it practical to run many evals on a shared host. It acts like @METR_Evals vivaria, but has Inspects awesome tooling. Link below:

algoresearch_'s tweet photo. We've set up @AISecurityInst's Inspect platform as a lightweight remote eval service: Postgres-backed job queue, Dockerized API + worker pool, remote submission, and the existing Inspect web UI for viewing runs. It keeps the core Inspect workflow intact, but makes it practical to run many evals on a shared
host. It acts like @METR_Evals vivaria, but has Inspects awesome tooling. Link below:

2

1

766

Algorithmic Research Group @algoresearch_

about 2 months ago

🤗 Link: https://t.co/ru2holJ6Jp

0

2

129

Algorithmic Research Group @algoresearch_

about 2 months ago

We're releasing 'ai-sft' on @huggingface: a 34GB supervised fine-tuning dataset for training models on AI research tasks. 2.7M examples spanning research code generation, scientific QA, and technical problem solving, built from our research-focused data collections. Each example includes structured fields for task family, grounded context, source tracing, loss weighting, and quality flags. Train/val splits included. Link below:

algoresearch_'s tweet photo. We're releasing 'ai-sft' on @huggingface: a 34GB supervised fine-tuning dataset for training models on AI research tasks. 2.7M examples spanning research code generation, scientific QA, and technical problem solving, built from our research-focused data collections. Each example includes structured fields for task family, grounded context, source tracing, loss weighting, and quality flags. Train/val splits included. Link below:

1

11

1

23

4K

Algorithmic Research Group @algoresearch_

2 months ago

whoops! push the wrong version - should be looking even better now.

0

55

Algorithmic Research Group @algoresearch_

2 months ago

We're releasing 's2orc-safety' on @huggingface: a AI safety slice of our s2orc-enriched dataset with 16,806 papers across jailbreaks, prompt injection, red teaming, model security, privacy, robustness, alignment, and more. Each paper is enriched with structured fields for reproducibility, safety taxonomy, experimental details, practicality, normalized model/dataset/metric names, code-link metadata, and more. Link below:

algoresearch_'s tweet photo. We're releasing 's2orc-safety' on @huggingface: a AI safety slice of our s2orc-enriched dataset with 16,806 papers across jailbreaks, prompt injection, red teaming, model security, privacy, robustness, alignment, and more.

Each paper is enriched with structured fields for reproducibility, safety taxonomy, experimental details, practicality, normalized model/dataset/metric names,
code-link metadata, and more. Link below:

2

7

2

5

655

Algorithmic Research Group @algoresearch_

2 months ago

https://t.co/bdGY5B06QR

0

65

Algorithmic Research Group @algoresearch_

2 months ago

AI Safety should be open and well funded

Matthew Kenney @baykenney

2 months ago

Here are my thoughts: https://t.co/FSj3lQ0OVs

0

2

0

1

247

0

94

Algorithmic Research Group @algoresearch_

2 months ago

We dont have a moat now. Time to pack it in 😭

Matthew Kenney @baykenney

2 months ago

Holy shit! An intern was using claude code and made a bunch of our repos public! A thread on the fallout 🧵

1

0

1

641

0

109

Algorithmic Research Group @algoresearch_

2 months ago

AlgorithmicResearchGroup's GitHub Sponsors profile is live! You can sponsor us to support AlgorithmicResearchGroup's open source work 💖 https://t.co/SCiLc99Etn

0

1

0

163

Algorithmic Research Group @algoresearch_

2 months ago

We now support both @METR_Evals task standards and @AISafetyInst Inspect standards, split across two repos

0

1

0

43

Algorithmic Research Group @algoresearch_

3 months ago

We've added a UK AI Safety Institute (@AISafetyInst) inspect variant to make it easier to run DMLB. Link below:

Algorithmic Research Group @algoresearch_

3 months ago

These tasks are tough - we think a broad, open-ended set of research tasks mined from real world repos is the best way to measure progress in automated research. Link below:

1

0

708

1

0

1

0

537

Algorithmic Research Group @algoresearch_

3 months ago

https://t.co/dVnZw6S5KW

1

0

77

Algorithmic Research Group @algoresearch_

2 months ago

perfect for @karpathy style auto-research agents or large scale analysis of CS trends over time

0

35

Algorithmic Research Group @algoresearch_

3 months ago

We're releasing S2ORC CS Enriched, a dataset of 1.1 million computer science papers from Semantic Scholar's S2ORC corpus with LLM-generated enrichment fields added to every row. The base dataset has the full paper text, abstracts, authors, references, citation counts, and venue metadata. We added structured enrichment columns on top: paper summaries, classification, methods used, results, models, datasets, metrics, limitations, and GPU compute details where reported. The enrichment makes it possible to do things that are hard with raw paper text alone, like filtering for papers that used a specific method, or finding papers that report GPU hours, or building training data for models that need to understand the structure of research papers rather than just their text. 1,118 parquet files, 44 GB total. Available on @huggingface https://t.co/Mp2s7suxEE

2

1

0

545

Algorithmic Research Group @algoresearch_

2 months ago

Sneak peek at our experimental interface for multiagent work

0

1

0

201

algoresearch_ retweeted

Algorithmic Research Group @algoresearch_

2 months ago

1.5k downloads in a few days. Not bad for a 54.7GB dataset 📈

0

2

0

283

Algorithmic Research Group @algoresearch_

2 months ago

1.5k downloads in a few days. Not bad for a 54.7GB dataset 📈

0

2

0

283

Algorithmic Research Group @algoresearch_

2 months ago

Contributions welcome! https://t.co/hbuWXq2Cmh

0

41

Algorithmic Research Group @algoresearch_

2 months ago

We're excited to release HF agent - a multi-agent @huggingface model-selection and fine-tuning system that investigates current options, searches the web and Hugging Face, and produces Markdown recommendation reports with citations and code snippets. Link below

1

3

0

459

Algorithmic Research Group @algoresearch_

2 months ago

This was a fun weekend little side project to help navigate the open source repos. Uses the web and HF apis to make suggestions, provide code snippets and more. Reports are saved to @huggingface buckets, and it serves up a Spaces static site for your reports

algoresearch_'s tweet photo. This was a fun weekend little side project to help navigate the open source repos. Uses the web and HF apis to make suggestions, provide code snippets and more. Reports are saved to @huggingface buckets, and it serves up a Spaces static site for your reports https://t.co/IEYEwcHAwZ

1

0

48

Algorithmic Research Group

@algoresearch_

Last Seen Users on Sotwe

Trends for you

Most Popular Users