Mark Hahnel

3 months ago

What parts of the academic knowledge creation & dissemination pipeline can we automate with AI? https://t.co/OkltGhB5BQ

1

2

3

427

MarkHahnel retweeted

Ai2 @allen_ai

5 days ago

LLMs are no longer created w/ human data alone. They rely on other models to generate & filter data, evaluate outputs, & guide dev work. So what is a modern LLM built on? Olmo 3 → 89 model + 183 dataset dependencies; Nemotron 3 → 273 + 560 We made ModSleuth to trace this. 🧵

allen_ai's tweet photo. LLMs are no longer created w/ human data alone. They rely on other models to generate & filter data, evaluate outputs, & guide dev work.

So what is a modern LLM built on? Olmo 3 → 89 model + 183 dataset dependencies; Nemotron 3 → 273 + 560

We made ModSleuth to trace this. 🧵 https://t.co/1QvtmlxzYP

5

254

41

136

87K

4 days ago

Ground the model, or it invents the evidence Trusted, curated, subject-specific models are needed for academia https://t.co/xjBVT1YrWt

MarkHahnel's tweet photo. Ground the model, or it invents the evidence

Trusted, curated, subject-specific models are needed for academia

https://t.co/xjBVT1YrWt https://t.co/9ERu63MsEi

0

34

MarkHahnel retweeted

5 days ago

Today! Are you joining the science sleuths? 🔗 Register here: https://t.co/RhiyBJF5ib #ResearchIntegrity

0

2

0

93

Who to follow

We're an AI-focused technology company providing innovative solutions to complex challenges faced by researchers, universities, funders, industry & publishers.

Richard Poynder

@RickyPo

Independent journalist & blogger commenting on university issues (especially academic freedom), and on AI. Posts/Reposts are not endorsements. @rickypo.bsky.soc

Open Science

@openscience

to #openscience… https://t.co/jcV6uP6Uqx

MarkHahnel retweeted

6 days ago

The #arXiv ban on unchecked #AI in preprints is, in @MarkHahnel's words, needed & appropriate. But the underlying problem is bigger than arXiv. General-purpose models are the wrong tool for research. The contamination, he warns, doesn't go away when the AI gets better. 🔗 Read his post: https://t.co/lMFgS3F7Jl

digitalsci's tweet photo. The #arXiv ban on unchecked #AI in preprints is, in @MarkHahnel's words, needed & appropriate. But the underlying problem is bigger than arXiv.

General-purpose models are the wrong tool for research. The contamination, he warns, doesn't go away when the AI gets better.

🔗 Read his post: https://t.co/lMFgS3F7Jl

0

1

165

MarkHahnel retweeted

6 days ago

"The tools have changed beyond recognition; the intent has not changed at all." — Dr Daniel Hook, CEO, Digital Science. In a new blog post on the @Symplectic website, Daniel discusses a 20-year-old problem that #AI is now helping to solve. 🔗 Read his post: https://t.co/YVjHoOU4Gc

digitalsci's tweet photo. "The tools have changed beyond recognition; the intent has not changed at all." — Dr Daniel Hook, CEO, Digital Science.

In a new blog post on the @Symplectic website, Daniel discusses a 20-year-old problem that #AI is now helping to solve.

🔗 Read his post: https://t.co/YVjHoOU4Gc

0

1

3

0

136

MarkHahnel retweeted

Symplectic @Symplectic

about 2 months ago

Still dealing with “alphabet soup” in your research systems? RIMS, CRIS, IR, RDM… it adds up fast. Watch Building a Research Engine webinar on demand to see how you can simplify your ecosystem with Symplectic Elements and Figshare. 👉 https://t.co/XowBEGYN9C

Symplectic's tweet photo. Still dealing with “alphabet soup” in your research systems? RIMS, CRIS, IR, RDM… it adds up fast.

Watch Building a Research Engine webinar on demand to see how you can simplify your ecosystem with Symplectic Elements and Figshare.

👉 https://t.co/XowBEGYN9C https://t.co/jhDnogKFfd

0

1

0

117

MarkHahnel retweeted

Rory Byrne

@ryrobyrne

7 days ago

Operating a bio database should be much easier. To the point where each lab can run one. These should integrate with lab equipment, run automated QC on new deposits, compute a rich surface of queryable metadata, export to ML-friendly formats, and federate via shared ontologies.

1

8

3

2

847

Training Centre in Communication (TCC Africa) @tccafrica

6 days ago

@mgdurrant Any way to get on trusted access program now?

0

62

MarkHahnel retweeted

clem 🤗

@ClementDelangue

6 days ago

Concentration of power, capabilities and economic wealth is the biggest risk in AI. We need open science and open-source more than ever!

111

3K

481

210

162K

MarkHahnel retweeted

6 days ago

Ongoing Figshare & TCC Africa Community Call in #Kenya: Strengthening Research Data Management & Data Repository Adoption in #OpenScience using @figshare . Speaking now is @MarkHahnel CC @digitalsci

tccafrica's tweet photo. Ongoing Figshare & TCC Africa Community Call in #Kenya: Strengthening Research Data Management & Data Repository Adoption in #OpenScience using @figshare . Speaking now is @MarkHahnel
CC @digitalsci https://t.co/k28k1IyFsp

0

4

3

0

214

6 days ago

Love this - great to see lots of green - could really do with more The state of biological AI at a glance https://t.co/bf6XUK2N0v

0

40

MarkHahnel retweeted

AVB

@neural_avb

7 days ago

Been using the Overleaf AI Agent for latex editing the past few days... It's way more useful than I expected! 🙏🏼

2

15

2

6

1K

MarkHahnel retweeted

7 days ago

👉 Before you attend the webinar, download the report: "Forensic Scientometrics (FoSci) Report 2026: Understanding, Detecting, and Documenting Manipulation in the Research Ecosystem." 🔗 https://t.co/alnAJXr3kW #ResearchIntegrity #TrustInScience

0

1

2

1

783

MarkHahnel retweeted

7 days ago

⚠️ This week: Join our panel of science sleuths to hear about uncovering research misconduct & manipulation. #ResearchIntegrity We're decoding the findings of the Forensic Scientometrics (FoSci) Report. 🗓️ Thursday 11 June 🕒 3pm BST 🕙 10am EDT 🔗 Register now: https://t.co/JCWm92Ae3Z

digitalsci's tweet photo. ⚠️ This week: Join our panel of science sleuths to hear about uncovering research misconduct & manipulation. #ResearchIntegrity

We're decoding the findings of the Forensic Scientometrics (FoSci) Report.

🗓️ Thursday 11 June
🕒 3pm BST 🕙 10am EDT

🔗 Register now: https://t.co/JCWm92Ae3Z

1

4

5

0

721

7 days ago

It’s a monumental shift that Tech companies can now start Pharma companies A new era of biological compute, biosingularity and solving disease @maxjaderberg talking about how AI is transforming drug discovery at London Tech Week

MarkHahnel's tweet photo. It’s a monumental shift that Tech companies can now start Pharma companies

A new era of biological compute, biosingularity and solving disease

@maxjaderberg talking about how AI is transforming drug discovery at London Tech Week https://t.co/h97510QZzE

0

90

MarkHahnel retweeted

Anthropic

@AnthropicAI

8 days ago

New Science Blog: Why has AI advanced faster in coding than in biology? To agents, bio databases are like cities built before cars—maddening to drive in because they're designed for different traffic. How do we build infrastructure agents can use? https://t.co/PQaNQ4GRJZ

318

4K

500

2K

721K

MarkHahnel retweeted

Nathan C. Frey

@nc_frey

9 days ago

Opus 4.7 is as good or better than ChemDraw at interpreting NMR spectra on our evals. We're making Claude more helpful for chemists, starting with routine and time-consuming analytical tasks.

4

153

12

59

22K

8 days ago

On Training Data for Bio AI Models As we advance biological foundation models, which lessons from LLM data curation transfer, and which need rethinking? https://t.co/BPDZrJMq5d

0

1

63

11 days ago

NIH Generalist Repository Ecosystem Initiative (GREI) Year 4: Reflections, Progress, and What Comes Next https://t.co/lgqh0VTKVs

0

96