Marcus Barnes

@MarcusBarnes

PhD Researcher | LLM4SE & multi-agent AI systems | LLMs for mathematics & autoformalization | Open to research collaborations, industry roles & consulting

Toronto, ON

Joined May 2009

6.1K Following

1.4K Followers

10.7K Posts

Pinned Tweet

Marcus Barnes @MarcusBarnes

2 months ago

I recently presented LogSieve at MSR’26: reducing CI logs for LLM-based analysis. 42% smaller logs, 40% fewer tokens, while preserving root cause analysis. Pre-print: https://t.co/Rh9eIe5Kj1 #LLM #MLOps

MarcusBarnes's tweet photo. I recently presented LogSieve at MSR’26: reducing CI logs for LLM-based analysis.

42% smaller logs, 40% fewer tokens, while preserving root cause analysis.

Pre-print: https://t.co/Rh9eIe5Kj1

#LLM #MLOps https://t.co/RTKjuUI4fj

110

Marcus Barnes @MarcusBarnes

2 days ago

Cordyceps CI/CD Flaws Expose 300+ GitHub Repositories to Supply-Chain Attacks https://t.co/LcHHInaYKS via @TheHackersNews

Marcus Barnes @MarcusBarnes

10 days ago

@nasqret You're on a roll! Amazing!

155

MarcusBarnes retweeted

Bartosz Naskręcki

@nasqret

12 days ago

For the last two weeks I switched completely to the "loop philosophy". Just a few research frontier examples: 1. I have a paper on Groebner bases and heavy computations. Made a loop which analyzes the structure of those bases with Codex. Server-based Magma is in the pipeline. The model revamped the proof structure to completely get rid of all the heavy CPU computations. Effect: the final draft contains only computations which can be human-checked. 2. A hunt for sparse, crystal-driven sphere packings. Made a loop which analyzed the existing work of Fisher, made corrections with Python code to verify all the data from the tables. Found a new example of a packing that was missing in our list. Server-grade computations + heavy web search combined with looped Python verification. 3. Presentation about agentic workflows for high school teachers. Made a loop which parsed all the input data, carved a GitHub project with the whole presentation, and helped me prepare easy-to-understand examples for a first encounter with agents. Local runs with Claude Code, reviewing the slides, composition etc. all with the autonomous loop. Gave the talk, very well received, shared examples and code with people on GitHub. And the list can go on. TL;DR: if you wonder whether agentic workflow is important in 2026, think that it's actually critical for stabilizing the verification, consistency, and structure of the work done with LLMs. Big caveat: you need to carefully plan first, provide high-quality data sources, control the intermediate feedback, and be in control of this madness. There is a fine line between full YOLO and loop management. Don't miss it.

Who to follow

Gerardo Monroy Glez

@gmonroyg

Desarrollador por gusto, esposo y padre por placer. 100% aspiracionista!

3d math artist author former teacher @[email protected]

Marcus Barnes @MarcusBarnes

11 days ago

144 Mastra npm Packages Compromised via Hijacked Contributor Account https://t.co/EYT9eKMydU via @TheHackersNews

MarcusBarnes retweeted

SAIR

@SAIRfoundation

11 days ago

We’re excited to launch SAIR’s 3rd competition: IGP24, the Inverse Galois Problem in degree 24. Co-organized by John Jones, Jen Paulhus, David Roe, Andrew Sutherland, and Terence Tao, in collaboration with LMFDB. Join the Compeitition: https://t.co/9J88HV94AR

407

Marcus Barnes @MarcusBarnes

16 days ago

GitHub to Disable npm Install Scripts by Default to Stop Supply Chain Attacks https://t.co/MN5Br1Jr9b via @TheHackersNews

Marcus Barnes @MarcusBarnes

17 days ago

@Mao_Yuzhen Very interesting! On my reading list!

312

MarcusBarnes retweeted

Yuzhen Mao

@Mao_Yuzhen

17 days ago

Paper: https://t.co/4uRWSPs9dy Website and Code: https://t.co/01lwzWuc6M (8/8)

MarcusBarnes retweeted

Google Research

@GoogleResearch

17 days ago

Presenting Regularized f-Divergence Kernel Tests, a new framework for auditing machine unlearning and differential privacy. This method is more sensitive to localized data shifts and requires fewer samples than traditional tools. Learn more: https://t.co/FdPcfJeEs4

GoogleResearch's tweet photo. Presenting Regularized f-Divergence Kernel Tests, a new framework for auditing machine unlearning and differential privacy. This method is more sensitive to localized data shifts and requires fewer samples than traditional tools. Learn more: https://t.co/FdPcfJeEs4 https://t.co/MHxy5TdAJE

179

12K

MarcusBarnes retweeted

Gabriele Berton

@gabriberton

30 days ago

Great quote that every AI researcher should aspire to: "I do things, which work, rather than being novel, but not working." - @ducha_aiki

163

14K

MarcusBarnes retweeted

Daniel Litt

@littmath

29 days ago

Just added a 15th problem to https://t.co/6oEjUqIxgi! This one is a "non-abelian" analogue of the variational Hodge or Tate conjectures.

littmath's tweet photo. Just added a 15th problem to https://t.co/6oEjUqIxgi! This one is a "non-abelian" analogue of the variational Hodge or Tate conjectures. https://t.co/zEpXsxBuBg

103

MarcusBarnes retweeted

Canadian Mathematical Society @canmathsociety

about 1 month ago

The CMS is pleased to announce Dr. Jamie Mulholland (@SFU) as the recipient of the 2026 Adrien Pouliot Award. La SMC a le plaisir d'annoncer que le Dr Jamie Mulholland (Simon Fraser University) est le lauréat du prix Adrien Pouliot 2026. 🔗https://t.co/JmJDuGOFNX

canmathsociety's tweet photo. The CMS is pleased to announce Dr. Jamie Mulholland (@SFU) as the recipient of the 2026 Adrien Pouliot Award.

La SMC a le plaisir d'annoncer que le Dr Jamie Mulholland (Simon Fraser University) est le lauréat du prix Adrien Pouliot 2026.

🔗https://t.co/JmJDuGOFNX https://t.co/NZdzmyeb6x

188

MarcusBarnes retweeted

Alex Kontorovich

@AlexKontorovich

about 1 month ago

The 2026 Jean-Pierre Demailly Prize for Open Science in Mathematics has been awarded to Mathlib! From the citation: "Mathlib is widely recognized as an exceptional contribution to the mathematical community. It is seen as having an exceptionally broad structural significance for the future of mathematics. It is not only a resource of immediate value, but also an infrastructure with the potential to transform mathematical practice in a lasting way. The project plays a central role in the formalization and verification of mathematics. This aspect is of particular importance in the current scientific context, as formal proof verification, automated reasoning, and AI-assisted mathematical work are likely to play an increasingly significant role in the years ahead. Mathlib is also widely viewed as exemplary in its openness and collaborative organization. It lowers barriers to participation, enables contributions from a broad international community, and provides a framework in which mathematical knowledge can be verified, shared, preserved, and reused at large scale. For all these reasons, Mathlib is not merely a successful project within an existing category, but it is helping redefine the way mathematical knowledge may be produced, checked, and disseminated in the future." Congratulations!! https://t.co/TZ9qnyjoBh

115

11K

MarcusBarnes retweeted

Daniel Litt

@littmath

about 1 month ago

STEM academia serves two closely intertwined purposes: the production of high quality science and the production of human capital. These two purposes feed into each other. The obvious direction is that we develop human capital by paying people to produce science. What is perhaps less obvious is that the very fact that human labor is used to produce science has historically been an important input to its quality. The goal of science is not simply to produce papers, but rather to produce good work--that a person is willing to spend months working on a paper is a (weak) witness to the fact that it has some minimum quality. If someone has a record of producing high quality work, that they wrote a paper is a stronger witness, since it was worth the opportunity cost to write it. If many people engage with it substantially, that is even stronger evidence. This is not to say that there isn't lots of low-quality work--there is, in fact a huge amount--but we have strong sorting mechanisms, admittedly using imperfect proxies (all depending on costly human labor!), to find high-quality stuff. Arguably the paper itself is not the primary product here; in many cases the primary product is actually the expertise developed over the course of producing it, which can then be applied to other questions. If you believe, as I do, that producing high quality science should be one of our fundamental goals, I think you’re obligated to embrace new tools that help one do so. Refusing to is a declaration that these outputs are not important. But I worry that we are not on track to automate the production of good work; rather, we are on track to automate the production of papers. We need new mechanisms to ensure that we are also producing good work, and to ensure that we are developing the human capital to engage with it.

442

113

23K

MarcusBarnes retweeted

Perplexity

@perplexity_ai

about 1 month ago

Today we're open-sourcing Bumblebee, a read-only scanner for macOS and Linux. It checks developer machines for risky packages, extensions, and AI tool configs. Connected to Computer, it can trigger deeper scans whenever a new supply-chain risk emerges. https://t.co/FOaWnF1yQy

perplexity_ai's tweet photo. Today we're open-sourcing Bumblebee, a read-only scanner for macOS and Linux.

It checks developer machines for risky packages, extensions, and AI tool configs.

Connected to Computer, it can trigger deeper scans whenever a new supply-chain risk emerges.

https://t.co/FOaWnF1yQy https://t.co/wXauD4wDOT

182

703

MarcusBarnes retweeted

Paata Ivanisvili

@PI010101

about 1 month ago

Academia is one of the few places where, despite all its imperfections, a person could still rise mostly through talent, curiosity, and hard work (example: Ramanujan) — not only through luck, wealth, family connections etc. But what happens if access to extremely powerful AI assistants becomes the new dividing line? The news about “superhuman AI” is scientifically exciting. But socially, it raises difficult questions — especially for gifted children from poor backgrounds who may not have access to frontier models.

227

34K

MarcusBarnes retweeted

Alvaro Lozano-Robledo

@mathandcobb

about 1 month ago

Maybe no one has asked ChatGPT to disprove other generally believed conjectures... someone should ask for an even integer that is not a sum of two primes, or to find some Siegel zeros for us.

Marcus Barnes @MarcusBarnes

about 1 month ago

@mathandcobb @OpenAI This is a really good summary and helps put the result in the broader context of ongoing discussions. Thank very much!

130

MarcusBarnes retweeted

Alvaro Lozano-Robledo

@mathandcobb

about 1 month ago

As promised, here is a video about the recent disproof of Erdős' unit distance conjecture by @OpenAI https://t.co/YKK3CSUUYu

MarcusBarnes retweeted

Alvaro Lozano-Robledo

@mathandcobb

about 1 month ago

If you are in math and you are depressed, you shouldn't be. The number of projects that I have in mind have tripled or quadrupled now that I can have access to LLM's to assist me on things I had given up before.

363

20K

Marcus Barnes

@MarcusBarnes

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users