Francesco Costa

@atsoCF

PhD student at Alex Bateman's group, EBI

Joined October 2021

115 Following

28 Followers

23 Posts

atsoCF retweeted

Gina El Nesr

@ginaelnesr

about 1 month ago

Excited to report the first de novo enzyme catalyzing two of the most energetically demanding reactions in biology—phosphomonoester and phosphodiester hydrolysis—with catalytic efficiencies comparable to natural enzymes! 🚀 desB was designed zero-shot with dEVA. No structure prediction, no pre-defined motif, no reaction-intermediates. 🧵 @StanfordBiosci @bioe_stanford @SLAClab @EPFL @hes_so @simonduerr

ginaelnesr's tweet photo. Excited to report the first de novo enzyme catalyzing two of the most energetically demanding reactions in biology—phosphomonoester and phosphodiester hydrolysis—with catalytic efficiencies comparable to natural enzymes! 🚀

desB was designed zero-shot with dEVA. No structure prediction, no pre-defined motif, no reaction-intermediates. 🧵

@StanfordBiosci @bioe_stanford @SLAClab
@EPFL @hes_so @simonduerr

263

115

21K

atsoCF retweeted

Nada Maucourant Atallah

@MaucourantNada

about 1 month ago

From her hospital bed, Zainab Faraj - who survived a triple-tap Israeli strike -told us she and her colleague Amal Khalil were deliberately targeted for reporting on the ground. They paid the ultimate price for documenting Israel’s actions inside the so-called “yellow zone,” now off-limits to the Lebanese state and its citizens. Concretely, this means that with journalists and residents driven out, Israel is carrying out the systematic destruction of Lebanon’s border villages behind closed doors.

MaucourantNada's tweet photo. From her hospital bed, Zainab Faraj - who survived a triple-tap Israeli strike -told us she and her colleague Amal Khalil were deliberately targeted for reporting on the ground.

They paid the ultimate price for documenting Israel’s actions inside the so-called “yellow zone,” now off-limits to the Lebanese state and its citizens.

Concretely, this means that with journalists and residents driven out, Israel is carrying out the systematic destruction of Lebanon’s border villages behind closed doors.

763

438

13K

atsoCF retweeted

Jorge Bravo Abad

@bravo_abad

about 2 months ago

When a protein embedding is indistinguishable from noise Protein language models have become the backbone of computational biology. Feed them an amino acid sequence, and they return a dense vector—a compact numerical fingerprint that downstream models use to predict function, structure, localization, or the effect of a mutation. The assumption, largely unquestioned, is that this fingerprint actually encodes meaningful biology. Prabakaran and Bromberg challenge that assumption directly. They ask a deceptively simple question: how do you know whether a given embedding actually represents a protein—or whether it's just noise dressed up as a vector? Their answer is the Random Neighbor Score (RNS). The idea is elegant: generate biologically meaningless sequences by randomly shuffling the residues of real proteins—preserving amino acid composition but destroying all evolutionarily meaningful interactions. Then, for each real protein, measure how many of its nearest neighbors in latent space are these random imposters. A high RNS means the model never learned to place that protein somewhere biologically meaningful. Applied to ESM-2 and ProtT5 across thousands of proteins, RNS correlates strongly with structural prediction quality: proteins with poorly predicted structures have embeddings nearly indistinguishable from random sequences. Downstream tasks follow the same pattern—contact prediction precision drops roughly 40% for high-RNS proteins, and variant effect prediction falls to near chance. Most sobering: between 19% and 46% of the human proteome is underlearned by current models, depending on architecture. Intrinsically disordered regions fare especially poorly across all architectures tested. RNS is model-agnostic and computationally cheap—around two minutes on GPU for 10,000 proteins—making it a practical prescreening step before any embedding-based inference. For R&D teams that routinely use protein embeddings to prioritize variants, annotate novel sequences, or screen large libraries, this has immediate consequences. Running RNS before downstream inference flags proteins where predictions are unreliable, reducing the risk of propagating errors into expensive wet-lab campaigns. It also offers a principled way to identify gaps in model coverage—directly actionable for teams building or fine-tuning their own foundation models. Paper: R. Prabakaran & Yana Bromberg, Nature Methods (2026) — CC BY-NC-ND 4.0 | https://t.co/JOblk7kT6R

bravo_abad's tweet photo. When a protein embedding is indistinguishable from noise

Protein language models have become the backbone of computational biology. Feed them an amino acid sequence, and they return a dense vector—a compact numerical fingerprint that downstream models use to predict function, structure, localization, or the effect of a mutation. The assumption, largely unquestioned, is that this fingerprint actually encodes meaningful biology.

Prabakaran and Bromberg challenge that assumption directly. They ask a deceptively simple question: how do you know whether a given embedding actually represents a protein—or whether it's just noise dressed up as a vector?

Their answer is the Random Neighbor Score (RNS). The idea is elegant: generate biologically meaningless sequences by randomly shuffling the residues of real proteins—preserving amino acid composition but destroying all evolutionarily meaningful interactions. Then, for each real protein, measure how many of its nearest neighbors in latent space are these random imposters. A high RNS means the model never learned to place that protein somewhere biologically meaningful.

Applied to ESM-2 and ProtT5 across thousands of proteins, RNS correlates strongly with structural prediction quality: proteins with poorly predicted structures have embeddings nearly indistinguishable from random sequences. Downstream tasks follow the same pattern—contact prediction precision drops roughly 40% for high-RNS proteins, and variant effect prediction falls to near chance. Most sobering: between 19% and 46% of the human proteome is underlearned by current models, depending on architecture. Intrinsically disordered regions fare especially poorly across all architectures tested.

RNS is model-agnostic and computationally cheap—around two minutes on GPU for 10,000 proteins—making it a practical prescreening step before any embedding-based inference.

For R&D teams that routinely use protein embeddings to prioritize variants, annotate novel sequences, or screen large libraries, this has immediate consequences. Running RNS before downstream inference flags proteins where predictions are unreliable, reducing the risk of propagating errors into expensive wet-lab campaigns. It also offers a principled way to identify gaps in model coverage—directly actionable for teams building or fine-tuning their own foundation models.

Paper: R. Prabakaran & Yana Bromberg, Nature Methods (2026) — CC BY-NC-ND 4.0 | https://t.co/JOblk7kT6R

323

281

22K

atsoCF retweeted

Benjamin Maier @b_d_maier

11 months ago

Our group is hiring a postdoc to work on Digital Twin development for rare diseases. Collaborate with me and a fantastic interdisciplinary team! Deadline coming up this Sunday!

355

Who to follow

The Align Foundation

@Align_Bio

Improving life science with programmable experiments. 🧬 🤖 Home of the Protein Engineering Tournament & Open Datasets Initiative

Research Assistant Professor, National Institute of Chemistry, Baker lab alumni, MSCA alumni -- @[email protected] @ajasjaljubetic.bsky.social

atsoCF retweeted

Alex Bateman @Alexbateman1

about 1 year ago

@embl @MSCActions @EMBLHeidelberg @EMBLHamburg @EMBLGrenoble @emblebi @EMBLBarcelona @EMBLRome We would welcome applicants who wish to advance @PfamDB and/or @InterProDB

157

atsoCF retweeted

Christopher W. Wood @ChrisWellsWood

about 1 year ago

Our paper on the Protein Design Archive has just been published in @NatureBiotech! It describes our web application for exploring designed proteins (link ⬇️) and contains analysis of how they have changed over time. @MartaChronowska @mjstam @WoolfsonLab https://t.co/7aa50oYRqQ

12K

atsoCF retweeted

J. Am. Chem. Soc. @J_A_C_S

about 1 year ago

De Novo Design of Proteins for Autocatalytic Isopeptide Bond Formation | Journal of the American Chemical Society @UCSDChemBiochem https://t.co/0vWAoqiVv1

atsoCF retweeted

Bioinformatics Advances

@BioinfoAdv

about 1 year ago

@atsoCF @IoannisRiziotis @andreeva789bg @Alexbateman1

134

Francesco Costa @atsoCF

about 1 year ago

🙏 Special thanks to my amazing collaborators: @IoannisRiziotis , Rob Barringer, @andreeva789bg and to my supervisor @Alexbateman1 for their invaluable contributions to this work!

Francesco Costa @atsoCF

about 1 year ago

📢 Just published! Introducing Isopeptor: a computational tool for detecting intramolecular isopeptide bonds in protein structures with high precision and recall. https://t.co/CYbH7tgoKg

Francesco Costa @atsoCF

about 1 year ago

💻 Available as a Python package for easy integration into bioinformatics workflows, and accessible via Google Colab for everyone: https://t.co/ybeOU0b6rz

Francesco Costa @atsoCF

over 1 year ago

Check out this article about my research at @emblebi where I focus on protein modelling and on the study of fibrillar adhesins. Recently, we developed a method to detect isopeptide bonds, a key stabilizing feature in bacterial proteins. Big thanks to @oanastroe123!!👇

EMBL @embl

over 1 year ago

Meet Francesco Costa @atsoCF 🇮🇹 – a PhD student at EMBL-EBI, focusing on protein science. Francesco is fascinated by protein design, and his recent work helped ‘rescue’ low-confidence #AlphaFold protein structure predictions. https://t.co/KkAkxqKAQM #PhD

embl's tweet photo. Meet Francesco Costa @atsoCF 🇮🇹 – a PhD student at EMBL-EBI, focusing on protein science.

Francesco is fascinated by protein design, and his recent work helped ‘rescue’ low-confidence #AlphaFold protein structure predictions.

https://t.co/KkAkxqKAQM

#PhD https://t.co/Svz7NGR1rx

311

atsoCF retweeted

Biology+AI Daily @BiologyAIDaily

over 1 year ago

NERVE 2.0: Boosting the New Enhanced Reverse Vaccinology Environment via Artificial Intelligence and a User-Friendly Web Interface 1. NERVE 2.0 introduces updates to reverse vaccinology with AI-powered tools and a web interface, aiming to simplify vaccine candidate identification and analysis for diverse users. 2. New modules like ESPAAN predict adhesins with improved accuracy, while Virulent expands the search to virulence factors, addressing broader vaccine development needs. 3. Loop-Razor recovers extracellular regions of transmembrane proteins, allowing researchers to explore potential candidates often overlooked in previous pipelines. 4. Epitope Prediction targets epitopes with broad population coverage, utilizing linear HLA alleles to streamline immunogenicity assessments. 5. The redesigned web platform improves usability with modular options, adjustable parameters, and data storage, making it accessible to researchers with varying levels of bioinformatics expertise. 6. Benchmarking results show NERVE 2.0 performing better than its predecessor and tools like Vaxign2 and VaxiJen, demonstrating reliable enrichment in identifying bacterial protective antigens. 7. With both web-based and standalone versions, NERVE 2.0 supports flexible workflows for comprehensive reverse vaccinology analyses. @Francescop966 💻Code: https://t.co/f0oRG3e9dM 📜Paper: https://t.co/MXa2Av1SCw #ReverseVaccinology #Bioinformatics #VaccineDevelopment #MachineLearning #AI

BiologyAIDaily's tweet photo. NERVE 2.0: Boosting the New Enhanced Reverse Vaccinology Environment via Artificial Intelligence and a User-Friendly Web Interface

1. NERVE 2.0 introduces updates to reverse vaccinology with AI-powered tools and a web interface, aiming to simplify vaccine candidate identification and analysis for diverse users.

2. New modules like ESPAAN predict adhesins with improved accuracy, while Virulent expands the search to virulence factors, addressing broader vaccine development needs.

3. Loop-Razor recovers extracellular regions of transmembrane proteins, allowing researchers to explore potential candidates often overlooked in previous pipelines.

4. Epitope Prediction targets epitopes with broad population coverage, utilizing linear HLA alleles to streamline immunogenicity assessments.

5. The redesigned web platform improves usability with modular options, adjustable parameters, and data storage, making it accessible to researchers with varying levels of bioinformatics expertise.

6. Benchmarking results show NERVE 2.0 performing better than its predecessor and tools like Vaxign2 and VaxiJen, demonstrating reliable enrichment in identifying bacterial protective antigens.

7. With both web-based and standalone versions, NERVE 2.0 supports flexible workflows for comprehensive reverse vaccinology analyses.

@Francescop966
💻Code: https://t.co/f0oRG3e9dM
📜Paper: https://t.co/MXa2Av1SCw

#ReverseVaccinology #Bioinformatics #VaccineDevelopment #MachineLearning #AI

971

atsoCF retweeted

EMBL-EBI @emblebi

over 1 year ago

Prediction confidence scores help #AlphaFold users gauge the reliability of protein structure predictions. But things get more challenging for protein families. This new method helps improve low confidence predictions within protein families. https://t.co/KPhXxUp6Ta

emblebi's tweet photo. Prediction confidence scores help #AlphaFold users gauge the reliability of protein structure predictions.

But things get more challenging for protein families.

This new method helps improve low confidence predictions within protein families.

https://t.co/KPhXxUp6Ta https://t.co/LXgPvOLxKW

Francesco Costa @atsoCF

over 1 year ago

A special thanks goes to my supervisor @Alexbateman1 and to @matthiasblum for their support and guidance. 7/7

Francesco Costa @atsoCF

over 1 year ago

Excited to announce our latest publication: “Keeping it in the family: Using protein family templates to rescue low confidence AlphaFold2 models” where we explore plDDT variability in #AF2 models of @PfamDB domains. https://t.co/qi8ZAQi1B3 1/7

Francesco Costa @atsoCF

over 1 year ago

Our findings have important implications for improving structure predictions, especially for proteins from organisms with limited representation in sequence databases or for rapidly evolving taxa. 6/7

107

Francesco Costa

@atsoCF

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users