STAR-VAE: Latent Variable Transformers for Scalable and Controllable Molecular Generation
1. The vast chemical space of drug-like molecules necessitates powerful generative models. STAR-VAE addresses this by combining a Transformer encoder and autoregressive Transformer decoder, trained on 79 million molecules using SELFIES to ensure syntactic validity. This approach enables both broad distribution learning and conditional generation guided by molecular properties.
2. A key innovation is the principled conditional latent-variable formulation. A property predictor provides a consistent conditioning signal to the latent prior, inference network, and decoder. This allows STAR-VAE to generate molecules with desired properties using limited labeled data, making it highly efficient for property-guided molecular design.
3. Efficiency is further enhanced through low-rank adapters (LoRA) in both the encoder and decoder. This enables fast fine-tuning with minimal data, making the model adaptable to new tasks without extensive retraining. This is crucial for practical applications where labeled data is scarce.
4. STAR-VAE demonstrates strong performance on benchmarks like GuacaMol and MOSES, matching or exceeding existing baselines. In conditional tasks, it shifts docking score distributions toward stronger binding affinities, as shown in the Tartarus protein–ligand design benchmark. This highlights its potential for drug discovery.
5. Latent space analyses reveal smooth, semantically structured embeddings, supporting both unconditional exploration and property-aware steering. This dual capability makes STAR-VAE a versatile tool for navigating the complex landscape of chemical space.
📜Paper: https://t.co/OkSPpCPkwO
#MolecularGeneration #Transformer #VAE #DrugDiscovery #AIinChemistry
🚀 Excited to introduce MMELON—our new multi-view molecular foundation model! By combining graph, image, and text representations, MMELON delivers state-of-the-art performances for prediction and regression tasks.
Code: https://t.co/Gc3XLuCDkw
Preprint: https://t.co/iFlCTGYnkV
Multi-view biomedical foundation models for molecule-target and property prediction @IBMResearch
• The paper introduces MMELON, a multi-view molecular foundation model combining graph, image, and text views to enhance prediction of molecular properties. Unlike single-view models, MMELON leverages multiple representations for a richer, more versatile molecular embedding.
• The model performs exceptionally well on 18 diverse tasks, including ligand-protein binding, molecular solubility, metabolism, and toxicity, balancing the strengths of each modality. This versatility is critical in drug discovery and computational chemistry.
• MMELON integrates three views—graph, image, and text—to learn comprehensive molecular representations. The image view uses ImageMol (pre-trained on 10 million molecules), while the graph and text views are based on advanced transformer architectures, pre-trained on datasets of 200 million molecules.
• A novel aspect is the “late fusion” of these different modalities, ensuring each modality contributes optimally depending on the downstream task. This approach yields interpretable results and allows for an analysis of how each view supports different predictions.
• For validation, MMELON was applied to screen compounds against a large set of G Protein-Coupled Receptors (GPCRs). Of these, 33 GPCRs related to Alzheimer’s disease were identified, and strong binders were predicted, validated through in silico structure modeling.
• The multi-view model shows strong correlations between predicted and experimental affinities, achieving a Pearson correlation of 0.78 for GPCR binding. This suggests the model’s robust application for identifying new therapeutics.
• Compared to single-view models, MMELON delivers superior performance across classification and regression tasks, making it an essential tool for complex molecular property predictions in drug discovery.
@jamorrone3@jianying_hu@FeixiongCheng@jeriscience@BCKwon@timrumbell@dplatt_maths@YunguangQiu@diwakarmahajan
💻Code: https://t.co/HXrLfEwv2X
📜Paper: https://t.co/TBddxQ9iM8
#biomedicalAI #drugdiscovery #foundationmodel #multiviewlearning #GPCR #Alzheimers #machinelearning #bioinformatics
BREAKING NEWS
The 2024 #NobelPrize in Literature is awarded to the South Korean author Han Kang “for her intense poetic prose that confronts historical traumas and exposes the fragility of human life.”
Counterfactuals explain and reduce over-reliance on AI in clinical settings, but how do we create counterfactuals for images like MRIs? And can we ensure their domain relevance?
In our new #facct2024 paper (/w Lifu Deng @ATandonMD @EndertAlex@BCKwon), we present MiMICRI (1/6)
We're hiring!
The HCI & Visualization group at Autodesk Research is hiring for two Research Scientist positions at the intersection of HCI and AI.
If interested, please apply (and reach out if you have questions).
https://t.co/xn4mfoiMfJ
(@ADSKResearch)
What is visualization literacy? How can we measure it? How can we improve it for everyone?
Submit your work to our CHI 2024 workshop by Feb 29 and join our discussions on defining, studying, and enhancing visualization literacy for all.
https://t.co/d2PsEApVuj
What is visualization literacy? How can we measure it? How can we improve it for everyone?
Submit your work to our CHI 2024 workshop by Feb 29 and join our discussions on defining, studying, and enhancing visualization literacy for all.
https://t.co/d2PsEApVuj
It’s Friday, it’s the last day of #ieeevis, and we’re now getting ready for our paper on “Visualization Thumbnails”🎞️🌆🌁🌃 by authors Hwiyeon Kim, Joohee Kim, Yunha Han, Hwajung Hong, Oh-Sang Kwon, Young-Woo Park, myself, @SungahnK, and @BCKwon. Room 109! […]
🚀 Exciting Opportunity Alert! 🌟 Join our team as a Research Intern and contribute to the future of trustworthy foundation models. 🧠 Apply now to make a real impact! #AIResearch#InternshipOpportunity#FoundationModels 🔍👩💻🔬 https://t.co/kt5ZAmXStL
Excited to be in Toronto for #ACL2023NLP! Check out the paper, source code, and video of our paper Finspector. I'll be presenting Finspector at @ibmresearch booth around 9am - 10am Monday & Tuesday and at the main conference hall around 11am - 12:30pm Wednesday. Let's talk 🤩!
How can we uncover hidden biases in language models that impact fairness? Our #ACL2023 demo paper introduces Finspector, an interactive visualization widget available as a Python package for Jupyter.
Paper, Video, Code: https://t.co/bqbrjAKdI3
@nandanamihindu#nlp#fairness
It was a blast working on VisText with @bennyjtang and @arvindsatya1, and I can't wait to see how the dataset can support future chart captioning research!! 📊🤖
Don't miss @bennyjtang's presentation at #ACL2023NLP
Our paper showcases a use case that demonstrates how Finspector can be used to discover biases of language models. We discuss implications, limitations, and future work.
Read the paper in detail: https://t.co/rH36uQuiyc
How can we uncover hidden biases in language models that impact fairness? Our #ACL2023 demo paper introduces Finspector, an interactive visualization widget available as a Python package for Jupyter.
Paper, Video, Code: https://t.co/bqbrjAKdI3
@nandanamihindu#nlp#fairness
To use Finspector, users i) prepare a test dataset containing sentences and relevant metadata; ii) compute pseudo-log-likelihood scores using pre-trained models like BERT, RoBERTa, and ALBERT; and iii) launch Finspector on Jupyter and visually explore the biases of the models.