Daniel Mas Montserrat @_danielmas - Twitter Profile

Pinned Tweet

6 months ago

Excited to share our preprint on iLTM: an Integrated Large Tabular Model! arxiv: https://t.co/ZpT5ZiKJs9 No single technique consistently excels across all tabular tasks. iLTM addresses this by integrating distinct paradigms in a single architecture: - Gradient Boosted Decision Trees (GBDT)-based embeddings - Robust preprocessing with random feature projections - A meta-trained hypernetwork - Retrieval augmentation with Soft Nearest Neighbors @d_bonet @marcal_cc @alexGioannidis (1/N)

_danielmas's tweet photo. Excited to share our preprint on iLTM: an Integrated Large Tabular Model!

arxiv: https://t.co/ZpT5ZiKJs9

No single technique consistently excels across all tabular tasks. iLTM addresses this by integrating distinct paradigms in a single architecture:

- Gradient Boosted Decision Trees (GBDT)-based embeddings
- Robust preprocessing with random feature projections
- A meta-trained hypernetwork
- Retrieval augmentation with Soft Nearest Neighbors

@d_bonet @marcal_cc @alexGioannidis
(1/N)

1

14

7

3

2K

_danielmas retweeted

Daniel Tabin

@DanTabin

2 months ago

"Point cloud local ancestry inference (PCLAI): continuous coordinate-based ancestry along the genome" New preprint from @alexGioannidis's group looks super interesting! Deep learning to plot haplotypes into continuous PC spaces. Lots to go over. I need to read it more deeply

DanTabin's tweet photo. "Point cloud local ancestry inference (PCLAI): continuous coordinate-based ancestry along the genome"
New preprint from @alexGioannidis's group looks super interesting! Deep learning to plot haplotypes into continuous PC spaces. Lots to go over. I need to read it more deeply https://t.co/HdDToZVhH1

1

80

24

41

10K

_danielmas retweeted

Ambassador Frank Hull ☤ @frankiethull

3 months ago

The R binding is live at https://t.co/qBJ7xq4VQX We're bringing all TFMs, ICLs, LDMs, & LTMs to R 🔥

2

11

4

3

583

_danielmas retweeted

Daniel Mas Montserrat @_danielmas

6 months ago

Despite being meta-trained exclusively on classification, iLTM transfers effectively to regression tasks with light fine-tuning, matching or surpassing strong baselines on both tasks. iLTM achieves top rankings on TabZilla Hard, TabReD, and more benchmarks, outperforming well-tuned XGBoost, CatBoost, and recent deep tabular models. (3/N)

_danielmas's tweet photo. Despite being meta-trained exclusively on classification, iLTM transfers effectively to regression tasks with light fine-tuning, matching or surpassing strong baselines on both tasks.

iLTM achieves top rankings on TabZilla Hard, TabReD, and more benchmarks, outperforming well-tuned XGBoost, CatBoost, and recent deep tabular models.

(3/N)

1

7

1

0

167

Who to follow

Oscar Mañas

@oscmansan

Research scientist at @AIatMeta, PhD from @Mila_Quebec @UMontrealDIRO. Working on multimodal vision+language generation & evaluation. Català a Zúric.

Xavi Giró

@DocXavi

Applied scientist at @amazonscience Barcelona, Catalonia. Made at @la_upc & @columbia. Promoting @dlbcnai. Opinions my own.

Alexander G. Ioannidis

@alexGioannidis

computational genomics, AI in healthcare, professing, kitesurfing, sailing - καὶ γνώσεσθε τὴν ἀλήθειαν, καὶ ἡ ἀλήθεια ἐλευθερώσει ὑμᾶς

_danielmas retweeted

Daniel Mas Montserrat @_danielmas

6 months ago

Excited to share our preprint on iLTM: an Integrated Large Tabular Model! arxiv: https://t.co/ZpT5ZiKJs9 No single technique consistently excels across all tabular tasks. iLTM addresses this by integrating distinct paradigms in a single architecture: - Gradient Boosted Decision Trees (GBDT)-based embeddings - Robust preprocessing with random feature projections - A meta-trained hypernetwork - Retrieval augmentation with Soft Nearest Neighbors @d_bonet @marcal_cc @alexGioannidis (1/N)

1

14

7

3

2K

Daniel Mas Montserrat @_danielmas

6 months ago

We’re releasing code + pre-trained weights so anyone working with large-scale tabular data can get stronger baselines and build on iLTM. We’d love feedback and comparisons on your own datasets: Paper: https://t.co/ZpT5ZiKJs9 Code: https://t.co/8XtKNFfUF5 Weights: https://t.co/EPbRxnJLDx

0

6

0

117

Daniel Mas Montserrat @_danielmas

6 months ago

Excited to share our preprint on iLTM: an Integrated Large Tabular Model! arxiv: https://t.co/ZpT5ZiKJs9 No single technique consistently excels across all tabular tasks. iLTM addresses this by integrating distinct paradigms in a single architecture: - Gradient Boosted Decision Trees (GBDT)-based embeddings - Robust preprocessing with random feature projections - A meta-trained hypernetwork - Retrieval augmentation with Soft Nearest Neighbors @d_bonet @marcal_cc @alexGioannidis (1/N)

1

14

7

3

2K

Daniel Mas Montserrat @_danielmas

6 months ago

From small tables to real industry-grade datasets with >1M rows and >10k features, our benchmarks show how iLTM scales across sizes. In our labs at @Stanford and @UCSC, we’re already exploring applications of iLTM to genomic data, where dimensionality is even higher. (5/N)

1

4

0

110

_danielmas retweeted

Valeriy M., PhD, MBA, CQF

@predict_addict

about 1 year ago

How was this paper even accepted to ICLR? The commercial promoters of TabPFN are now trying to discredit one of the best open repositories, OpenML. Utterly unacceptable, how did this paper pass ethics board at ICLR?

predict_addict's tweet photo. How was this paper even accepted to ICLR?

The commercial promoters of TabPFN are now trying to discredit one of the best open repositories, OpenML.

Utterly unacceptable, how did this paper pass ethics board at ICLR? https://t.co/yaPP8Uu1qD

1

7

3

5

3K

_danielmas retweeted

Arturo @arturolp

almost 2 years ago

Excited to share our latest PRS work! Our @GalateaBio and @genomelink team performed a comprehensive analysis of published @PGSCatalog models along with locally trained models using LDPred2, PRS-CSx, and SNPnet, across diverse populations using @UKBIOBANK and our own data

0

8

3

0

2K

_danielmas retweeted

medRxiv @medrxivpreprint

almost 2 years ago

Polygenic risk score portability for common diseases across genetically diverse populations https://t.co/Mgw0jjT7Tf #medRxiv

0

7

3

5

3K

_danielmas retweeted

Yannic Kilcher 🇸🇨

@ykilcher

about 2 years ago

No son of a construction worker is just going to randomly start doing ML research if they never hear of it and don't get told that it could be important for their future career, no matter how intelligent the kid is

16

272

8

10

14K

_danielmas retweeted

Daniel Mas Montserrat @_danielmas

over 2 years ago

Introducing "HyperFast: Instant Classification for Tabular Data" at @RealAAAI, which received the Best Paper Award at @NeurIPSConf Table rep. workshop @TrlWorkshop! We provide easy-to-use sklearn-like code: https://t.co/qrMF6XStAA Some insights of the work below 👇🧵(1/N)

1

14

3

1

2K

_danielmas retweeted

Daniel Mas Montserrat @_danielmas

over 2 years ago

This work has been led by @d_bonet with the supervision of @DocXavi and @alexGioannidis! Code available at: https://t.co/qrMF6XStAA #AI #AAAI #AAAI2024 #NeurIPS2023 #NeurIPS (5/5)

_danielmas's tweet photo. This work has been led by @d_bonet with the supervision of @DocXavi and @alexGioannidis!

Code available at: https://t.co/qrMF6XStAA

#AI #AAAI #AAAI2024 #NeurIPS2023 #NeurIPS

(5/5) https://t.co/ubFQQu2bKf

0

4

2

0

894

_danielmas retweeted

Daniel Mas Montserrat @_danielmas

over 2 years ago

Hyperfast provides competitive results in several tabular classification datasets, even matching boosting-tree-based accuracies! While still far from solving tabular data classification, we believe Hyperfast provides a step forward in NN-based tabular applications! (4/N)

_danielmas's tweet photo. Hyperfast provides competitive results in several tabular classification datasets, even matching boosting-tree-based accuracies!

While still far from solving tabular data classification, we believe Hyperfast provides a step forward in NN-based tabular applications!
(4/N) https://t.co/nDk8ifSjTu

1

5

2

0

340

_danielmas retweeted

Daniel Mas Montserrat @_danielmas

over 2 years ago

Hyperfast provides multiple mechanisms to scale to both large and high-dimensional datasets and can be easily applied to real-world applications! (3/N)

_danielmas's tweet photo. Hyperfast provides multiple mechanisms to scale to both large and high-dimensional datasets and can be easily applied to real-world applications! (3/N) https://t.co/4IlMtQ2a92

1

4

1

0

268

_danielmas retweeted

Daniel Mas Montserrat @_danielmas

over 2 years ago

Hyperfast replaces the slow process of training MLPs with gradient-based methods (e.g. Adam) with a fast hypernetwork that directly predicts the weights of the MLP. The generated MLP typically matches (or even surpasses) the accuracy of those trained with gradient descent. (2/N)

_danielmas's tweet photo. Hyperfast replaces the slow process of training MLPs with gradient-based methods (e.g. Adam) with a fast hypernetwork that directly predicts the weights of the MLP.
The generated MLP typically matches (or even surpasses) the accuracy of those trained with gradient descent. (2/N) https://t.co/Enur61L08Z

1

6

1

2

433

Daniel Mas Montserrat @_danielmas

over 2 years ago

This work has been led by @d_bonet with the supervision of @DocXavi and @alexGioannidis! Code available at: https://t.co/qrMF6XStAA #AI #AAAI #AAAI2024 #NeurIPS2023 #NeurIPS (5/5)

0

4

2

0

894

Daniel Mas Montserrat @_danielmas

over 2 years ago

Introducing "HyperFast: Instant Classification for Tabular Data" at @RealAAAI, which received the Best Paper Award at @NeurIPSConf Table rep. workshop @TrlWorkshop! We provide easy-to-use sklearn-like code: https://t.co/qrMF6XStAA Some insights of the work below 👇🧵(1/N)

1

14

3

1

2K

Daniel Mas Montserrat @_danielmas

over 2 years ago

Hyperfast provides competitive results in several tabular classification datasets, even matching boosting-tree-based accuracies! While still far from solving tabular data classification, we believe Hyperfast provides a step forward in NN-based tabular applications! (4/N)

1

5

2

0

340

Daniel Mas Montserrat

@_danielmas

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users