George Ho @_eigenfoo - Twitter Profile

George Ho @_eigenfoo

almost 2 years ago

So, do I know anybody attending #KDD2024 @kdd_news this year? I'll be there next week!

0

1

0

296

_eigenfoo retweeted

DatNoFact ↗ (@datnofact.bsky.social) @datnofact

almost 2 years ago

hello I'm new to the stock market is it good when the intel ceo starts praying

217

117K

12K

5K

8M

_eigenfoo retweeted

Pablo Montalvo @m_olbap

about 2 years ago

It was hard to find quality OCR data... until today! Super excited to announce the release of the 2 largest public OCR datasets ever 📜 📜

OCR is critical for document AI: here, 26M+ pages, 18b text tokens, 6TB! Thanks to @ucsf_library, @industrydocs and @PDFAssociation
🧶 ↓ https://t.co/Za7gRnuRi4

7

601

101

519

93K

George Ho @_eigenfoo

over 2 years ago

shot / chaser

0

2

0

305

_eigenfoo retweeted

Dr Kareem Carr

@kareem_carr

over 2 years ago

The perfect peer-reviewed article title does not exi-

34

2K

232

502

184K

George Ho @_eigenfoo

over 2 years ago

Also from the group chat today

Wordle 934 3/6*

⬛⬛🟨🟩⬛
🟩🟩⬛🟩🟩
🟩🟩🟩🟩🟩

0

167

George Ho @_eigenfoo

over 2 years ago

My NYT word game group chat has just come up with a new idea: play Wordle, get your score, and then prompt an image generation AI to draw a picture of what you see in your score.

I'll go first.

Wordle 934 6/6

⬜🟦⬜⬜⬜
⬜🟦⬜⬜🟦
🟦🟦🟦🟦⬜
🟧🟧⬜🟦🟦
🟧🟧⬜🟧🟧
🟧🟧🟧🟧🟧 https://t.co/d6NPdENq3e

1

3

1

0

569

_eigenfoo retweeted

Armineh @arminehnouri

over 2 years ago

Very excited to introduce DocLLM, a multimodal LLM developed by my colleagues @jpmorgan. DocLLM-7B outperforms other SotA LLMs on 12/16 benchmarks within four core Document AI tasks! Incredibly proud of the team for their hard work. Check it out at https://t.co/BNHo1ia8d5

6

110

27

80

41K

_eigenfoo retweeted

AK

@_akhaliq

over 2 years ago

JPMorgan announces DocLLM

A layout-aware generative language model for multimodal document understanding

paper page: https://t.co/azTKT5jZjH

Enterprise documents such as forms, invoices, receipts, reports, contracts, and other similar records, often carry rich semantics at the intersection of textual and spatial modalities. The visual cues offered by their complex layouts play a crucial role in comprehending these documents effectively. In this paper, we present DocLLM, a lightweight extension to traditional large language models (LLMs) for reasoning over visual documents, taking into account both textual semantics and spatial layout. Our model differs from existing multimodal LLMs by avoiding expensive image encoders and focuses exclusively on bounding box information to incorporate the spatial layout structure. Specifically, the cross-alignment between text and spatial modalities is captured by decomposing the attention mechanism in classical transformers to a set of disentangled matrices. Furthermore, we devise a pre-training objective that learns to infill text segments. This approach allows us to address irregular layouts and heterogeneous content frequently encountered in visual documents. The pre-trained model is fine-tuned using a large-scale instruction dataset, covering four core document intelligence tasks. We demonstrate that our solution outperforms SotA LLMs on 14 out of 16 datasets across all tasks, and generalizes well to 4 out of 5 previously unseen datasets.

23

2K

341

2K

353K

George Ho @_eigenfoo

over 2 years ago

I sawed my copy of the power broker in half so that it’s easier to carry around

When a book’s size becomes an impediment to reading it, I feel like something’s gone seriously wrong https://t.co/smZy3iGI1y

0

7

0

341

George Ho @_eigenfoo

over 2 years ago

Hi yes hello good morning

I was on a podcast, talking about crossword archivism and milk cartons

You can listen to it here: https://t.co/1j0pzcbPFa

0

181

_eigenfoo retweeted

Patrick Collison

@patrickc

over 4 years ago

Gerty and Carl Cori won the Nobel Prize together in 1947. Then 6 of their students won Nobel Prizes, all in physiology/medicine and chemistry. (Five separate prizes in total; one was shared.) https://t.co/VYkSySJ4SR

14

690

60

229

0

_eigenfoo retweeted

Jennifer R. Weiser @ProfJRWeiser

over 2 years ago

Beyond ecstatic for our Cooper Brue team from @cooperunion for winning both best beer label and 3rd place overall in the annual beer brewing competition at AIChE. Go team and thanks Ana for helping us compete! And yes, the poster is hand drawn!

2

20

3

0

1K

_eigenfoo retweeted

Jack Morris

@jxmnop

over 2 years ago

i would retire too if i had to rewrite the entire HuggingFace Trainer to work with HuggingFace Accelerate, jesus that must have been a nightmare

4

113

4

11

37K

_eigenfoo retweeted

Loplop @__loplop

about 3 years ago

Hello, long time no #crossword! A new #cryptic is up, and I’m pretty happy with it! My favorite clue: I'm about to stuff fruit with trace of radium — it might bring death (4,6) https://t.co/PawHGrWrFY

1

8

2

0

666

_eigenfoo retweeted

Flatiron Health @flatironhealth

about 3 years ago

#ML can extract clinically relevant information from EHRs at scale, but evaluating its quality has focused on single variables. This @flatironhealth study aims to evaluate ML's usefulness for research & RWE generation at scale: https://t.co/iTpSf2JhbX @Cancers_MDPI

1

5

2

1

1K

_eigenfoo retweeted

Dr. Blythe Adamson

@DrBlytheAdamson

about 3 years ago

@flatironhealth @zachweinberg Big reveal of Flatiron Health #machinelearning with #language and documents in EHR. The full text explainer from our team is here: https://t.co/BJzzxIatPX

0

2

1

0

369

George Ho @_eigenfoo

about 3 years ago

@mayhuangwrites @MF_wordz I still haven’t forgotten about that collab you promised me 👀

1

0

42

_eigenfoo retweeted

Flatiron Health @flatironhealth

about 3 years ago

Extracting meaningful clinical detail from EHRs for millions of patients with cancer is challenging. @FlatironHealth uses #NLP & #ML to extract key information from unstructured documents in the curation of high quality #RWD. Read more on our approach: https://t.co/kaHpx1pEhN

1

16

5

6

11K
