Davis Liang

over 1 year ago

Interested in the intersection of healthcare and cutting edge applied science (LLMs, multilinguality, multimodal models, speech recognition, etc.)? We’re hiring machine learning scientists at @AbridgeHQ 🚀 DM me or apply here: https://t.co/vibpb9W0lH

Senior Director & RS @Meta + Visiting Prof NYU | OG in LLMs | Pretrain+Finetune in 2008+ | 148k+ citations | Current: Self-Improving & Co-Improving AI

over 1 year ago

What makes being an Abridger truly special? ✨ For @LiangDavis, Staff Research Scientist: “It’s this culture of being able to see past what’s directly in front of you…and into the lives of the patients, the clinicians, the doctors, the nurses.” ❤️‍🩹 At Abridge, we’re not just building a groundbreaking AI solution—we’re shaping the future of healthcare by putting patients and clinicians at the heart of everything we do. 🚀 If you’re ready to join one of the most exciting, mission-driven teams in healthcare, we’re hiring: https://t.co/zQzOyPIxoz

0

11

0

3K

0

14

4

5

2K

LiangDavis retweeted

Pranav @PranavMani30

over 1 year ago

Does adapting general-domain models to medical-domain actually help w med-domain tasks? Stop by at Tuttle Hall, 230p EST, Nov 14 @emnlpmeeting to catch the amazing @danielpjeong present his 🚀oral 🚀talk. Super glad to be part of this work w @danielpjeong @saurabh_garg67 @zacharylipton @MichaelOberst Paper: https://t.co/wy4WQjxsrT

0

9

3

1

920

LiangDavis retweeted

Shiv Rao, MD

@ShivdevRao

over 1 year ago

1. Healthcare systems need enterprise-grade AI solutions they can trust. Read our latest whitepaper to learn more about what enterprise-grade AI for healthcare looks like: https://t.co/rQiSv0gMJN

1

38

9

11

4K

Who to follow

Jason Weston

@jaseweston

Zhaofeng Wu

@zhaofeng_wu

PhD student @MIT_CSAIL | Previously @allen_ai | MS'21 BS'19 BA'19 @uwnlp | 💼 on the industry job market

Yu Su

@ysu_nlp

co-founder @NeoCognition | prof. @osunlp | sloan fellow | building towards abundance of specialized intelligence

LiangDavis retweeted

Zachary Lipton

@zacharylipton

over 1 year ago

Evaluation of ambient scribes is a formidable task due to the free-form nature of generated text. Rigor requires automated metrics, strong benchmarks, clinician-in-loop trials, and in vivo testing. Learn how we're tackling these challenges at @AbridgeHQ: https://t.co/8oWka8elwh

2

62

18

24

14K

LiangDavis retweeted

Nikhil Krishnan

@nikillinit

almost 2 years ago

new post - we took a look at behind the technical curtain some of the interesting engineering challenges behind a company (@AbridgeHQ ) training their own LLMs. -dealing with multiple languages -handling model drift -generalist models vs. healthcare specific ones and more (just fyi, this one is a sponsored post but it's pretty interesting if you're interested in healthcare-specific LLMs)

nikillinit's tweet photo. new post - we took a look at behind the technical curtain some of the interesting engineering challenges behind a company (@AbridgeHQ ) training their own LLMs.

-dealing with multiple languages
-handling model drift
-generalist models vs. healthcare specific ones

and more

(just fyi, this one is a sponsored post but it's pretty interesting if you're interested in healthcare-specific LLMs)

2

19

5

22

6K

LiangDavis retweeted

Shiv Rao, MD

@ShivdevRao

almost 2 years ago

Excited to announce our work at @AbridgeHQ with Kaiser Permanente. https://t.co/rydZN5SCe4

7

73

11

5

8K

LiangDavis retweeted

Lucas Bandarkar @LucasBandarkar

almost 2 years ago

We presented Belebele at ACL 2024 this week! (Thx to @LiangDavis and @ImSNShukla) A year on from its release, it’s been really cool to see the diversity of research projects that have used it. The field is in dire need of more multilingual benchmarks !

0

23

6

0

2K

LiangDavis retweeted

Julian Salazar @JulianSlzr

almost 2 years ago

Scary to think that in a generation, no one will make the EMNLP / LMNOP joke anymore 😢 @emnlpmeeting

0

1

2

0

847

almost 2 years ago

@haroonc @AbridgeHQ Thanks, Haroon! Great meeting you at the hackathon

0

1

0

30

LiangDavis retweeted

almost 2 years ago

🌐 We loved presenting at the Out-Of-Pocket Gen AI x Healthcare Ops Hackathon. Our very own @LiangDavis demoed Abridge and spoke about the importance of multilinguality in health tech. 🗣️ Did you know? •Over 350 languages are spoken in the United States. •20% of Americans speak two or more languages. •More than 11% of patients in California-licensed hospitals prefer to speak Spanish (CA Dept. Healthcare Access to Information, 2021). Yet, multilingual performance remains a significant challenge today, even for state-of-the-art models like GPT-4 (The Belebele Benchmark. Bandarkar et al. ACL 2024). Tokenizations are often biased towards English, making it more expensive and less efficient for languages like Arabic, Hindi, and Chinese. Davis also discussed ways we can start tackling this issue: •Leverage both English and non-English data for training. •Consider up-weighted sampling of important languages during training. •Increase rank for parameter efficient fine-tuning on multilingual data. •Construct intelligent multilingual vocabularies (XLM-V. Liang et al. EMNLP 2023). At Abridge, we care deeply about multilingual performance. Our speech recognition is tuned to handle medical conversations across 14+ languages, coping with cross-talk, background noise, and an evolving landscape of maladies, medications, and practice patterns. Learn more about our AI here: https://t.co/QTHy5DOW4F

AbridgeHQ's tweet photo. 🌐 We loved presenting at the Out-Of-Pocket Gen AI x Healthcare Ops Hackathon. Our very own @LiangDavis demoed Abridge and spoke about the importance of multilinguality in health tech.

🗣️ Did you know?

•Over 350 languages are spoken in the United States.
•20% of Americans speak two or more languages.
•More than 11% of patients in California-licensed hospitals prefer to speak Spanish (CA Dept. Healthcare Access to Information, 2021).

Yet, multilingual performance remains a significant challenge today, even for state-of-the-art models like GPT-4 (The Belebele Benchmark. Bandarkar et al. ACL 2024). Tokenizations are often biased towards English, making it more expensive and less efficient for languages like Arabic, Hindi, and Chinese.

Davis also discussed ways we can start tackling this issue:

•Leverage both English and non-English data for training.
•Consider up-weighted sampling of important languages during training.
•Increase rank for parameter efficient fine-tuning on multilingual data.
•Construct intelligent multilingual vocabularies (XLM-V. Liang et al. EMNLP 2023).

At Abridge, we care deeply about multilingual performance. Our speech recognition is tuned to handle medical conversations across 14+ languages, coping with cross-talk, background noise, and an evolving landscape of maladies, medications, and practice patterns. Learn more about our AI here: https://t.co/QTHy5DOW4F

4

22

4

2

2K

LiangDavis retweeted

Alexandr Wang

@alexandr_wang

about 2 years ago

How overfit are popular LLMs on public benchmarks? New research out of @scale_ai SEAL to answer this: - produced a new eval GSM1k - evaluated public LLMs for overfitting on GSM8k VERDICT: Mistral & Phi are overfitting benchmarks, while GPT, Claude, Gemini, and Llama are not.

alexandr_wang's tweet photo. How overfit are popular LLMs on public benchmarks?

New research out of @scale_ai SEAL to answer this:

- produced a new eval GSM1k
- evaluated public LLMs for overfitting on GSM8k

VERDICT: Mistral & Phi are overfitting benchmarks, while GPT, Claude, Gemini, and Llama are not. https://t.co/hRhcNQWo93

10

426

76

219

204K

LiangDavis retweeted

about 2 years ago

We’re thrilled to be featured on the @Forbes AI 50, alongside companies that inspire us such as OpenAI, Anthropic, Databricks, and others. It’s a special privilege to represent the impact of AI in healthcare, improving the care delivery experience at scale for both clinicians and patients. Grateful to Forbes for this acknowledgment, and to the health systems we work with for their partnership. Check out the full list: https://t.co/7Pyh7e3u60 #ForbesAI50

AbridgeHQ's tweet photo. We’re thrilled to be featured on the @Forbes AI 50, alongside companies that inspire us such as OpenAI, Anthropic, Databricks, and others.

It’s a special privilege to represent the impact of AI in healthcare, improving the care delivery experience at scale for both clinicians and patients. Grateful to Forbes for this acknowledgment, and to the health systems we work with for their partnership.

Check out the full list: https://t.co/7Pyh7e3u60 #ForbesAI50

0

24

9

0

3K

LiangDavis retweeted

Sebastian Ruder

@seb_ruder

about 2 years ago

Ahia et al. (2023; https://t.co/XKPAybRVY6) observed that the same is true for current LLMs such as ChatGPT: They segment text in non-English languages into many more tokens and are thus much more costly to use in such languages. They call this “double unfairness”: higher prices + lower utility (reduced performance) in these languages.

seb_ruder's tweet photo. Ahia et al. (2023; https://t.co/XKPAybRVY6) observed that the same is true for current LLMs such as ChatGPT: They segment text in non-English languages into many more tokens and are thus much more costly to use in such languages.

They call this “double unfairness”: higher prices + lower utility (reduced performance) in these languages.

1

27

7

9

3K

about 2 years ago

@WilliamWangNLP The one true public benchmark — random twitter shitposts

0

1

0

49

about 2 years ago

@WilliamWangNLP Whenever I see these model releases… I immediately jump to the comments 😂

1

0

258

LiangDavis retweeted