Joseph Lai 🍱 @jtlai - Twitter Profile

Joseph Lai 🍱 @jtlai

about 1 year ago

"You're absolutely right!" OK, I feel this is too assuring 😅 especially when hearing it so many times per day.

0

42

Joseph Lai 🍱 @jtlai

about 1 year ago

@borisjabes @census @fivetran @frasergeorgew Wow congrats Boris and Census team!

0

50

jtlai retweeted

Cameron Jones @camrobjones

about 1 year ago

New preprint: we evaluated LLMs in a 3-party Turing test (participants speak to a human & AI simultaneously and decide which is which). GPT-4.5 (when prompted to adopt a humanlike persona) was judged to be the human 73% of the time, suggesting it passes the Turing test (🧵)

camrobjones's tweet photo. New preprint: we evaluated LLMs in a 3-party Turing test (participants speak to a human & AI simultaneously and decide which is which).

GPT-4.5 (when prompted to adopt a humanlike persona) was judged to be the human 73% of the time, suggesting it passes the Turing test (🧵) https://t.co/GBEtoFJHVY

46

1K

197

563

279K

Joseph Lai 🍱 @jtlai

over 1 year ago

This should have been just a line item under the payment or invoice object as "Invoice Fee". But they deliberately choose to hide it. Very disappointed.

0

33

Who to follow

American Briton founded MIND THE GAP Records UK Ltd- back in the day.. Writer & independant music producer slowly dying of labile diabetes m1. ...I read manga

KentPai

@KentPai1983

生性懶散但不知道為什麼選擇當一個創業者。信奉「如果可以坐下為什麼要站著」的信條，缺乏耐心，喜歡去探訪那些未知的領域。最大的願望就是希望可以把所有的時間都浪費在那些自己鍾愛的事物上。喜歡旅遊、電影、美食與紐約洋基。

Joseph Lai 🍱 @jtlai

over 1 year ago

PSA: If you use Stripe Invocing, they've been taking 0.4% of your invoice total silently. These fees don't show up in the invoice itself, nor the corresponding payment. It's hidden in an obscure "balance report". This is NOT transparent at all.

1

0

1

76

jtlai retweeted

Ivan Leo

@ivanleomk

over 1 year ago

Processing PDF files to chat with in 2023 was an entire startup idea. Today with Gemini's native multimodality, all it takes is a simple file upload to get started. You can now process audio, videos, PDFs and even images with a simple upload using gemini instead of spending valuable engineering hours on a pre-processing pipeline. The 2 million token context also makes it easy to stuff a gazillion things inside it. Instructor makes this easy with its automatic validation and retries of the response, turning messy and sometimes unreliable llm outputs into clean, validated Pydantic objects. Building a prototype shouldn't have to feel like rocket science and starting with structured outputs will save you hours down the line - whether it's for testing, future fine-tuning or even manual annotation. Check out our latest article that walks you through how to do just that https://t.co/oR08grdJ35

4

87

5

97

10K

jtlai retweeted

Rohan Paul

@rohanpaul_ai

over 1 year ago

Local models now protect your privacy while still accessing powerful LLM capabilities Chain small and large LLMs to get best performance while keeping data private 🔍 Original Problem: Users share sensitive personal information with proprietary LLMs during inference, raising privacy concerns. While local open-source models help with privacy, they perform worse than proprietary models. ----- 🛠️ Solution in this Paper: • PAPILLON: A multi-stage pipeline where local models act as privacy-conscious proxies • Uses DSPy prompt optimization to find optimal prompts for privacy preservation • Two key components: - Prompt Creator: Generates privacy-preserving prompts - Information Aggregator: Combines responses while protecting PII • Created PUPA benchmark with 901 real-world user-LLM interactions containing PII ----- 💡 Key Insights: • Simple redaction significantly lowers LLM response quality • Privacy-conscious delegation can balance privacy and performance • Smaller local models can effectively leverage larger models while protecting privacy • Prompt optimization improves both quality and privacy metrics ----- 📊 Results: • Maintains 85.5% response quality compared to proprietary models • Restricts privacy leakage to only 7.5% • Outperforms simple redaction approaches • Shows consistent improvement across different model sizes

rohanpaul_ai's tweet photo. Local models now protect your privacy while still accessing powerful LLM capabilities

Chain small and large LLMs to get best performance while keeping data private

🔍 Original Problem:

Users share sensitive personal information with proprietary LLMs during inference, raising privacy concerns. While local open-source models help with privacy, they perform worse than proprietary models.

-----

🛠️ Solution in this Paper:

• PAPILLON: A multi-stage pipeline where local models act as privacy-conscious proxies
• Uses DSPy prompt optimization to find optimal prompts for privacy preservation
• Two key components:
- Prompt Creator: Generates privacy-preserving prompts
- Information Aggregator: Combines responses while protecting PII
• Created PUPA benchmark with 901 real-world user-LLM interactions containing PII

-----

💡 Key Insights:

• Simple redaction significantly lowers LLM response quality
• Privacy-conscious delegation can balance privacy and performance
• Smaller local models can effectively leverage larger models while protecting privacy
• Prompt optimization improves both quality and privacy metrics

-----

📊 Results:

• Maintains 85.5% response quality compared to proprietary models
• Restricts privacy leakage to only 7.5%
• Outperforms simple redaction approaches
• Shows consistent improvement across different model sizes

15

278

50

344

32K

Joseph Lai 🍱 @jtlai

over 1 year ago

🤙

Lisa Dunlap

@lisabdunlap

over 1 year ago

🧵We love measuring accuracy, but what about the vibes? Intro VibeCheck—a system that discovers and measures qualitative differences in LLMs. VibeCheck shows Llama is friendlier while GPT-4 focuses on ethics; these vibes can even predict model identity and user preference. https://t.co/PpeJEcc7gj

lisabdunlap's tweet photo. 🧵We love measuring accuracy, but what about the vibes?

Intro VibeCheck—a system that discovers and measures qualitative differences in LLMs. VibeCheck shows Llama is friendlier while GPT-4 focuses on ethics; these vibes can even predict model identity and user preference.

https://t.co/PpeJEcc7gj

2

167

38

75

29K

0

38

jtlai retweeted

Zhe Gan

@zhegan4

over 1 year ago

💡Imagine a multimodal LLM that masters universal UI understanding across platforms? Here it is, 🎁 we upgrade Ferret-UI to Ferret-UI 2, a generalist model for grounded UI understanding across iPhone, Android, iPad , Webpage, and AppleTV. Check the image below for visual examples. And check our paper for details on how we achieve this: https://t.co/bshfQrIAuP. Led by our awesome intern Zhangheng, together with Keen, @HaotianZhang4AI @yinfeiy and other great collaborators.

zhegan4's tweet photo. 💡Imagine a multimodal LLM that masters universal UI understanding across platforms?

Here it is, 🎁 we upgrade Ferret-UI to Ferret-UI 2, a generalist model for grounded UI understanding across iPhone, Android, iPad , Webpage, and AppleTV. Check the image below for visual examples.

And check our paper for details on how we achieve this: https://t.co/bshfQrIAuP.

Led by our awesome intern Zhangheng, together with Keen, @HaotianZhang4AI @yinfeiy and other great collaborators.

2

151

26

63

12K

jtlai retweeted

Maziyar PANAHI

@MaziyarPanahi

over 1 year ago

Microsoft just dropped OmniParser model on ⁦@huggingface⁩, so casually! 😂 “OmniParser is a general screen parsing tool, which interprets/converts UI screenshot to structured format, to improve existing LLM based UI agent.” 🔥 https://t.co/h9nzhyUUQB

16

831

115

659

71K

jtlai retweeted

Ruohong Zhang

@RuohongZhang

over 1 year ago

[p1] Improve Visual Language Model Chain-of-thought Reasoning paper link: https://t.co/eUnlisUsv5 project page (to be updated upon approval on release): https://t.co/LpAYt6k8yQ Content: 1. We distill 193K CoT data 2. Train with SFT 3. DPO to futher improve performance

RuohongZhang's tweet photo. [p1] Improve Visual Language Model Chain-of-thought Reasoning

paper link: https://t.co/eUnlisUsv5

project page (to be updated upon approval on release): https://t.co/LpAYt6k8yQ

Content:
1. We distill 193K CoT data
2. Train with SFT
3. DPO to futher improve performance https://t.co/jUjbLJWR8G

3

214

36

135

28K

jtlai retweeted

merve

@mervenoyann

over 1 year ago

I'm bullish on this foundation OCR model called GOT 📝 @eccvconf This model can transcribe anything and it's Apache-2.0! Keep reading to learn more 🧶

mervenoyann's tweet photo. I'm bullish on this foundation OCR model called GOT 📝 @eccvconf

This model can transcribe anything and it's Apache-2.0!

Keep reading to learn more 🧶

33

2K

225

3K

182K

Joseph Lai 🍱 @jtlai

almost 2 years ago

Creative and surprisingly it works

Andrew Drozdov

@mrdrozdov

almost 2 years ago

1

85

12

16

14K

0

149

jtlai retweeted

Andrew Drozdov

@mrdrozdov

almost 2 years ago

1

85

12

16

14K

Joseph Lai 🍱 @jtlai

almost 2 years ago

@tomkit08 Need one for property tax assessment appeal too!

0

27

jtlai retweeted

Nat Friedman

@natfriedman

about 2 years ago

"People hire a janitor service to clean their office. They don't hire a generic labor service, even though it's basically the same thing." – advice for AI startups.

33

1K

82

433

155K

Joseph Lai 🍱 @jtlai

about 2 years ago

@jeffiel 🧡 The Onion & ty @jeffiel for keeping them around!

0

9

Joseph Lai 🍱 @jtlai

about 2 years ago

A few of our domains started getting moved and it's a shit show. Squarespace's DNS update is slow & unreliable. Good luck trying to fix anything they mess up. (And meanwhile your whole service is down to your customers due to DNS issues)

0

36

Joseph Lai 🍱 @jtlai

about 2 years ago

PSA: If you have any domains on Google Domain, transfer them out NOW before the Squarespace move causes serious disruption to your service.

1

0

80

Joseph Lai 🍱 @jtlai

over 2 years ago

@chptung same

0

33

Joseph Lai 🍱

@jtlai

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users