oasis @oasisjin - Twitter Profile

oasis @oasisjin

20 days ago

@Alisvolatprop12 로컬 컴퓨팅 용량이 엄청 나네요. 모델 다양하게 안 쓰시면 exo 로 묶어서 pd disaggregation 구성해도 좋겠어요. 용량을 어떻게 활용하고 계신지 궁금하네요.

1

3

0

1K

oasis @oasisjin

about 1 year ago

드디어~!

Charlie Marsh

@charliermarsh

about 1 year ago

The Python Steering Council has voted to remove the "experimental" label from the free-threaded ("nogil") builds for Python 3.14. Big step towards making them the default in a future version of CPython!

19

647

48

40

40K

0

33

oasisjin retweeted

Rohan Paul

@rohanpaul_ai

about 1 year ago

This paper systematically evaluates 14 prompting techniques across 10 Software Engineering tasks using four different LLMs. Methods 🔧: → Performance was measured using task-specific metrics like Accuracy, F1 score, CodeBLEU, and BLEU. → Linguistic features such as Lexical Diversity and Token Count were analyzed for correlation with performance. → Contrastive explanation was used to identify factors contributing to technique effectiveness. ---------------------------- Paper - arxiv. org/abs/2506.05614v1 Paper Title: "Which Prompting Technique Should I Use? An Empirical Investigation of Prompting Techniques for Software Engineering Tasks"

rohanpaul_ai's tweet photo. This paper systematically evaluates 14 prompting techniques across 10 Software Engineering tasks using four different LLMs.

Methods 🔧:

→ Performance was measured using task-specific metrics like Accuracy, F1 score, CodeBLEU, and BLEU.

→ Linguistic features such as Lexical Diversity and Token Count were analyzed for correlation with performance.

→ Contrastive explanation was used to identify factors contributing to technique effectiveness.

----------------------------

Paper - arxiv. org/abs/2506.05614v1

Paper Title: "Which Prompting Technique Should I Use? An Empirical Investigation of Prompting Techniques for Software Engineering Tasks"

6

225

50

303

16K

oasisjin retweeted

Rohan Paul

@rohanpaul_ai

about 1 year ago

Training on wrong answers outpaces training on correct ones. 10 times more learning emerges from plausible errors than from truths. Large language models refine their accuracy slowly when they learn only from correct examples. This paper introduces Likra, which trains one model head on correct answers and another on incorrect ones and uses their likelihood ratio to choose responses. This approach shows that each plausible wrong example can boost accuracy up to 10 times more than each correct example and sharpens the model’s ability to avoid mistakes. ⚙️ The Core Concepts The Likra model trains two separate prediction heads on a foundation model. One head learns from correct question-answer pairs and the other learns from incorrect pairs. At inference it compares their likelihoods for each answer option and selects the answer with the greatest difference. ⚙️ Experimental Results Supervised fine-tuning on correct answers yields a smooth rise from 60% to 66% accuracy as examples increase. Likra shows a sharp jump after only a few hundred negative examples, reaching over 80% accuracy and outperforming the positive-only model by a wide margin. 🔍 Impact of Near-Miss Examples Training the negative head with plausible but wrong answers delivers the largest gains. Random irrelevant answers still help but less dramatically and unrelated text from different tasks offers even smaller benefits. This finding echoes the power of near-miss examples to guide learning. 📈 Shaping Model Confidence The positive head gradually raises the likelihood of correct answers but leaves plausible wrong options relatively high. The negative head strongly lowers the likelihood of incorrect options while treating unrelated text as unlikely. Combining these effects lets the model distinguish correct answers more sharply. ⚖️ Implications Negative examples reveal latent knowledge in the pretrained model and flip a switch that focuses probability mass on factual answers. This suggests that limited but carefully chosen wrong examples can accelerate learning and reduce hallucinations in language models. ---------------------------- Paper - arxiv. org/abs/2503.14391 Paper Title: "How much do LLMs learn from negative examples?"

rohanpaul_ai's tweet photo. Training on wrong answers outpaces training on correct ones.

10 times more learning emerges from plausible errors than from truths.

Large language models refine their accuracy slowly when they learn only from correct examples.

This paper introduces Likra, which trains one model head on correct answers and another on incorrect ones and uses their likelihood ratio to choose responses. This approach shows that each plausible wrong example can boost accuracy up to 10 times more than each correct example and sharpens the model’s ability to avoid mistakes.

⚙️ The Core Concepts

The Likra model trains two separate prediction heads on a foundation model. One head learns from correct question-answer pairs and the other learns from incorrect pairs. At inference it compares their likelihoods for each answer option and selects the answer with the greatest difference.

⚙️ Experimental Results

Supervised fine-tuning on correct answers yields a smooth rise from 60% to 66% accuracy as examples increase. Likra shows a sharp jump after only a few hundred negative examples, reaching over 80% accuracy and outperforming the positive-only model by a wide margin.

🔍 Impact of Near-Miss Examples

Training the negative head with plausible but wrong answers delivers the largest gains. Random irrelevant answers still help but less dramatically and unrelated text from different tasks offers even smaller benefits. This finding echoes the power of near-miss examples to guide learning.

📈 Shaping Model Confidence

The positive head gradually raises the likelihood of correct answers but leaves plausible wrong options relatively high. The negative head strongly lowers the likelihood of incorrect options while treating unrelated text as unlikely. Combining these effects lets the model distinguish correct answers more sharply.

⚖️ Implications

Negative examples reveal latent knowledge in the pretrained model and flip a switch that focuses probability mass on factual answers. This suggests that limited but carefully chosen wrong examples can accelerate learning and reduce hallucinations in language models.

----------------------------

Paper - arxiv. org/abs/2503.14391

Paper Title: "How much do LLMs learn from negative examples?"

5

365

64

336

26K

Who to follow

오늘 하루 뭘 하고 놀아야 뿌듯함에 잘 수 있을까?

oasisjin retweeted

AK

@_akhaliq

about 1 year ago

Hugging Face MCP server just dropped Connect your LLM to Hub APIs directly from Cursor, VSCode, Windsurf and other MCP apps

9

450

79

237

50K

oasisjin retweeted

InfoQ @InfoQ

about 1 year ago

Microsoft plans to #opensource the code behind the GitHub Copilot Chat extension under the MIT license in the coming months. They also aim to integrate core AI features directly into VS Code. 🔎 Find out more: https://t.co/JdSNhC9DB5 #InfoQ #SoftwareDevelopment

InfoQ's tweet photo. Microsoft plans to #opensource the code behind the GitHub Copilot Chat extension under the MIT license in the coming months. They also aim to integrate core AI features directly into VS Code.

🔎 Find out more: https://t.co/JdSNhC9DB5

#InfoQ #SoftwareDevelopment https://t.co/9gCUSjpfou

0

2

4

1K

oasis @oasisjin

almost 5 years ago

브로콜리너마저가 열린 음악회라니! ㅎㅎㅎㅎㅎ

0

oasisjin retweeted

Bryce, the CUDA Colonel

@blelbach

about 5 years ago

NVC++ now supports C++ Standard Parallelism, CUDA C++, OpenACC & OpenMP They're interoperable - even in the same source file Now you can combine libraries that use different parallel frameworks Watch my @NVIDIAGTC talk anytime to learn more. It's free! https://t.co/kooPvgEid4

blelbach's tweet photo. NVC++ now supports C++ Standard Parallelism, CUDA C++, OpenACC & OpenMP

They're interoperable - even in the same source file

Now you can combine libraries that use different parallel frameworks

Watch my @NVIDIAGTC talk anytime to learn more. It's free!

https://t.co/kooPvgEid4 https://t.co/kP15BpwKcK

5

162

45

18

0

oasisjin retweeted

coordination

@unsg1809

over 5 years ago

요코하마 와카바다이. 공포를 주는 타이틀과 달리 좋은 기사. 20년 내 일본의 절반이 사라진다…열도 충격에 빠뜨린 ‘마스다보고서’[서영아의 100세 카페] (출처 : 동아일보 | 네이버 뉴스) https://t.co/4vVhZQy4gN

0

14

13

3

0

oasisjin retweeted

Yongho Choi @yongho1037

almost 6 years ago

[마틴파울러] 소프트웨어 아키텍처의 중요성 (한글자막) https://t.co/OAFE6huQXD

0

61

15

6

0

oasisjin retweeted

foodnjoy @foodnjoy

almost 6 years ago

압구정 갤러리아가 9월부터 시작하는 김집사블랙. 고메이494 입점 식당이나 식품관의 음식을 1시간 이내에 배달. 실시간 채팅으로 음식 세부사항 요청이나 배달중 약국, 세탁소 픽업등의 옵션가능. 따로 교육마친 정직원으로만 운영하며 우선은 반경 1.5km내 아파트 대상.

foodnjoy's tweet photo. 압구정 갤러리아가 9월부터 시작하는 김집사블랙. 고메이494 입점 식당이나 식품관의 음식을 1시간 이내에 배달. 실시간 채팅으로 음식 세부사항 요청이나 배달중 약국, 세탁소 픽업등의 옵션가능. 따로 교육마친 정직원으로만 운영하며 우선은 반경 1.5km내 아파트 대상. https://t.co/eA7Iq12PB7

5

243

491

10

0

oasisjin retweeted

Economic View @EconomicView

almost 6 years ago

‘2019년 연간 온라인쇼핑 동향’에 따르면 지난해 배달음식 주문 등 음식 서비스 거래액은 9조7365억원으로 전년 대비 84.6% 급증했다. 공정거래위원회는 2019년 국내 배달음식 시장 규모를 이보다 2배 이상 큰 20조원 규모로 추정한다 https://t.co/WOMvoGfFmg 오프라인의 시대가 저물고 있는 시장상황

1

11

9

0

oasisjin retweeted

햅쌀( ꈍᴗꈍ)🌾 @goodssalhapssal

about 6 years ago

체리 클라푸티 필링 🍒 (라고 적었지만 바나나에도 굿) 박력분 70g, 체리 20-25알(씨빼고 쪼개기, 프랑스 엄마들은 통째로 넣어요), 달걀2, 설탕 반 컵, 바닐라 익스트렉, 소금 약간, 우유 200ml, 크림 40ml. * 바나나/머핀컵 기준 160도 15분 - 은박지 덮어주고 5-10분 추가로 굽기

1

8

4

2

0

oasisjin retweeted