Code_G

Install in Paperclip: Settings → Adapters → Install Adapter paperclip-adapter-custom-llm-local Then configure: { "adapterType": "custom_llm_local", "adapterConfig": { "model": "your-model-id", "baseUrl": "http://127.0.0.1:8317/v1", "transport": "openai_chat_completions", "apiKeyEnv": "LOCAL_LLM_API_KEY" } } Repo: https://t.co/8xMzLpxdF4

Takip edebileceğin hesaplar

여운

@kskook

당신의 마음 속에 무언가를 남기고 싶은 사람. 나를 찾고 있음. 순간의 생각을 기록하고 있음. 자기생각을 가진 사람을 좋아함.

echian kim

@jinpoblues

서로를 배려하는 상식적인 세상을 꿈꾼다. 내것이 아닌것은 탐내지않으며 너무 많은것을 가지려고 욕심부리지 않는다. 어떠한 종류의 폭력과 억압,간섭을 거부하�� 히피정신에 적극 공감한다. 토착왜구들과 그에 동조하는 족속들은 제발 팔로 금지

yaklaşık 2 ay önce

I made a tiny Paperclip adapter for custom/local LLM endpoints. If you already have an OpenAI-compatible proxy, OpenRouter gateway, Qwen/GLM setup, local server, or anything speaking HTTP, your Paperclip agents can call it directly. Built for the boring but useful problem: fewer setup layers, more model freedom. #paperclip

Code_G

@code_g

yaklaşık 2 ay önce

4.19 기념식하는데 '국민의례'가 말이 되나. 저건 안바뀌네.

Code_G

@code_g

2 ay önce

예보 상으론 오늘 최고 기온은 어제보다 낮을 것 같더니 아침 햇살�� 더 쌔다. 오늘 엄청 더울듯.

Code_G

@code_g

2 ay önce

한국이랑 똑같네

El Programador Senior

@5eniorDeveloper

2 ay önce

Así es trabajar en bancos como programador: 1. Te contratan. 2. Te entregan el equipo. 3. Levantas tickets para acceso a repos, herramientas, jira, etc. 4. Pasa un mes y ya tienes los accesos. 5. Te faltó pedir un permiso y debes esperar otros 20 días. 6. Dos meses después ya puedes empezar a desarrollar. 7. Te asignan la fecha de release. 8. Terminas el desarrollo y pruebas. 9. Creas la documentación necesaria para solicitar el release. 10. Creas los tickets para solicitar el release. 11. Debes conseguir los approvals de los tickets al menos 3 días antes de la fecha de release. 12. Te autorizan el release. 13. Falla el release, tienes que hacer rollback. 14. Debes esperar hasta la siguiente fecha de release. 15. Inicias el desarrollo del nuevo sprint. 16. Vuelve al paso 8. Después les explico como se hacen los hotfix!

204

401

896K

Code_G

@code_g

2 ay önce

지능이 없는 것에 멍청하니 똑똑하니 하는 것도 어폐가 있다..

Code_G

@code_g

2 ay önce

애초에 그렇게 사용할 수 있도록 '설계'된 구독 플랜들 아니었나? 이제와서 죽는소리 내는 것은 그저 자신들의 자랑스런 'AI'가 얼마나 멍청한지 증명하는 것일뿐.

Poe Zhao

@poezhao0605

2 ay önce

One Claude Max subscriber generated $5,600 in API costs on a $100/month plan. A 56x subsidy ratio. Anthropic is now moving Enterprise to usage-based pricing, The Information reports. The flat-rate era for AI is ending. In China, the same collision is playing out at a fraction of the price:https://t.co/APNawmeZBW

$poezhao0605's tweet photo. One Claude Max subscriber generated $5,600 in API costs on a $100/month plan. A 56x subsidy ratio. Anthropic is now moving Enterprise to usage-based pricing, The Information reports. The flat-rate era for AI is ending. In China, the same collision is playing out at a fraction of the price:https://t.co/APNawmeZBW$

466

171

99K

116

Code_G

@code_g

2 ay önce

그러나 한 편으로는 나도 마케팅에 기반해 말해야 되지 않나 그런 생각을 한다. 떼 돈 벌고 싶다...

Code_G

@code_g

2 ay önce

샘 올트먼이나 다리오 아모데이가 말하는 건 기본적으로 마케팅이다. 그들의 말은 들리는대로 받아들여선 안된다. 심심이(LLM)들은 인류의 삶을 본질적으로 바꿀 수 없다. AGI든 초지능이든 그 무엇이 나온다해도 본질적으로 변하는 것은 아무것도 없다. 감히 산업혁명에 비할 데가 아니다. 그리고 그 산��혁명조차도 인류를 "더 일하게" 만들었다. 심심이들도 인간의 일을 늘렸으면 늘렸지 줄일 순 없다. 고용이 줄어드는건 순전히 자본의 농간일뿐. https://t.co/utC8aht2tH

133

Code_G

@code_g

2 ay önce

LLM Provider들은 자기들 현금을 위해 어떻게 하면 제공 모델에 제한을 걸까 고민할 게 아니라, 이런 사태를 막을 가드레일부터 탑재해야한다.

GeekNews

@GeekNewsHada

2 ay önce

바이브 코딩으로 만든 환자 관리 앱의 보안 참사 - 의료기관 직원이 AI 코딩 에이전트로 환자 관리 시스템을 직접 제작하며, 환자 데이터가 인터넷에 암호화 없이 노출됨 - 진료 대화 녹음이 두 개의 AI 서비스로 ��송되어 자동 요약되었고, 모든 데이터에 읽기·쓰기 권… https://t.co/Gq9HAhEFfn

16K

Code_G

@code_g

2 ay önce

감기 때문에 힘든데 날은 덥고. 참 애매하다.

Code_G

@code_g

2 ay önce

I just got @code_g from the X Handle Marketplace! Get your own at https://t.co/982S52Nbl4, but fuck off Elon, I hate you. You ruined twitter.

Code_G

@code_g

2 ay önce

와! 내 원래 계정명 슈��했다! 15년만에 되찾은 원래 이름.

Code_G

@code_g

2 ay önce

와... 파딱 이거 사악한 기능 많네. 내 원래 계정 슈킹해올 수 있을지도 모르겠음.

Code_G

@code_g

2 ay önce

내가 만드는게 그리드맨이었다는 생각을 하니 기분이 좋아짐. https://t.co/3xy4bgnugI

Code_G

@code_g

2 ay önce

내가 LLM을 AI라고 부르는 걸 꺼리는 이유다. 심심이라니까.

How To AI

@HowToAI_

2 ay önce

🚨 Stanford just published the most uncomfortable AI paper of the year. They just dropped a systematic teardown of how large language models actually "think." It proves that passing a benchmark has almost nothing to do with real reasoning. We have spent years optimizing for tests. But the researchers found that performance does not transfer nearly as well as the leaderboards imply. A model that looks incredibly strong on a math benchmark will quietly fall apart when asked to do scientific reasoning, planning, or multi-step decision-making. They call these "application-specific failures." The AI didn't learn how to think. It learned how to pass the test it was trained on. The paper outlines the paths forward: inference-time scaling, analogical memory, and external verification. But they are blunt. There are no silver bullets yet. We need to stop evaluating models based on how often they succeed on static tests, and start injecting known failure cases to see when they break. Because right now, we are building an entire industry on an illusion. We are deploying systems that pass benchmarks, but fail reality.

HowToAI_'s tweet photo. 🚨 Stanford just published the most uncomfortable AI paper of the year.

They just dropped a systematic teardown of how large language models actually "think."

It proves that passing a benchmark has almost nothing to do with real reasoning.

We have spent years optimizing for tests.

But the researchers found that performance does not transfer nearly as well as the leaderboards imply.

A model that looks incredibly strong on a math benchmark will quietly fall apart when asked to do scientific reasoning, planning, or multi-step decision-making.

They call these "application-specific failures."

The AI didn't learn how to think. It learned how to pass the test it was trained on.

The paper outlines the paths forward: inference-time scaling, analogical memory, and external verification.

But they are blunt. There are no silver bullets yet.

We need to stop evaluating models based on how often they succeed on static tests, and start injecting known failure cases to see when they break.

Because right now, we are building an entire industry on an illusion.

We are deploying systems that pass benchmarks, but fail reality.

574

166

431

40K

112

code_g retweetledi

Charlie Hills

@charliejhills

2 ay önce

Someone finally put numbers to what developers have been feeling for weeks. The answer: 67%. An AMD Director of AI analyzed 6,852 Claude Code sessions spanning nearly three months. Not vibes. Not Reddit complaints. Hard session data. Here’s what the logs show: → Thinking depth collapsed 67% by late February before Anthropic hid the reasoning process from users. → Code reads per edit dropped from 6.6 to 2.0 Claude stopped researching before touching your files. → A stop-hook script catching lazy behavior fired 173 times after March 8. It fired zero times before. → API costs ballooned 80x because shallow thinking caused constant wrong outputs, interruptions, and retries. Anthropic said nothing until the numbers went public. Then Boris Cherny showed up on the GitHub issue and pointed to a default “thinking effort” setting, quietly lowered to “medium” on March 3 described internally as the sweet spot between intelligence, latency, and cost. The team tried every effort-flag combination. Still broken. AMD has since switched to a competing provider. Users are calling it AI shrinkflation same price, meaningfully less reasoning. Fourteen product releases shipped alongside five outages in March alone. The parting line from the issue says it all: “Six months ago, Claude was in a league of its own. Anthropic is no longer the sole player in the capability tier Opus previously occupied.”

charliejhills's tweet photo. Someone finally put numbers to what developers have been feeling for weeks. The answer: 67%.

An AMD Director of AI analyzed 6,852 Claude Code sessions spanning nearly three months. Not vibes. Not Reddit complaints. Hard session data.

Here’s what the logs show:

→ Thinking depth collapsed 67% by late February before Anthropic hid the reasoning process from users.
→ Code reads per edit dropped from 6.6 to 2.0 Claude stopped researching before touching your files.
→ A stop-hook script catching lazy behavior fired 173 times after March 8. It fired zero times before.
→ API costs ballooned 80x because shallow thinking caused constant wrong outputs, interruptions, and retries.

Anthropic said nothing until the numbers went public.

Then Boris Cherny showed up on the GitHub issue and pointed to a default “thinking effort” setting, quietly lowered to “medium” on March 3 described internally as the sweet spot between intelligence, latency, and cost.

The team tried every effort-flag combination. Still broken.

AMD has since switched to a competing provider.

Users are calling it AI shrinkflation same price, meaningfully less reasoning. Fourteen product releases shipped alongside five outages in March alone.

The parting line from the issue says it all:

“Six months ago, Claude was in a league of its own. Anthropic is no longer the sole player in the capability tier Opus previously occupied.”

784

113

252

57K

Code_G

@code_g

Takip edebileceğin hesaplar

Sotwe'de En Son Ziyaret Edilenler

Senin İçin Trendler

En Popüler Kullanıcılar