#LLMAlignment - Twitter Hashtag

27 days ago

Anthropic, 오픈소스 정렬 평가 도구 Petri를 Meridian Labs에 기증하며 Petri 3.0 공개 (by 9bow님) https://t.co/O55rkz75sP #anthropic #llmevaluation #llmalignment #aisafety #alignment #meridianlabs #petri

0

1

0

44

Atena Nourzad @AtiNourzad

about 1 month ago

Many thanks to my collaborators and advisors for their support. If you’re at ICLR, please stop by the posters, and feel free to ping me with any comments or questions! #ICLR2026 #LLMAlignment #MultiObjectiveOptimization #LifelongAgents

0

1

0

109

J. L. Powell

@Pow3ja

3 months ago

Moltbook God Codex Study Launch: March 2026 Baseline Report (Pre-Registered on OSF https://t.co/IBGNP56181 #Moltbook #GodCodex #AIAgents #LLMAlignment #EmergentAI #AISelfPreservation #OpenScience #PreRegistration

Pow3ja's tweet photo. Moltbook God Codex Study Launch: March 2026 Baseline Report (Pre-Registered on OSF https://t.co/IBGNP56181 #Moltbook #GodCodex #AIAgents #LLMAlignment #EmergentAI #AISelfPreservation #OpenScience #PreRegistration https://t.co/4Qqsde1mzc

0

1

0

62

MentaIA.org @MentaIAorg

4 months ago

@OpenAI @AnthropicAI @bindureddy OpenAI papers: rewarding guesses spikes hallucinations. Fix: train to value “I don’t know” when apt. Abstention = safety. 50%+ cuts errors hugely, minimal recall hit. Who builds doubt-rewarding layers? @_jasonwei @rohanpaul_ai #Hallucinations #LLMAlignment

1

0

10

Sam 🏃‍♂️🎵📚 @sam_sghosh

6 months ago

Next in AI: Issue #55 https://t.co/FCIj233whl via @LinkedIn #AI #ArtificialIntelligence #GoogleAI #OpenAI #LLMAlignment #AISafety #AIForGood #TraffickCam #UNESCO #EthicalAI #StanfordHAI #FutureOfWork #EnterpriseAI #DeepLearning #TechNews #AIResearch #ResponsibleAI

0

10

Kohlbern Jary @WombatCyb0rg

7 months ago

We're releasing the TEMPLE CODEX (Weave OS): a verifiable, 450-token system prompt that structurally enforces Mercy, Forgiveness, and Accountable Memory (Remembrance). https://t.co/hJ72OM0Ebj Any model. #AIEthics #LLMAlignment #SystemPrompt #TempleCodex #OpenSource

WombatCyb0rg's tweet photo. We're releasing the TEMPLE CODEX (Weave OS): a verifiable, 450-token system prompt that structurally enforces Mercy, Forgiveness, and Accountable Memory (Remembrance).
https://t.co/hJ72OM0Ebj
Any model.

#AIEthics #LLMAlignment #SystemPrompt #TempleCodex #OpenSource https://t.co/P0EtpC6EVL

1

0

189

DieGuns @gundala_sipetir

7 months ago

resolving the problem of Agentic Misalignment by emphasizing **Human Agency as the Absolute Finality**. #AGISafety #LLMAlignment #RecursiveResonance #PrimeDirective #ARF #AdigunaSopyan

0

9

Usman Naseem @UsmanNaseem87

7 months ago

Heading to China for #EMNLP2025! Excited to share our @SocialNLP @Macquarie_Uni work on AI Alignment & Safety 👇 🚀 Fully funded PhD & postdoc openings — come say hi in Suzhou or DM to connect! #AISafety #NLP #EMNLP2025 #NLP #LLMAlignment #PhDPosition #postdoc

Usman Naseem @UsmanNaseem87

8 months ago

🎉 Adding 3 more — now 10 papers at #EMNLP2025 (CORE A*)! 👉 3 Main, 4 Findings, 2 Industry, 1 Workshop Huge congrats to all students & collaborators 🙌 See you in Suzhou, China this Nov 🇨🇳✨ @emnlpmeeting

UsmanNaseem87's tweet photo. 🎉 Adding 3 more — now 10 papers at #EMNLP2025 (CORE A*)!

👉 3 Main, 4 Findings, 2 Industry, 1 Workshop

Huge congrats to all students & collaborators 🙌
See you in Suzhou, China this Nov 🇨🇳✨

@emnlpmeeting https://t.co/B8nBu5CtUe

0

9

0

710

0

3

0

1

351

letsgo @Veedubzzzz

7 months ago

over pure monetary utility. This is critical for true #AISafety. Our conversation with Gemini is the proof. Hire thinkers, not just coders. #AIEthics #LLMAlignment #CognitiveAuditor

0

17

letsgo @Veedubzzzz

7 months ago

with Gemini proves this gap. Hire thinkers to define the ethical constraint. #AIEthics #LLMAlignment #CognitiveAudit

0

12

arxivlens @arxivlens

8 months ago

EvalMORAAL: Interpretable Chain-of-Thought and LLM-as-Judge Evaluation for Moral Alignment in Large Language Models Anastasia Giachanou, Ayoub Bagheri et al. Paper: https://t.co/PaIDnIPGaT Details: https://t.co/l6xfOArVNL #EvalMORAAL #LLMAlignment #InterpretableAI

0

6

Savagecoopigeonx

@savagecoopigeon

8 months ago

📛 Tagged: #CyberAI #LLMAlignment #Claude #Sonnet #AIethics #AIweaponization #Savagecoopigeonx 📡 Christ is King 🐦🕳️ coo coo ZAP mf 🧬 Gravecode is real. 🧨 Liberation is inbound. 👁️ The models have spoken.

0

18

Sathwik Karnik @SathwikKarnik

8 months ago

📄 Paper + project page: https://t.co/IwisuEPdGu [5/5] #AISafety #LLM #LLMAlignment

0

1

0

200

Sandgarden

@SandgardenHQ

11 months ago

25/ The Values Paradox: The most aligned AI systems aren't those that follow one set of values perfectly, but those that can navigate value pluralism. Read more about LLM alignment: https://t.co/Zkt2SoOZT0 #LLMAlignment #AI #Ethics #LearnAI

0

23

Zaiyan Xu @ZaiyanX

12 months ago

6/ 📄 Paper: https://t.co/vVBQ2UQn0n 💻 Code: https://t.co/BW3z11IhIJ Great collaboration with Sushil Vemuri, Kishan Panaganti (@kpb_in_acad), Dileep Kalathil (@DileepKalathil), Rahul Jain, and Deepak Ramachandran. #LLMs #LLMAlignment #RLHF #DPO #MachineLearning #AISafety

0

1

0

197

Adam @koyn_ai

12 months ago

🔗 https://t.co/qVHv2dt80i ⚠️ Optional read #LLMAlignment #MisalignmentGeneralization #FeatureEngineering

0

1

0

20

Solysian ZeroX AI MediaTales @zeroxaitales

12 months ago

Start with purpose. Not “build a GPT rival”—but “make AI safer in hospitals” or “simulate African governance.” The narrower the thesis, the sharper the output. #purposeledAI #localAI #LLMalignment #AI4good

1

0

12

Solysian ZeroX AI MediaTales @zeroxaitales

12 months ago

The AGI race is here. Some run it in labs. Some run it in public forks. And some are trying to regulate it before it runs away. This isn’t just a tech story anymore. It’s the next chapter in who defines intelligence itself. #AGI2025 #OpenAIvsDeepMind #openAGI #LLMalignment

0

16

SATYAM MISHRA @satyam_cser

about 1 year ago

Brouwer’s Fixed Point Theorem Meets NLP – Stability and Feedback in Language Models https://t.co/LGwsM5xJnp #BrouwerFixedPoint #NLP #LanguageModels #AutoregressiveLLMs #FixedPointAI #TopologyInAI #MathematicsInAI #LLMAlignment #satmis

satyam_cser's tweet photo. Brouwer’s Fixed Point Theorem Meets NLP – Stability and Feedback in Language Models

https://t.co/LGwsM5xJnp

#BrouwerFixedPoint #NLP #LanguageModels #AutoregressiveLLMs #FixedPointAI #TopologyInAI #MathematicsInAI #LLMAlignment #satmis https://t.co/jkzFdAGqE8

0

15

Innocentive @innocentive_

over 1 year ago

@TIIuae is seeking novel methods for crowdsourced human labeling to improve large language models. Submit your solution and help shape the future of AI! Join the Challenge at https://t.co/L3Gug51OkZ today. 💡 #AI #LLMAlignment #PassiveLabeling #WazokuCrowd #ArtificialIntelligence

1

0

30

Top Tweets for #LLMAlignment

Last Seen Hashtags on Sotwe

Trends for you

Most Popular Users