You can now enable Claude to use your computer to complete tasks.
It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk.
Research preview in Claude Cowork and Claude Code, macOS only.
Thanks to 3D Gaussian splatting, you can now publish amazing interactive experiences to the web.
🎦 Orbit, fly and first person cameras
ℹ️ Informational hotspots
🪄 Full screen effects
🌤️ Custom skyboxes
Rendered by @PlayCanvas. Served up on SuperSplat.
Free and open source!
RAG was never the end goal.
Memory in AI agents is where everything is heading. Let me break down this evolution in the simplest way possible.
RAG (2020-2023):
- Retrieve info once, generate response
- No decision-making, just fetch and answer
- Problem: Often retrieves irrelevant context
Agentic RAG:
- Agent decides *if* retrieval is needed
- Agent picks *which* source to query
- Agent validates *if* results are useful
- Problem: Still read-only, can't learn from interactions
AI Memory:
- Read AND write to external knowledge
- Learns from past conversations
- Remembers user preferences, past context
- Enables true personalization
The mental model is simple:
↳ RAG: read-only, one-shot
↳ Agentic RAG: read-only via tool calls
↳ Agent Memory: read-write via tool calls
Here's what makes agent memory powerful:
The agent can now "remember" things - user preferences, past conversations, important dates. All stored and retrievable for future interactions.
This unlocks something bigger: continual learning.
Instead of being frozen at training time, agents can now accumulate knowledge from every interaction. They improve over time without retraining.
Memory is the bridge between static models and truly adaptive AI systems.
But it's not all smooth sailing.
Memory introduces new challenges RAG never had, like memory corruption, deciding what to forget, and managing multiple memory types (procedural, episodic, and semantic).
Solving these problems from scratch is hard. If you want to build Agents that never forget, Cognee is an open-source framework (12k+ stars) to build real-time knowledge graphs and get self-evolving AI memory.
Getting started with Cognee is as simple as this:
𝗮𝘄𝗮𝗶𝘁 𝗰𝗼𝗴𝗻𝗲𝗲[.]𝗮𝗱𝗱("𝗬𝗼𝘂𝗿 𝗱𝗮𝘁𝗮 𝗵𝗲𝗿𝗲")
𝗮𝘄𝗮𝗶𝘁 𝗰𝗼𝗴𝗻𝗲𝗲[.]𝗰𝗼𝗴𝗻𝗶𝗳𝘆()
𝗮𝘄𝗮𝗶𝘁 𝗰𝗼𝗴𝗻𝗲𝗲[.]𝗺𝗲𝗺𝗶𝗳𝘆()
𝗮𝘄𝗮𝗶𝘁 𝗰𝗼𝗴𝗻𝗲𝗲[.]𝘀𝗲𝗮𝗿𝗰𝗵("𝗬𝗼𝘂𝗿 𝗾𝘂𝗲𝗿𝘆 𝗵𝗲𝗿𝗲")
That’s it. Cognee handles the heavy lifting, and your agent gets a memory layer that actually learns over time.
I have shared the repo in the replies!
Today we're releasing two embedding model families, pplx-embed-v1 and pplx-embed-context-v1.
These SOTA embedding APIs are designed specifically for real-world, web-scale retrieval.
https://t.co/fUUasIGhYX
We’re excited to introduce Doc-to-LoRA and Text-to-LoRA, two related research exploring how to make LLM customization faster and more accessible.
https://t.co/ApVzVsBuv1
By training a Hypernetwork to generate LoRA adapters on the fly, these methods allow models to instantly internalize new information or adapt to new tasks.
Biological systems naturally rely on two key cognitive abilities: durable long-term memory to store facts, and rapid adaptation to handle new tasks given limited sensory cues. While modern LLMs are highly capable, they still lack this flexibility. Traditionally, adding long-term memory or adapting an LLM to a specific downstream task requires an expensive and time-consuming model update, such as fine-tuning or context distillation, or relies on memory-intensive long prompts.
To bypass these limitations, our work focuses on the concept of cost amortization. We pay the meta-training cost once to train a hypernetwork capable of producing tasks or document specific LoRAs on demand. This turns what used to be a heavy engineering pipeline into a single, inexpensive forward pass. Instead of performing per-task optimization, the hypernetwork meta-learns update rules to instantly modify an LLM given a new task description or a long document.
In our experiments, Text-to-LoRA successfully specializes models to unseen tasks using just a natural language description. Building on this, Doc-to-LoRA is able to internalize factual documents. On a needle-in-a-haystack task, Doc-to-LoRA achieves near-perfect accuracy on instances five times longer than the base model's context window. It can even generalize to transfer visual information from a vision-language model into a text-only LLM, allowing it to classify images purely through internalized weights.
Importantly, both methods run with sub-second latency, enabling rapid experimentation while avoiding the overhead of traditional model updates. This approach is a step towards lowering the technical barriers of model customization, allowing end-users to specialize foundation models via simple text inputs. We have released our code and papers for the community to explore.
Doc-to-LoRA
Paper: https://t.co/87xEEpf0GN
Code: https://t.co/zBfQi2L9LW
Text-to-LoRA
Paper: https://t.co/emLRZ4Vdvo
Code: https://t.co/b9mrdoWWRB