@iamreddave@JFPuget 👋When working on RAG I tend to think of the system as three different sub-systems, and work on evaluating each part in turn: 1) the pipeline into your RAG's knowledge base, 2) the 'retrieval' component (which is the searches) 3) the generation. I wrote: https://t.co/Nbk35Q0L4a
At @TryRevaAI we just published a deep evaluation of OpenAI and Anthropic for @intercom's customer support use case, to see the impact for their customers of their move to Claude https://t.co/1ufkkWa3E2
Fascinating to see how tech leaders are thinking about generative AI, without factoring in model quality. Surely how well the model can do on your task should be a primary concern? Still, this is a good read on judging vendors for common issues https://t.co/Glk5I4hizj
The Evolving Landscape of LLM Evaluation
Navigating Evaluation Pitfalls
Thoughts on memorization and overfitting in LLM evals, a shift to 'vibe-based' evals, and how the future of LLM evaluation may look like.
https://t.co/wbMIx41300
Ireland is seeking legislation to allow the use of facial recognition technology (FRT) for policing, and there currently exist little critical discussion/awareness on its dangers.
I'm hoping my piece, fresh out of press, contributes. Pls share widely
https://t.co/OPJYSuOaBV
If you're an early stage founder working on your first round of funding and are struggling with how to tell your story to people who could invest, then I wrote this for you.
https://t.co/Ko7GwNgkyP
#startups#fundraising
@NoChorus The Destruction of The Temple by Barry N Malzberg, which just happens to be the book I randomly picked to start reading two days ago from the massive pile I bought in Hay-on-Wye (it’s good so far)
@DetectiveKen Horus heresy is very much a “pick which ones you like based on the armies you like” series, and great fun. I’ll never read all 50. Recommend any ones covering Chaos marine armies, especially the thousand Sons