@tobi Who knew early singularity could be this fun? :)
I just confirmed that the improvements autoresearch found over the last 2 days of (~650) experiments on depth 12 model transfer well to depth 24 so nanochat is about to get a new leaderboard entry for “time to GPT-2” too. Works 🤷♂️
GPT-5.4 set a new record on FrontierMath, our benchmark of extremely challenging math problems! We had pre-release access to evaluate the model. On Tiers 1–3, GPT-5.4 Pro scored 50%. On Tier 4 it scored 38%.
See thread for commentary and additional experiments.
After 7 years of building @Crossing_Minds , our team is joining @OpenAI.
We’ve poured everything into building better retrieval, personalization, and real-time AI.
Now we get to bring that work to a mission we deeply believe in.
Let’s build what’s next — together.
🧠⚡️
#AGI
Do you clerb? ICLERB!
Introducing the In-Context Learning Embedding and Reranker Benchmark (#ICLERB)! It evaluates retrieval models for #LLM in-context learning, based on downstream task performance, not text similarity.
📖: https://t.co/ay2gkWe1Ix
🏆: https://t.co/o6u7t69nCR
ICLERB: In-Context Learning Embedding and Reranker Benchmark
Introduces an evaluation framework for assessing retrieval models based on their ability to enhance LLM accuracy in in-context learning tasks.
📝https://t.co/bH8w6JVLNj
Your mood is a reflection of your daily habits
Your relationships are a reflection of your daily habits
Your mindset is a reflection of your daily habits
Your health is a reflection of your daily habits
Your future will be a reflection of your daily habits
RAG Does Not Work for Enterprises
Explores the challenges and requirements for implementing RAG in enterprises proposing potential solutions like semantic search and hybrid queries, and an evaluation framework to validate enterprise-grade RAG solutions
📝https://t.co/mLEspZEYPE
My brain automatically disconnect the moment I read “delve” on anybody’s post
Not sure if it is the good reflex since the form doesn’t always match the content, but it’s an interesting allergic reaction reflecting surely some Chat GPT abuse from all of us sharing “original” thoughts on social media …
Introducing Gemma - a family of lightweight, state-of-the-art open models for their class, built from the same research & technology used to create the Gemini models.
Blog post:
https://t.co/JEhKUsLzXI
Tech report:
https://t.co/MCzyojmDV4
This thread explores some of the performance characteristics of these models.
I'm very proud to announce that our paper on using pre-trained image-to-text models for eCommerce has been accepted to the ISIR-eCom Workshop at #WSDM2024! Congrats to @jasonjytang@marieiag@Crossing_Minds! Preprint: https://t.co/w2S54oTuw6 Workshop: https://t.co/uhxJrTW13z
Seeing some qs on what Gemini *is* (beyond the zodiac :). Best way to understand Gemini’s underlying amazing capabilities is to see them in action, take a look ⬇️
To all developers and customers building on @OpenAI, we are here for you. ❤️
Our commitment to serve you remains unwavering, and we are continuing to prioritize stability and security of our systems.
Always have been a huge believer in small incremental adjustments to be the foundations of great changes
One habit at a time
“How to Create a Good Habit The 1st law (Cue): Make it obvious. The 2nd law (Craving): Make it attractive. The 3rd law (Response): Make it easy. The 4th law (Reward): Make it satisfying”.
Atomic Habits by @JamesClear
🚀 Exciting News from the @Crossing_Minds Team 🚀
Last week marked a significant milestone for our team at Crossing Minds. I am thrilled to announce that after months of relentless dedication and rigorous evaluation, we have received the "Shopify Plus App Certification."
Achieving this certification is not just a badge of honor for us; it signifies the validation of our commitment to offering unparalleled recommendation solutions for e-commerce platforms.
At Crossing Minds, our guiding principle has always been to personalize the entire web. We believe every customer is unique, and their shopping experience should mirror this individuality.
The @ShopifyPlus Certification stands as a testament to the compatibility, security, and performance of our app on one of the world's leading e-commerce platforms.
A massive thank you to our team for their ceaseless passion and the countless hours they've put into making this possible.
I also would like to thank directly people at Shopify who have been more than supportive, thank you to @JordanaFuller, Brian Peters and Acca Yeung and the @ShopifyEng and partner team.
And to our partners and clients, thank you for your trust and collaboration. Together, we'll continue to make online shopping experiences more intuitive, engaging, and, most importantly, personal.
#CrossingMindsUpdate #ShopifyPlusCertified #Ecommerce #PersonalizationMatters #DigitalTransformation #EcommerceEvolution #RetailTech #TechMilestone #BusinessGrowth