We’ll never build #AI that truly learns until we break out of the old software dev life cycle + realize that #ML models demand a brand new process that understands the highly iterative nature of #DataScience.
We call it the Machine Learning Loop:
https://t.co/ILXngBabU3
Why?
Because the web is a mess. And websites are fundamentally built for human consumption.
If you want to automate everything that a human can do with a browser, then you have to perceive like a human.
Stay tuned for more in the coming weeks!
Wonderfully written by @0xjasper in collaboration with the Yutori team.
https://t.co/fwWYvPNuU2
@tuhinone It's so exciting to see custom models built for real world workloads. Cursor has the data, and it's amazing to see it put to use in a product we use everyday.
Today, we’re excited to announce our $150M Series D, led by BOND, with Jay Simons joining our Board. We’re also thrilled to welcome Conviction and CapitalG to the round, alongside support from 01 Advisors, IVP, Spark Capital, Greylock Partners, Scribble Ventures, and Premji Invest.
The last eighteen months have been a whirlwind; as the AI application layer has taken off, we've been proud to play a small part supporting world class companies run their production workloads. Thanks to all our customers including Abridge, Bland, Clay, Gamma, Mirage, OpenEvidence, Sourcegraph, WRITER, and Zed Industries.
We’re just getting started. If you’re building the next generation of AI products, we’d love to work with you.
i wrote #beginner level book teaching #deeplearning
its goal is to be the easiest intro possible
each lesson builds a neural component *from scratch* in #NumPy
each *from scratch* toy code example is in the #Github below #100DaysOfMLCode
https://t.co/3KEfLNjvj5
OpenAI launched GPT-4 Turbo today. It’s incredible:
- 128k token context window. 300 pages of a book 🚀
- More control. GPT-4 will respond with valid JSON, and can call multiple functions
- Better knowledge. THE API NOW COMES WITH RETRIEVAL built in. And knowledge cutoff is now April 2023.
- Multi-modal: DALLE, text to speech, and vision are in the API.
#Adala has sparked my curiosity as a great application for LLM agents! 🚀 Human focus on gold standard data curation, agents adapt to label this data automatically.
Check it out: https://t.co/YW3uLP8Y1V
#LLM#DataLabeling#MachineLearning
I collaborated with the @LabelStudioHQ folks to do a quick and simple example for fine-tuning LLM applications - in example, creating a tunable chat bot for your GitHub repo. Check out the blog and the code!
https://t.co/QkW7IHl2g5
No More GIL!
the Python team has officially accepted the proposal.
Congrats @colesbury on his multi-year brilliant effort to remove the GIL, and a heartfelt thanks to the Python Steering Council and Core team for a thoughtful plan to make this a reality.
https://t.co/58QK2yctRD
Walmart used an LLM (PaLM-2) generated embeddings for their DLRM to make product search better on https://t.co/dXWRG9d6Qa
They are about to push this out into their live website because the results were so good.
Noteworthy other firms have already done this such as Google
@AICoffeeBreak is so good at providing context around ML. I absolutely love this quick video on things to keep in mind when dealing with #LLMs such as #ChatGPT
https://t.co/Kr83CYSITP
For a novel (and potentially cost-effective) approach to data labeling, @JimmyMWhitaker proposes bootstrapping the process with the aid of GPT-4. https://t.co/4qVNk0huei
🚀 Had a blast collaborating with the @LabelStudioHQ team on data labeling for fine-tuning #LLM.
We're using the @carperai trlX library, and we've got an interactive Colab Notebook ready for you to fine-tune your own RL model.🔍👩💻
https://t.co/g7oeUH8cZi
In one week: Join us for our May Open Office Hours!
@JimmyMWhitaker, Chief Scientist at HPE and Pachyderm, will be there!
sign up here: https://t.co/pe4QLPomWN
#RLHF is essential to ensure #LLMs are safe and accurate, but you may have wondered how to incorporate it into your workflow.
@JimmyMWhitaker is here to help with practical advice on How to Create a High-Quality RLHF Dataset with Label Studio!
https://t.co/M9Ky6Huqrl