Today, we're launching Village to let investors orchestrate teams of agents to scale their judgment.
Three years ago, we bet that both humans and agents would need new tools to fulfill the promise of LLMs to transform research. Village is that tool, the first IDE for research built from the ground up so your agents can answer bigger questions, learn from your feedback, and build the AI research infrastructure for your firm.
We're starting with public equity research, enabled by our EDGAR and earnings call transcript datasets, and we'd love to hear what use-cases you'd like to see next.
The feeling of manipulating huge amounts of information and collaborating with hundreds of Assistants is really special, and I'm excited for you to experience it. Our beta sign up link is below!
making our @conviction embed submission public to share some long-overdue writing on:
- why vertical AI products haven’t hit widespread adoption among investors
- our differentiated product vision
- why we’ll win
link below. lmk if you love it or hate it
@random_walker@simonw That's incorrect. There's a 32k tokens/min limit as well as a hard cap on the number of output tokens. See https://t.co/Z7XKldI0Ie
@jerryjliu0@disiok Great guide on fine-tuning! The evals could use some work tho:
- BGE works best with instructions, but these aren't prepended.
- Half the texts exceed BGE's 512 token limit and are silently truncated.
After adding both, vanilla bge-small-en's hit rate is 0.87, on par with Ada.
We got 🥈 at the @AnthropicAI hackathon by impersonating our favorite anons and judges -
but we're really excited about the idea of using LLMs to Simulate Everything! So, we created Twitter (we can legally call it that now @elonmusk ).
1/n
(❤️ @kvvnhu @taehyoungjo @thetejmahal)
gpt-3.5-turbo is criminally underrated at coding
When using it with Azure's completion endpoint instead of OpenAI's chat endpoint, you can get a jump in HumanEval performance from <50% to 74%!
This blows claude v1.3 out of the water, which sits just below 60% perf. [1]
"—Drastically increases the risk of flameout when the private markets correct." You got that from a Bill Gurley interview, right? Do you have any thoughts of your own or were you just gonna plagiarize his whole argument about startup overvaluation and the coming correction?
Of course that's your contention. You just launched your first company after reading The Lean Startup—you're convinced that constant iteration and pivoting based on customer feedback is the only way to build an empire.