@rakyll Ya I meant why don’t we do all the logic the docker/proxy is doing behind the API so migration is simple. This feels complicated enough to just use a different model
OpenAIs new embedding seems to really prefer earlier text. Cos(emb(t1), emb(t1+t2)) > Cos(emb(t1), emb(t2+t1)) ~97% of the time of the 1000 examples I tried. Interesting retrieval consideration