Jay van Zyl

@jayvanzyl

Interested in real-time predictions and experimentation

Palo Alto, CA

Joined May 2008

828 Following

1K Followers

3.2K Posts

Jay van Zyl @jayvanzyl

almost 3 years ago

Important factors to consider wrt cost of model training and serving: “SOTA models these days have about ~500B parameters and that represents at least ~1TB of GPU memory to operate with specialized infrastructure. That's a minimum of ~$60,000 - $100,000 p…https://t.co/7PyedGu7Lm

0

4

0

1

246

Jay van Zyl @jayvanzyl

almost 3 years ago

StableLM is trained on a new experimental dataset built on The Pile, but three times larger with 1.5 trillion tokens of content. The richness of this dataset gives StableLM surprisingly high performance in conversational and coding…https://t.co/4XYLtbg9OQ https://t.co/iDshAxrqad

0

0

0

0

99

Jay van Zyl @jayvanzyl

almost 3 years ago

The analogy between the syntax-semantics of natural languages and the sequence-function of proteins has revolutionized the way humans inves- tigate the language of life. https://t.co/szapPmr9yA

0

0

0

0

88

Jay van Zyl @jayvanzyl

almost 3 years ago

With YouTube creators becoming increasingly empowered by versatile generative AI tools, it will only amplify the rising trend of audiences consuming more user-generated content on TVs, conducive to more YouTube advertising revenue,…https://t.co/CsRvgLRs0p https://t.co/aJ3Pzc4Ewu

0

0

0

0

69

Who to follow

Verified account

Official account of WaFd Bank. Providing communities and businesses with simple, straightforward banking solutions. Member FDIC.

Verified account

Was born on the 4'th of July. For real. Interlocutor. “Happiness is an internal configuration, not an external curation”.

Jay van Zyl @jayvanzyl

almost 3 years ago

They say a good craftsman shouldn't blame his tools, but can a good tool [LLM] blame a shoddy craftsman? But Large language models specialize in generating human-like text. Correct answers are a bonus. https://t.co/ViqcLqPM9l

0

0

0

0

74

Jay van Zyl @jayvanzyl

almost 3 years ago

Brilliant rendition of the human brain https://t.co/or82SGFpi0

0

1

0

0

39

Jay van Zyl @jayvanzyl

almost 3 years ago

Another key concept to understand: Most of the AI-generated images currently produced rely on Diffusion Models as their foundation. https://t.co/rOUViLJRL2

0

0

0

0

39

Jay van Zyl @jayvanzyl

almost 3 years ago

Together with https://t.co/VckAzyW9l6 real-time behavioral capabilities, generative models add a much needed angle to AI for business usefulness. Here is a another outline in summary for those who need a quick reference: Generativ…https://t.co/s8WWxqxUp2 https://t.co/PBwsXmTJni

0

2

0

0

52

Jay van Zyl @jayvanzyl

almost 3 years ago

Effective intervention :) https://t.co/3xAT9PfmX1

0

0

0

0

30

Jay van Zyl @jayvanzyl

almost 3 years ago

Cape Town looks like a safe option while we're working on solving all of this :) https://t.co/fab0XN9vs4

0

0

0

0

32

Jay van Zyl @jayvanzyl

almost 3 years ago

Excellent share @dxbrob. "It is perhaps uncontroversial to say that this claim that one of us made eight years ago (Soman, 2015) is now accepted as universal truth. Governments, for-profit organizations, not for profits, startups, consumer protect…https://t.co/BUyZaqvXbr

0

0

0

0

38

Jay van Zyl @jayvanzyl

almost 3 years ago

FinGPT emphasizes the critical significance of data collecting, cleaning, and preprocessing in creating open-source FinLLMs using a data-centric approach. FinGPT seeks to advance financial research, cooperation, and innovation by p…https://t.co/xg1GQIXfRl https://t.co/8LxNu6S0bt

0

1

0

0

83

Jay van Zyl @jayvanzyl

almost 3 years ago

Extreme actions for good results. https://t.co/2pRxavUa4G

0

0

0

0

18

Jay van Zyl @jayvanzyl

almost 3 years ago

Great event yesterday in the city, San Francisco #mtpcon https://t.co/Dh5kYIoM1y

0

1

0

0

46

Jay van Zyl @jayvanzyl

about 3 years ago

Great paper on transformers: “Transformer large language models (LLMs) have sparked admiration for their exceptional performance on tasks that demand intricate multi-step reasoning. Yet, these models simultaneously show failures on…https://t.co/xPeVzIS8Ag https://t.co/LDXgdXAznr

0

1

0

2

82

Jay van Zyl @jayvanzyl

about 3 years ago

How will the generative AI world be affected? https://t.co/HhAwyHh2J8

0

0

0

0

43

Jay van Zyl @jayvanzyl

about 3 years ago

Gorilla is a major addition to the list of language models, as it even addresses the issue of writing API calls. Its capabilities enable the reduction of problems related to hallucination and reliability. https://t.co/ux06hP4qUl

0

0

0

0

43

Jay van Zyl @jayvanzyl

about 3 years ago

Another great set of models. Why use Falcon-40B? 1. It is the best open-source model currently available. Falcon-40B outperforms LLaMA, StableLM, RedPajama, MPT, etc. See the OpenLLM Leaderboard. 2. It features an architecture optimized for inference, wit…https://t.co/kpucrfPnDR

0

0

0

0

107

Jay van Zyl @jayvanzyl

about 3 years ago

Should recommender system be held liable? https://t.co/3CcNxJheHp

0

0

0

0

23

Jay van Zyl @jayvanzyl

about 3 years ago

As the commoditization of LLM models continue, here's a list to review. https://t.co/ME7shGG0Ut

0

0

0

0

44

Last Seen Users on Sotwe

Trends for you

Most Popular Users