Bytez @bytez - Twitter Profile

Pinned Tweet

about 1 year ago

Run 100,000+ AI Models for Free Build AI faster with the largest inference API on the internet. Instantly demo and deploy thousands of models through one unified API. Serverless inference. No infra or DevOps required. Free for developers. Try now👇

Bytez's tweet photo. Run 100,000+ AI Models for Free

Build AI faster with the largest inference API on the internet. Instantly demo and deploy thousands of models through one unified API.

Serverless inference. No infra or DevOps required. Free for developers.

Try now👇

24

579

42

515

805K

Bytez @Bytez

2 months ago

How Bytez 0.1 works: Think of an exam. 1,000 students in the room — each one the top mind in their field. Lawyers, doctors, engineers, artists. Teacher asks a question. Everyone writes their answer. Our model gets to cheat. It sees all 1,000 answers. It figures out what kind of question was asked. Legal question? It pulls the top 3 legal minds' answers. Medical? Top 3 medical minds. Then it either combines their answers or picks the best one. Every other model gets 1-shot at the answer. Ours gets N-shots, from N experts. This is what we call a Web-Scale MoE. Each "expert" isn't a subnetwork inside a single model — it's an entirely separate model. As more experts show up on the web, our model gets smarter without retraining. The upside: it scores higher than Opus 4.6, Gemini 3.1 Pro, and GPT-5.4 across benchmarks. The downside: it behaves like a bigger model and costs more to run. We think the tradeoff is worth it. 1,000 minds wired together are smarter than any single mind. Is the path to AGI training one massive mind — or wiring together every mind that comes into existence?

Bytez's tweet photo. How Bytez 0.1 works:

Think of an exam. 1,000 students in the room — each one the top mind in their field. Lawyers, doctors, engineers, artists.

Teacher asks a question. Everyone writes their answer.

Our model gets to cheat. It sees all 1,000 answers. It figures out what kind of question was asked. Legal question? It pulls the top 3 legal minds' answers. Medical? Top 3 medical minds.

Then it either combines their answers or picks the best one.

Every other model gets 1-shot at the answer. Ours gets N-shots, from N experts.

This is what we call a Web-Scale MoE. Each "expert" isn't a subnetwork inside a single model — it's an entirely separate model.

As more experts show up on the web, our model gets smarter without retraining.

The upside: it scores higher than Opus 4.6, Gemini 3.1 Pro, and GPT-5.4 across benchmarks.
The downside: it behaves like a bigger model and costs more to run.

We think the tradeoff is worth it. 1,000 minds wired together are smarter than any single mind.

Is the path to AGI training one massive mind — or wiring together every mind that comes into existence?

1

3

0

1

147

Bytez @Bytez

2 months ago

Gemma 4 Models Now Available On Bytez Hey all, as promised, we keep up with the latest and greatest in open source and closed source machine learning. The following models are now available: google/gemma-4-E2B-it google/gemma-4-E4B-it google/gemma-4-26B-A4B-it google/gemma-4-31B-it Models support the ability to understand text, image, audio, and video as context. Hit them either via the Bytez.js client, or via our chat/completions endpoint! May thy vibe harvest be fruitful, and may thy cup overfloweth with success! https://t.co/aueAkgISZe

0

4

0

139

Bytez retweeted

NeurIPS Conference

@NeurIPSConf

3 months ago

The Position Paper Track is back at NeurIPS 2026 for the second year, with an expanded scope, and better alignment with the main and Evaluation and Dataset tracks! Head to the Call for Paper at https://t.co/AexnZLLfsx for all the important dates and information and read our accompanying blog post at https://t.co/02v3jUxv7J to learn more about the changes we are making this year and how we adapted the process based on the feedback we got from the community! The submission deadline is the same as for the main and ED track: May 6, 2026 AoE. We are looking forward to read your papers and any feedback you may have!

9

111

16

39

26K

Who to follow

Eduardo Castro

@ecastropro

Marketing & ₿itcoin | University lecturer Entrepreneur & content creator 🇸🇻 ⚡

Greg F. Barbieri

@GregFBarbieri

Amateur economist, Python hobbyist. Trying to master the basics at @MasonEconomics.

Sootblower

@Fishlips1971

Bytez @Bytez

3 months ago

Early access coming soon Benchmarks: https://t.co/u4IdmVa0VZ

Bytez @Bytez

3 months ago

Bytez 0.1 beats Opus, Gemini Pro, and GPT-5.4 across benchmarks. Achieved without spending millions on GPUs. Karpathy recently said LLM ensembles are "under-explored." He vibe-coded a weekend prototype to test the idea. We've been building the production version for 2 years. Bytez 0.1 is a Web-Scale MoE — instead of training one massive model, we fuse thousands of models into a single intelligence. Instead of training intelligence, we absorb it. More benchmarks dropping soon. What's a faster path to a v1 of AGI? A) One massive model that tries to be an expert on everything B) Thousands of experts fused into one

Bytez's tweet photo. Bytez 0.1 beats Opus, Gemini Pro, and GPT-5.4 across benchmarks.

Achieved without spending millions on GPUs.

Karpathy recently said LLM ensembles are "under-explored."

He vibe-coded a weekend prototype to test the idea.

We've been building the production version for 2 years. Bytez 0.1 is a Web-Scale MoE — instead of training one massive model, we fuse thousands of models into a single intelligence.

Instead of training intelligence, we absorb it.

More benchmarks dropping soon.

What's a faster path to a v1 of AGI?
A) One massive model that tries to be an expert on everything
B) Thousands of experts fused into one

3

12

4

352

0

2

0

44

Bytez @Bytez

3 months ago

Bytez 0.1 update: more evals ran, matching or beating Opus 4.6, Gemini 3.1 Pro, and GPT-5.4 across benchmarks PS: the 0.1 model also does pro-level 3D generation What happens after models learn to generate 3D worlds?

2

4

0

96

Bytez @Bytez

3 months ago

@alejandrGZAX Appreciate it, DM us your email and we'll add you to the early access list — first group gets in soon

1

0

12

Bytez @Bytez

3 months ago

Bytez 0.1 beats Opus, Gemini Pro, and GPT-5.4 across benchmarks. Achieved without spending millions on GPUs. Karpathy recently said LLM ensembles are "under-explored." He vibe-coded a weekend prototype to test the idea. We've been building the production version for 2 years. Bytez 0.1 is a Web-Scale MoE — instead of training one massive model, we fuse thousands of models into a single intelligence. Instead of training intelligence, we absorb it. More benchmarks dropping soon. What's a faster path to a v1 of AGI? A) One massive model that tries to be an expert on everything B) Thousands of experts fused into one

3

12

4

352

Bytez retweeted

Julien Chaumond

@julien_c

3 months ago

Dataset Editing has landed for Parquet Datasets on the HF Hub ✍️

3

89

15

33

14K

Bytez retweeted

a16z @a16z

3 months ago

.@illscience says the future of AI isn’t one model to rule them all—and explains why platforms that integrate multiple models will benefit the most: "I think we're going to need and rely on all of the models." "It's sort of like if you have a team of people... if you have five people, they could all do a basic set of things pretty capably." "But then they all have their specializations. Maybe one of them is really good at closing a customer who doesn't want to sign the deal, and one of them is really good at culture and getting the best out of the team." "There are some areas in which they are going to build apps, and that will be a threat to app companies. But there are many areas in which app companies are advantaged. Cursor and Krea are great examples of this—products where you benefit from being multi-model." "When you actually use a creative tool, you don't want to just use Nano Banana, you want to have access to OpenAI, Nano Banana, Kling—all of them—Qwen, you name it. So using a single interface to access all the models is powerful." Anish Acharya on BILLIONS with @GuillaumeMbh

19

162

28

67

33K

Bytez retweeted

Palatial

@PalatialSim

4 months ago

Last week, we launched Palatial PhysReady and the response blew us away. Over 100 companies signed up for our waitlist and the team had a blast watching everyone tagging @PalatialSim with their creative prompts. We generated over 100 assets in 1 day and below are a few of the highlights. We're looking forward to giving everyone access to the platform and API, wave 2 goes live on Wednesday! Sign up at https://t.co/5fUD94wuge

5

22

5

6

2K

Bytez retweeted

Julien Chaumond

@julien_c

4 months ago

We don’t want to have to choose between 2 model providers We want to choose between 1,000s of model providers

23

258

30

11

28K

Bytez retweeted

Palatial

@PalatialSim

4 months ago

A child consumes more data in 1 month than any LLM has ever seen. Embodied agents learn by doing, but the data that teaches them is tactile, sensorial and causal. Such data does not exist. To make physical AGI possible, we need to generate this new data at an industrial scale. Enter Palatial: automated infrastructure that converts raw data into sensory rich playgrounds for robots to learn in. Today, we’re unveiling Palatial PhysReady, the first automated sim asset generator (try it ⬇️) [1/5]

56

276

38

201

58K

Bytez @Bytez

4 months ago

@PalatialSim @hollympeck @PalatialSim generate me a chocolate chip cookie 🍪

2

1

0

107

Bytez @Bytez

4 months ago

@PalatialSim @hollympeck 👀

1

0

63

Bytez retweeted

Director Michael Kratsios

@mkratsios47

4 months ago

The future of AI is agentic, and America is leading the way to make it secure and interoperable. A new AI Agent Standards Initiative is launching this week @NIST to drive industry-led standards and open protocols that build trust and advance innovation. https://t.co/bS5oqvU8iu

140

2K

329

883

153K

Bytez retweeted

OpenRouter

@OpenRouter

4 months ago

Benchmarks are now available on OpenRouter! See how models perform on industry standard tests, including programming, math, science, long context reasoning, and more to come.

OpenRouter's tweet photo. Benchmarks are now available on OpenRouter!

See how models perform on industry standard tests, including programming, math, science, long context reasoning, and more to come. https://t.co/vJhXaM0lu0

26

737

32

107

97K

Bytez retweeted

Artificial Analysis

@ArtificialAnlys

6 months ago

NVIDIA has just released Nemotron 3 Nano, a ~30B MoE model that scores 52 on the Artificial Analysis Intelligence Index with just ~3B active parameters Hybrid Mamba-Transformer architecture: Nemotron 3 Nano combines the hybrid Mamba-Transformer approach @NVIDIAAI has used on previous Nemotron models with a moderate-sparsity MoE architecture, enabling highly efficient inference, particularly at longer sequence lengths Small-model improvements: with 31.6B total and 3.6B active parameters, Nemotron 3 Nano scores 52 on our Intelligence Index, in line with OpenAI’s gpt-oss-20b (high). This represents a +6 point lead on the similarly-sized Qwen3 30B A3B 2507 and +15 improvement on NVIDIA’s previous Nemotron Nano 9B V2 (a dense model) High openness: Nemotron 3 Nano follows other recent NVIDIA models in open licensing and releases of data and methodology for the community to use and replicate - it scores an 67 on the Artificial Analysis Openness Index, in line with previous Nemotron Nano models Key model details: ➤ 1 million token context window, with text only support ➤ Supports reasoning and non-reasoning modes ➤ Released under the NVIDIA Open Model License; the model is freely available for commercial use or training of derivative models ➤ On launch, the model is being made available with a range of serverless inference providers including @baseten, @DeepInfra, @FireworksAI_HQ, @togethercompute and @friendliai, and it is available now on Hugging Face for local inference or self-deployment See below for our full analysis and key announcement links from NVIDIA 👇

ArtificialAnlys's tweet photo. NVIDIA has just released Nemotron 3 Nano, a ~30B MoE model that scores 52 on the Artificial Analysis Intelligence Index with just ~3B active parameters

Hybrid Mamba-Transformer architecture: Nemotron 3 Nano combines the hybrid Mamba-Transformer approach @NVIDIAAI has used on previous Nemotron models with a moderate-sparsity MoE architecture, enabling highly efficient inference, particularly at longer sequence lengths

Small-model improvements: with 31.6B total and 3.6B active parameters, Nemotron 3 Nano scores 52 on our Intelligence Index, in line with OpenAI’s gpt-oss-20b (high). This represents a +6 point lead on the similarly-sized Qwen3 30B A3B 2507 and +15 improvement on NVIDIA’s previous Nemotron Nano 9B V2 (a dense model)

High openness: Nemotron 3 Nano follows other recent NVIDIA models in open licensing and releases of data and methodology for the community to use and replicate - it scores an 67 on the Artificial Analysis Openness Index, in line with previous Nemotron Nano models

Key model details:
➤ 1 million token context window, with text only support

➤ Supports reasoning and non-reasoning modes

➤ Released under the NVIDIA Open Model License; the model is freely available for commercial use or training of derivative models

➤ On launch, the model is being made available with a range of serverless inference providers including @baseten, @DeepInfra, @FireworksAI_HQ, @togethercompute and @friendliai, and it is available now on Hugging Face for local inference or self-deployment

See below for our full analysis and key announcement links from NVIDIA 👇

9

285

49

65

111K

Bytez retweeted

OpenRouter

@OpenRouter

6 months ago

You can now see the most popular large-context models on the OpenRouter Rankings 👇

4

50

6

5

7K

Bytez retweeted

Michael Bronstein @mmbronstein

6 months ago

NeurIPS 2025 papers per 1 Million People 1. Singapore – 64.51 2. Switzerland – 22.13 3. Israel – 11.17 4. UAE – 9.47 5. UK – 7.50 6. US – 7.44 7. Denmark – 7.37 8. Australia – 7.31 9. Canada – 6.93 10. South Korea – 5.78

42

1K

110

301

145K

Bytez

@Bytez

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users