Our new Gemma 4 12B model hits a sweet spot between size + performance: it can run locally on a laptop, while enabling powerful multi-step reasoning and agentic workflows. Can’t wait to see what the community does with this one!
Gemma4 12B 를 맥 스튜디오 64기가에서 실행해보라고 시켰더니 완벽한 머신이라고 평가해줬네요. 예전에는 LocalLLM 실행하는것도 실행방법 찾고 안되면 문제 해결하고 이런게 귀찮아서 안했는데, 요즘은 어차피 Claude Code 같은게 알아서 다 하니까 실행에 대한 비용이 낮아져서 새로 나올때마다 부담없이 시도.
Gemma 4 12B can now run locally on just 8GB RAM via Dynamic GGUFs.
Google's new model, Gemma 4 12B Unified supports image, audio and 256K context.
You can run and train the model via Unsloth Studio.
GGUF: https://t.co/8cL321pVDh
Guide: https://t.co/odRo9WjRpA
Interesting results from M3
Designs are very Opus like, some things are not perfect as you can see. It is a reliable model (in terms of tool use), not lazy at all. But will keep testing it if actually stands out. The pricing is really nice if it actually works as shown on benchmarks.
You can’t outwork the whole world. There’s always going to be someone somewhere willing to work as hard as you. Someone just as hungry. Or hungrier.
Assuming you can work harder and longer than someone else is giving yourself too much credit for your effort and not enough for theirs. Putting in 1,001 hours to someone else’s 1,000 isn’t going to tip the scale in your favor.
What’s worse is when management holds up certain people as having a great “work ethic” because they’re always around, always available, always working. That’s a terrible example of a work ethic and a great example of someone who’s overworked.
A great work ethic isn’t about working whenever you’re called upon. It’s about doing what you say you’re going to do, putting in a fair day’s work, respecting the work, respecting the customer, respecting coworkers, not wasting time, not creating unnecessary work for other people, and not being a bottleneck. Work ethic is about being a fundamentally good person that others can count on and enjoy working with.
So how do people get ahead if it’s not about outworking everyone else?
People make it because they’re talented, they’re lucky, they’re in the right place at the right time, they know how to work with other people, they know how to sell an idea, they know what moves people, they can tell a story, they know which details matter and which don’t, they can see the big and small pictures in every situation, and they know how to do something with an opportunity. And for so many other reasons.
So get the outwork myth out of your head. Stop equating work ethic with excessive work hours. Neither is going to get you ahead or help you find calm.
[The Outwork Myth — It Doesn't Have To Be Crazy At Work, 2018]
고민중인것과 비슷한 내용. 아직은 내가 병목인 지점이 많고, 개인적인것들이나 단순한거는 코드도 보지 않고 hermes 와 해결하지만 회사 업무나 좀더 복잡한거는 내가 검증해야되는데 내가 병목. 그래서 가끔 좀더 비싸도 10배 빠르게 해주는게 있으면 내 문제가 조금이나마 해결될까 싶기도 하고요.
SpaceX has almost finished writing V1.0 of an in-house AI training stack in C that exact-maps to 220k GB300s with 800G NICs, making heavy use of pipeline parallelism and getting as close to bare metal as possible.
The potential speed improvement vs JAX for large training runs is over an order of magnitude.
Nvidia will now pay you to put a mini AI data center on your house
It looks like a normal AC unit in the yard.
But inside sits 16 Nvidia Blackwell GPUs and Dell servers.
A startup called Span builds them, backed by Nvidia.
They bolt onto your home and you get paid for the power and Wi-Fi.
Some estimates put that around $1,000 a month in your pocket.
That is rent money just for hosting a box outside.
Span says it deploys way faster and cheaper than a real data center.
The AI boom is literally moving into the suburbs.
Save this, the grid is getting rebuilt in real time.
Nvidia is now putting a mini AI data centre in homes that looks like a normal AC unit
It has 16 Nvidia Blackwell GPUs and Dell servers inside, with a startup called Span building & fitting them
The incentive? It is estimated to generate homeowners $1,000 a month in revenue
NVIDIA will cut your power bill if you let them bolt a $1,000,000 data center to your house.
Or spend $2,999 and make $22,000.
Option 1, they bolt 16 Blackwell chips to the outside of your house. You get free internet, battery backup, cheaper electricity. They get a million-dollar asset running 24/7 on your property, serving requests for Amazon, Microsoft, Google.
Option 2, you buy a DGX Spark for $2,999. Size of a paperback. Sits on your desk. 128GB memory. Runs a 70B model locally. $10/month in electricity. No data leaves the room. Paid for itself in 6 weeks for someone spending $1,900/month on cloud GPU rentals. $22,000 back in year one.
One option, you're the infrastructure.
Other option, you own it.
Same NVIDIA chips. Very different contracts.
grok-build-0.1 is now available via the xAI API in public beta.
This is the same model that powers the Grok Build CLI and excels at agentic coding.
Priced at $1/m input and $2/m output, it’s extremely cost effective, intelligent, and fast.