Right, bluesky sorted!
Just for completeness sake, I'm:
@chton on https://t.co/X33nEDAXmr
@chton.bsky.social on the blue site
@[email protected] on the elephant site
@TheChton on discord
/u/chton on Reddit
and https://t.co/fDfjZXAJx5 on the web if you want other details.
@Alex___F___ i'm working on one! Unfortunately i'm not an iOS developer so that's all a lot slower, and the site updates apply to the app too, so it's been less of an issue until recently. The new iOS update really screwed that over!
Goblin Tools has just had a big update, fixing a lot of bugs with translations and ToDo lists. If you had sync issues, or performance problems, hopefully they're all solved now!
You might need to re-open the app to load it.
If any of you still run into issues, let me know?
@CH4R10T_TV Marika is in the photo the black woman is pulled from, maybe people are getting confused because of that. Marika is as screen accurate as they can get! The black person seems to be a random noble, not anyone from the game that's been race-swapped.
Pushed huge upgrade to https://t.co/fChj9OeyMk
Should help with stability, and it adds a few things people have asked for since... forever.
Multiple todo lists! Deadlines! A prioritisation button!
And a new tool! Go give it a shot, it might help make todos less overwhelming.
I'm also officially looking for a sponsor for it. Some company who would like to support it in return for showing their logo on the site in a non-intrusive way.
If you know anyone that fits, send them my way!
Pushed huge upgrade to https://t.co/fChj9OeyMk
Should help with stability, and it adds a few things people have asked for since... forever.
Multiple todo lists! Deadlines! A prioritisation button!
And a new tool! Go give it a shot, it might help make todos less overwhelming.
I've built a full LLM inference engine in C#/.NET 10. From scratch. Not a wrapper - native GGUF loading, BPE tokenizer, attention, KV-cache, SIMD-vectorized CPU kernels, CUDA GPU backend, OpenAI-compatible API. Solo dev, ~2 months, AI-assisted (not vibe-coded!). First preview is out.
Check it out for mode details at https://t.co/Bl5wAYalYY and https://t.co/rQWhKN0iVA
@konradkokosa Oh this is awesome, i've been thinking about grabbing an inference engine and changing it for experiments, and having one in my primary language to work with is going to make that so much easier. Hell yeah.
Judging by my tl there is a growing gap in understanding of AI capability.
The first issue I think is around recency and tier of use. I think a lot of people tried the free tier of ChatGPT somewhere last year and allowed it to inform their views on AI a little too much. This is a group of reactions laughing at various quirks of the models, hallucinations, etc. Yes I also saw the viral videos of OpenAI's Advanced Voice mode fumbling simple queries like "should I drive or walk to the carwash". The thing is that these free and old/deprecated models don't reflect the capability in the latest round of state of the art agentic models of this year, especially OpenAI Codex and Claude Code.
But that brings me to the second issue. Even if people paid $200/month to use the state of the art models, a lot of the capabilities are relatively "peaky" in highly technical areas. Typical queries around search, writing, advice, etc. are *not* the domain that has made the most noticeable and dramatic strides in capability. Partly, this is due to the technical details of reinforcement learning and its use of verifiable rewards. But partly, it's also because these use cases are not sufficiently prioritized by the companies in their hillclimbing because they don't lead to as much $$$ value. The goldmines are elsewhere, and the focus comes along.
So that brings me to the second group of people, who *both* 1) pay for and use the state of the art frontier agentic models (OpenAI Codex / Claude Code) and 2) do so professionally in technical domains like programming, math and research. This group of people is subject to the highest amount of "AI Psychosis" because the recent improvements in these domains as of this year have been nothing short of staggering. When you hand a computer terminal to one of these models, you can now watch them melt programming problems that you'd normally expect to take days/weeks of work. It's this second group of people that assigns a much greater gravity to the capabilities, their slope, and various cyber-related repercussions.
TLDR the people in these two groups are speaking past each other. It really is simultaneously the case that OpenAI's free and I think slightly orphaned (?) "Advanced Voice Mode" will fumble the dumbest questions in your Instagram's reels and *at the same time*, OpenAI's highest-tier and paid Codex model will go off for 1 hour to coherently restructure an entire code base, or find and exploit vulnerabilities in computer systems. This part really works and has made dramatic strides because 2 properties: 1) these domains offer explicit reward functions that are verifiable meaning they are easily amenable to reinforcement learning training (e.g. unit tests passed yes or no, in contrast to writing, which is much harder to explicitly judge), but also 2) they are a lot more valuable in b2b settings, meaning that the biggest fraction of the team is focused on improving them. So here we are.
@lbletan It depends what you want to achieve with it, but with this level of codebase i would expect it to not really have a problem? hard to prove without a test case.
What have you tried? Just diving in blind, or are you asking your AI agent to start by creating overviews first?
@lbletan Aaah I see! Maybe it's worth writing out a test scenario and letting people have a go? 'clone the codebase, fix X bug/add X feature, run script Y to validate that the feature works and nothing else broken' would be a great way for yourself too, to test out different workflows.
Big News! Arcology has been selected for @CivTechScotland Round 11 🎉
We're building a recording and workflow assistant to free teachers from incident admin. Less paperwork, less stress, more time for students.
#CivTechRound11#CivTechScotland#CivTechChallenge
man: i wish to publish
reviewer 2: your paper is no good
man: i'll do anything to improve
reviewer 2: it's simple. you must read the work of the great scientist Pagliarini
man: *bursts into tears* but i am Pagliarini
@gaghyogi49@MazderVerhal Not only that, her FATHER was the Jem'Hadar, her mother the Klingon, clearly from those quotes of them.
And we've seen so many hybrids of different species that normally can't reproduce among each other.
@SGTWipper1Each It's a pip-boy watchface, as others have pointed out, but it's on a Garmin Fenix 3, which hasn't been in production in almost a decade. That's harsh conditions for a smartwatch!
@loweffortbricks I realise this probably comes out of the blue but i've had problems with people using my toolset's name to pump and dump or otherwise scam people, so i'm sensitive to its name being confused and my userbase losing trust.
@loweffortbricks Hiya!
You don't know me, but i'm the builder of Goblin Tools (https://t.co/fChj9OeyMk). It's a free AI toolset for neurodivergent people.
Would it be possible to change your bot's name? Just to avoid confusion, and avoid association between my tools and crypto.