More

_davide_ · 2026-06-08T20:47:45 1780951665

>It's such a weird "Gotcha" that seems to only assume that Chinese LLMs might censor something.

We are not assuming anything; it is illegal, and you will get prison time just for talking about it. Yeah, sure, everyone distorts reality, but there is a huge gap between hiding and enforcing. So yeah, having models respond accordingly is unexpected. There are probably multiple variants tuned differently.

_davide_ · 2026-06-03T20:23:18 1780518198

Consciousness doesn't exist, it's a vanity concept, to boost human ego...

tvshtr · 2026-06-04T00:07:15 1780531635

This is probably a semantic issue seeing as we don't have a widely agreed definition of it. I like to think about it in terms of self-reflective, subjective experience. I'm not even sure if emotions would be a requirement and was surprised to see Chiang so hung up on them. Would he consider humans which can have a variety of mental disorders, causing a complete lack of some of them to not poses consciousness?

Jtarii · 2026-06-03T20:39:36 1780519176

>Consciousness doesn't exist

I can confirm that this is incorrect.

Procrastes · 2026-06-03T20:47:46 1780519666

But not in any convincing way, which seems like the root of the problem.

vajdagabor · 2026-06-04T18:11:14 1780596674

I can confirm too. It's empirical, look!

enduser · 2026-06-04T02:41:48 1780540908

What if "I" is incorrect?

vajdagabor · 2026-06-04T18:57:47 1780599467

What if "I" is both correct and incorrect?

layer8 · 2026-06-03T20:51:38 1780519898

Why should we believe you?

Jtarii · 2026-06-04T00:40:07 1780533607

You already believe me so that seems like a pointless question.

layer8 · 2026-06-04T09:18:46 1780564726

I don’t believe you, you may just be an AI bot.

_davide_ · 2026-05-27T07:24:36 1779866676

I had a subscription before the price was cut down; the model kept randomly looping the with same character (burning 30% of the budget in one shot), and the overall performance for agentic purposes is, simply put, terrible. It finds non-existing bugs and randomly removes chunks of code to fix them, then even presents it as an "extra fix". Maybe it's a good generalistic model; I haven't tested it in that regard.

MiniMax (currently 2.7) which is a ~270B model tuned exclusively for agentic purposes, performs so MUCH better; it's more reliable and cheaper. Both are still far away from Opus 4.7 that I'm using at work. IMO benchmarks are just a very rough estimation; everyone cheats as much as they can get away with. Test the model yourself; do not make any assumptions based on the benchmarks.

I would love to see specialized, cheaper, bleeding-edge models like MiniMax for other non-agentic purposes as well. Why pay $1 for a general model when, for example, you can pay $0.1 for a content-moderator model that you actually need?

zarify · 2026-05-27T09:53:13 1779875593

Funny, I had the opposite experience with MiniMax and Mimo when using OpenCode. MiniMax got stuck with looping through broken tool calls all the time and MiMo just powered through things and for the most part just worked.

shanoaice · 2026-05-29T19:48:19 1780084099

similarly for me, MiniMax is kind of horrible that it somewhat regularly fall into loops that I had to save it from. DeepSeek & MiMO rarely got stuck. wonder how you get completely reversed experience.

_davide_ · 2026-06-09T07:11:54 1780989114

I did develop my own agent around MiniMax. I did see weird behavior when I messed up the loop, like omitting pieces of remove thinking; maybe it's an agent bug, some models/providers just ignore/normalize the broken input, some don't.

ignoramous · 2026-06-10T07:17:32 1781075852

Seems like the harness may have to be "finetuned" for models. I've had my share of MiMo 2.5 Pro get stuck in thinking loops with GitHub Copilot.

_davide_ · 2026-05-26T19:43:32 1779824612

They do want to see the American bubble burst, this is the quickest way

andrekandre · 2026-05-26T23:33:14 1779838394

with all the price increases in everything else, i think we are all tired of this bubble to be honest...

_davide_ · 2026-05-26T18:13:41 1779819221

100% agree, unwatchable and cheap, it's the most effective way to make sure I'll never touch the product.

_davide_ · 2026-05-14T06:41:51 1778740911

Lol, use Gentoo

_davide_ · 2026-05-14T06:19:27 1778739567

Except rust is way more productive than any other language out there...

worik · 2026-05-14T20:11:34 1778789494

> ...rust is way more productive than any other language...

Eventually

The learning curve is unfun

_davide_ · 2026-04-23T11:31:45 1776943905

Thank you, but no thanks

_davide_ · 2026-03-25T10:58:57 1774436337

Most examples and presented issues would not compile or be a real issue... I stopped reading midway

_davide_ · 2025-10-04T06:51:07 1759560667

> What could go wrong?

LoL, an insane amount of things. TCP connections are an illusion of safely, for the purpose of database commits use UDP packets as a model instead, it'll be much closer to reality.

0x1ceb00da · 2025-10-04T09:18:24 1759569504

> an insane amount of things

List a couple

> TCP connections are an illusion of safely

Why?