Hacker Newsnew | past | comments | ask | show | jobs | submit | ambewas's commentslogin

You're right to be skeptical. Without a way to actually implement how the human brain processes experiences into a consolidated memory, we won't be able to solve the long term memory problem at all. Not with the current technology.

An LLM context is a pretty well extended short term memory, and the trained network is a very nice comprehensive long term memory, but due to the way we currently train these networks, an LLM is just fundamentally not able to "move" these experiences to long term, like a human brain does (through sleep, among others).

Once we can teach a machine to experience something once, and remember it (preferably on a local model, because you wouldn't want a global memory to remember your information), we just cannot solve this problem.

I think this is probably the most interesting field of research right now. Actually understanding in depth how the brain learns, and figuring out a way to build a model that implements this. Because right now, with backtracking and weight adjustments, I just can't see us getting there.


I think if we want to build on what we have, instead of compaction at the end of the context window, the LLM would have to 'sleep', i.e. adjust its weights, then wake up with the last bits of the old context window in the new one, and have a 'feel' for what it did before through the change in weights. I just sense it's not that simple to get there, because simply updating the weights based on a single context sample risks degrading the weights of the whole network.

I like the idea of using small local model (or several) for tackling this problem, like low rank adaptation, but with current tech, I still have to piece this together or the small local models will forget old memories.


Sleep would probably be a part of the equation for consolidating , but there's still the question of how exactly does the brain process the information during sleep in a way that it permanently consolidates the information.

It's not how an llm can work right now, it needs too much iterations & a much bigger dataset than what we can work with. A single time experiencing something and we can remember it. That's orders of magnitude more efficient than an LLM right now can achieve.


Couldn't fitting solve the problem? That's what companies do: take a model as a base and train it on the specific data long enough so that it prefers the new data. Overfitting may be a thing but for personal use, I may want to have it work as I expected, every time.


> I think this is probably the most interesting field of research right now. Actually understanding in depth how the brain learns, and figuring out a way to build a model that implements this.

This field of research has been around for decades, so who's to say when there'll be a breakthrough.

In fact, LLMs are great despite our very limited understanding, and not because we had some breakthrough about the human brain.


Exactly. It's been around so long and we still don't know how to mimic it.

The way an llm learns is a very interesting way of doing it, but it sure isn't what the brain is doing.

But it's indisputable.. We can get enormous results with this technique. It's just probably not the way forward for faster learning to remediate the issue of context loss.


Why does a language model have to be monolithic? I think retraining a model is expensive (relatively speaking). Is there some way to bolt on specialization?


That's exactly the issue. Retraining is too expensive & needs too much iteration to work efficiently I think.


How well do LoRAs work for this using something like Thinking Machine's Tinker?


It's kind of fascinating that everyone is trying to build a Chinese Room agent with stateless models, since we don't know how to produce a stateful model with continuous, incremental training.

It's like spontaneous implemention of thought experiments from yesteryear. I wonder if all this product-focused experimentation will accidentally impact philosophy of mind after all...


in my experience, as long as you set up a decent set of agent definitions & a good skillset, and work in an already pretty clean codebase with established standards, the code quality an agent outputs is actually really good.

Couple that with a self-correcting loop (design->code->PR review->QA review in playwright MCP->back to code etc), orchestrated by a swarm coordinator agent, and the quality increases even further.


still working on https://stringscales.com - fun sideproject to visualize guitar scales on a configurable fretboard, with interactive note highlighting to a backingtrack.

The backingtrack is what I'm actively improving right now. It's just a pad running now, but it will turn into a full track with bass/drums/piano/... and will feature a comprehensive chords based editor so you can add and save your own progressions with a logged in account.


Does it let you visualise quarter tone fretting? (Or is it just me who's obsessed with Angine de Poitrine right now?)


that would certainly be a very interesting thing to add, but in the current iteration that is not possible.

The biggest issue here is that there's not really "one" microtonal system out there. The entire fretboard of a conventional guitar is mapped to work in 12tet - and the libraries I'm using to do all the musical operations also only supports 12tet.

to accommodate all the microtonal temperaments out there would be a pretty daunting task. But I'm not saying never!


I was thinking the same. The author makes a great point when it comes to a portfolio, but most of the work I do is for corporate clients for example. They value efficiency and ease of use (I.e. predictability) of a ui/ux solution over any creative outlet. I don't think I should start doing more of that.. I'll be out of a job pretty fast


I've relied on Tonal heavily to build stringscales.com - it was a very pleasant experience. Much is already present, and extending the lib with more scales and functionality was easy as well. Definitely recommend


Oh nice, that looks really cool! How did you get the fretboard to sync up with those songs? Ai or some manual encoding work somehow?

Well done!


Very cool! I like the non standard fretting as well. Neat feature


Thanks a lot for sharing!

Yeah, I love the tuning option myself!


I totally agree, in fact feature number one is already planned. It'll probably be in the form of a chord schema generator and player that also shows you which scales and roots/thirds... you can use in a given chord.

For 2, it's possible to flip the fretboard to a lefty guitar. Is that what you are looking for? You can find the option under settings. Or do you mean flipping it vertically?


Vertically as well as 90 degrees.

https://www.google.com/search?output=search&q=Chords

See how most images are showing chords. The fret is laid out vertically. I've been using those and can thereby read that more naturally.


Oh right! I see what you mean. Got it, awesomeme idea: Chord charts like these are as of now a planned feature. Something to look out for in the future.

Thanks for the suggestion!


Definitely go for it! And let me know how I can help you to make it a little bit less overwhelming


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: