When there is no structure to learn, the network has to memorize the data. But when structure is present, it is cheaper to represent the structure than the individual examples, so the network learns the structure first (and memorizes the exceptions afterward).
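The claim can be illustrated with a toy experiment: train a small network on data where most labels follow a simple rule and a minority are random, and record when each subset is first fit. This is a minimal sketch, not from the original discussion; the sizes, learning rate, and 95% threshold are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: 200 points in 20 dimensions. 160 labels follow a linear
# rule (the "structure"); the other 40 are random (pure memorization).
n, d, n_noise = 200, 20, 40
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = (X @ w_true > 0).astype(float)
noise_idx = rng.choice(n, size=n_noise, replace=False)
y[noise_idx] = rng.integers(0, 2, size=n_noise).astype(float)
struct_idx = np.setdiff1d(np.arange(n), noise_idx)

# One-hidden-layer ReLU network trained by full-batch gradient descent.
h, lr = 100, 1.0
W1 = rng.normal(size=(d, h)) * 0.1
b1 = np.zeros(h)
W2 = rng.normal(size=h) * 0.1
b2 = 0.0

first_fit = {"structure": None, "noise": None}
for epoch in range(5000):
    a = np.maximum(X @ W1 + b1, 0.0)            # ReLU hidden layer
    z = np.clip(a @ W2 + b2, -30.0, 30.0)       # clip logits for stability
    p = 1.0 / (1.0 + np.exp(-z))                # sigmoid output
    pred = (p > 0.5).astype(float)
    # Record the first epoch at which each subset is >= 95% fit.
    if first_fit["structure"] is None and \
            (pred[struct_idx] == y[struct_idx]).mean() >= 0.95:
        first_fit["structure"] = epoch
    if first_fit["noise"] is None and \
            (pred[noise_idx] == y[noise_idx]).mean() >= 0.95:
        first_fit["noise"] = epoch
    # Backprop for the binary cross-entropy loss (averaged over n).
    g = (p - y) / n
    ga = np.outer(g, W2) * (a > 0)
    W2 -= lr * (a.T @ g)
    b2 -= lr * g.sum()
    W1 -= lr * (X.T @ ga)
    b1 -= lr * ga.sum(axis=0)

print(first_fit)
```

In runs like this, the structured subset typically crosses the threshold many epochs before the noisy subset does (if the noisy subset is ever fit at all within the epoch budget), which is the "structure first, memorization after" ordering the comment describes.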
That sounds like it’s probably right to me. But so do lots of things that turn out to be wrong. I wish we had a better grasp of what is happening, not just plausible stories. I’m already sick of doing alchemical tinkering to find a model that works.