I remain confused but still somewhat interested as to a definition of "novel", g...

snickerbockers · 2025-11-15T00:30:55 1763166655

Generally novel either refers to something that is new, or a certain type of literature. If the AI is generating something functionally equivalent to a program in its training set (in this case, dozens or even hundreds of such programs) then it by definition cannot be novel.

brulard · 2025-11-15T00:55:20 1763168120

This is quite a narrow view of how the generation works. AI can extrapolate from the training set and explore new directions. It's not just cutting pieces and gluing together.

throwaway173738 · 2025-11-15T05:21:56 1763184116

Calling it “exploring” is anthropomorphising. The machine has weights that yield meaningful programs given specification-like language. It’s a useful phenomenon but it may be nothing like what we do.

grosswait · 2025-11-15T12:36:26 1763210186

Or it may be remarkably similar to what we do

beeflet · 2025-11-15T01:32:29 1763170349

In practice, I find the ability for this new wave of AI to extrapolate very limited.

fragmede · 2025-11-15T01:43:56 1763171036

Do you have any concrete examples you'd care to share? While this new wave of AI doesn't have unlimited powers of extrapolation, the post we're commenting on is asserting that this latest AI from Google was able to extrapolate solutions to two of AI's oldest problems, which would seem to contradict an assertion of "very limited".

kazinator · 2025-11-15T03:13:08 1763176388

Positively not. It is pure interpolation and not extrapolation. The training set is vast and supports an even vaster set of possible traversal paths; but they are all interpolative.

Same with diffusion and everything else. It is not extrapolation that you can transfer the style of Van Gogh onto a photographl it is interpolation.

Extrapolation might be something like inventing a style: how did Van Gogh do that?

And, sure, the thing can invent a new style---as a mashup of existing styles. Give me a Picasso-like take on Van Gogh and apply it to this image ...

Maybe the original thing there is the idea of doing that; but that came from me! The execution of it is just interpolation.

BoorishBears · 2025-11-15T03:53:34 1763178814

This is knock against you at all, but in a naive attempt to spare someone else some time: remember that based on this definition it is impossible for an LLM to do novel things and more importantly, you're not going to change how this person defines a concept as integral to one's being as novelty.

I personally think this is a bit tautological of a definition, but if you hold it, then yes LLMs are not capable of anything novel.

Libidinalecon · 2025-11-15T12:50:24 1763211024

I think you should reverse the question, why would we expect LLMs to even have the ability to do novel things?

It is like expecting a DJ remixing tracks to output original music. Confusing that the DJ is not actually playing the instruments on the recorded music so they can't do something new beyond the interpolation. I love DJ sets but it wouldn't be fair to the DJ to expect them to know how to play the sitar because they open the set with a sitar sample interpolated with a kick drum.

HeinzStuckeIt · 2025-11-16T00:07:25 1763251645

A lot of musicians these days are using sample libraries instead of actually holding real instruments in their hands. It’s not just DJs or electronic producers. It’s remarkable that Brendan Perry of Dead Can Dance, for example, who played guitar and bass as a young man and once amassed a collection of exotic instruments from around the world, built recent albums largely out of instrument sample libraries. One of technology’s effects on culture that maybe doesn’t get talked about as much as outright electronic genres.

BoorishBears · 2025-11-15T22:08:05 1763244485

It just depends on how you define novel.

Would you consider the instrumental at 33 seconds a new song? https://youtu.be/eJA0wY1e-zU?si=yRrDlUN2tqKpWDCv

8note · 2025-11-15T20:41:13 1763239273

kid koala does jazz solos on a disk of 12 notes, jumping the track back and forth to get different notes.

i think that, along with the sitar player are still interpolating. the notes are all there on the instrument. even without an instrument, its still interpolating. the space that music and aound can be in is all well known wave math. if you draw a fourier transform view, you could see one chart with all 0, and a second with all +infinite, and all music and sound is gonna sit somewhere between the two.

i dont know that "just interpolation" is all that meaningful to whether something is novel or interesting.

kazinator · 2025-11-15T23:47:30 1763250450

The DJ's tracks are just tone producing elements.

If he plucked one of the 13 strings of a koto, we wouldn't say he is just remixing the vibration of the koto. Perhaps we could say that, if we had justification. There is a way of using a musical instrument as just a noise maker to produce its characteristics sounds.

Similarly, a writer doesn't just remix the alphabet, spaces and punctuation symbols. A randomly generated soup of those symbols could the thought of as their remix, in a sense.

The question is, is there a meaning being expressed using those elements as symbols?

Or is just the mixing all there is to the meaning? I.e. the result says "I'm a mix of this stuff and nothing more".

If you mix Alphagetti and Zoodles, you don't have a story about animals.

kazinator · 2025-11-15T05:27:27 1763184447

That is not strictly true, because being able to transfer the style of Van Gogh onto an arbitrary photographic scene is novel in a sense, but it is interpolative.

Mashups are not purely derivative: the choice of what to mash up carries novelty: two (or more) representations are mashed together which hitherto have not been.

We cannot deny that something is new.

regularfry · 2025-11-15T08:07:09 1763194029

Innovation itself is frequently defined as the novel combination of pre-existing components. It's mashups all the way down.

BoorishBears · 2025-11-15T20:58:11 1763240291

I'm saying their comment is calling that not something new.

I don't agree, but by their estimation adding things together is still just using existing things.

ozgrakkurt · 2025-11-15T04:56:36 1763182596

This is how people do things as well imo. LLM does the same thing on some level but it is just not good enough for majority of use cases

snickerbockers · 2025-11-15T01:44:38 1763171078

uhhh can it? I've certainly not seen any evidence of an AI generating something not based on its training set. It's certainly smart enough to shuffle code around and make superficial changes, and that's pretty impressive in its own way but not particularly useful unless your only goal is to just launder somebody else's code to get around a licensing problem (and even then it's questionable if that's a derived work or not).

Honest question: if AI is actually capable of exploring new directions why does it have to train on what is effectively the sum total of all human knowledge? Shouldn't it be able to take in some basic concepts (language parsing, logic, etc) and bootstrap its way into new discoveries (not necessarily completely new but independently derived) from there? Nobody learns the way an LLM does.

ChatGPT, to the extent that it is comparable to human cognition, is undoubtedly the most well-read person in all of history. When I want to learn something I look it up online or in the public library but I don't have to read the entire library to understand a concept.

BobbyTables2 · 2025-11-15T03:48:59 1763178539

You have to realize AI is trained the same way one would train an auto-completer.

Theres no cognition. It’s not taught language, grammar, etc. none of that!

It’s only seen a huge amount of text that allows it to recognize answers to questions. Unfortunately, it appears to work so people see it as the equivalent to sci-fi movie AI.

It’s really just a search engine.

snickerbockers · 2025-11-15T04:51:21 1763182281

I agree and that's the case I'm trying to make. The machine-learning community expects us to believe that it is somehow comparable to human cognition, yet the way it learns is inherently inhuman. If an LLM was in any way similar to a human I would expect that, like a human, it might require a little bit of guidance as it learns but ultimately it would be capable of understanding concepts well enough that it doesn't need to have memorized every book in the library just to perform simple tasks.

In fact, I would expect it to be able to reproduce past human discoveries it hasn't even been exposed to, and if the AI is actually capable of this then it should be possible for them to set up a controlled experiment wherein it is given a limited "education" and must discover something already known to the researchers but not the machine. That nobody has done this tells me that either they have low confidence in the AI despite their bravado, or that they already have tried it and the machine failed.

throwaway173738 · 2025-11-15T05:25:31 1763184331

There’s a third possible reason which is that they’re taking it as a given that the machine is “intelligent” as a sales tactic, and they’re not academic enough to want to test anything they believe.

ezst · 2025-11-15T05:20:48 1763184048

> The machine-learning community

Is it? I only see a few individuals, VCs, and tech giants overblowing LLMs capabilities (and still puzzled as to how the latter dragged themselves into a race to the bottom through it). I don't believe the academic field really is that impressed with LLMs.

ninetyninenine · 2025-11-15T05:13:55 1763183635

no it's not I work on AI and what these things do are much much more then a search engine or an autocomplete. If an autocomplete passed the turing test you'd dismiss it because it's still an autocomplete.

The characterization you are regurgitating here is from laymen who do not understand AI. You are not just mildly wrong but wildly uninformed.

versteegen · 2025-11-15T13:23:52 1763213032

Well, I also work on AI, and I completely agree with you. But I've reached the point of thinking it's hopeless to argue with people about this: It seems that as LLMs become ever better people aren't going to change their opinions, as I had expected. If you don't have good awareness of how human cognition actually works, then it's not evidently contradictory to think that even a superintelligent LLM trained on all human knowledge is just pattern matching and that humans are not. Creativity, understanding, originality, intent, etc, can all be placed into a largely self-consistent framework of human specialness.

MangoToupe · 2025-11-15T13:08:31 1763212111

To be fair, it's not clear human intelligence is much more than search or autocomplete. The only thing that's clear here is that LLMs can't reproduce it.

ninetyninenine · 2025-11-15T13:13:47 1763212427

Yes but colloquially this characterization you see used by laymen is deliberately used to deride AI and dismiss it. It is not honest about the on the ground progress AI has made and it’s not intellectual honest about the capabilities and weaknesses of Ai.

MangoToupe · 2025-11-15T13:35:22 1763213722

I disagree. The actual capabilities of LLMs remain unclear, and there's a great deal of reasons to be suspicious of anyone whose paycheck relies on pimping them.

ninetyninenine · 2025-11-15T13:41:35 1763214095

The capabilities of LLMs are unclear but it is clear that they are not just search engines or autocompletes or stochastic parrots.

You can disagree. But this is not an opinion. You are factually wrong if you disagree. And by that I mean you don’t know what you’re talking about and you are completely misinformed and lack knowledge.

The long term outcome if I’m right is that AI abilities continue to grow and it basically destroys my career and yours completely. I stand not to benefit from this reality and I state it because it is reality. LLMs improve every month. It’s already to the point of where if you’re not vibe coding you’re behind.

MangoToupe · 2025-11-16T11:47:51 1763293671

> It’s already to the point of where if you’re not vibe coding you’re behind.

I like being productive, not babysitting a semi-literate program incapable of learning

ninetyninenine · 2025-11-16T16:58:03 1763312283

Let me be utterly clear. People with your level of programming skill who incorporate AI into their workflow are in general significantly more productive than you. You are a less productive, less effective programmer if you are not using AI. That is a fundamental fact. And all of this was not true a year ago.

Again if you don’t agree then you are lost and uninformed. There are special cases where there are projects where human coding is faster but that is a minority.

MangoToupe · 2025-11-17T06:17:17 1763360237

fragmede · 2025-11-15T04:31:29 1763181089

Isn't that what's going on with synthetic data? The LLM is trained, then is used to generate data that gets put into the training set, and then gets further trained on that generated data?

BirAdam · 2025-11-15T03:24:24 1763177064

You didn’t have to read the whole library because your brain has been absorbing knowledge from multiple inputs your entire life. AI systems are trying to temporally compress a lifetime into the time of training. Then, given that these systems have effectively a single input method of streams of bits, they need immense amounts of it to be knowledgeable at all.

ninetyninenine · 2025-11-15T05:14:57 1763183697

>I've certainly not seen any evidence of an AI generating something not based on its training set.

There is plenty of evidence for this. You have to be blind not to realize this. Just ask the AI to generate something not in it's training set.

gf000 · 2025-11-15T17:12:03 1763226723

Like the seahorse emoji?

taneq · 2025-11-15T12:27:00 1763209620

OK, but by that definition, how many human software developers ever develop something "novel"? Of course, the "functionally equivalent" term is doing a lot of heavy lifting here: How equivalent? How many differences are required to qualify as different? How many similarities are required to qualify as similar? Which one overrules the other? If I write an app that's identical to Excel in every single aspect except that instead of a Microsoft Flight Simulator easter egg, there's a different, unique, fully playable game that can't be summed up with any combination of genre lables, is that 'novel'?

gf000 · 2025-11-15T17:14:50 1763226890

I think the importance is the ability. Not every human have produced (or even can) something novel in their life, but there are humans who have time after time.

Meanwhile, depending on how you rate LLM's capabilities, no matter how many trials you give it, it may not be considered capable of that.

That's a very important distinction.

terminalshort · 2025-11-15T03:30:49 1763177449

If a LLM had written Linux, people would be saying that it isn't novel because it's just based on previous OS's. There is no standard here, only bias.

jofla_net · 2025-11-15T16:28:37 1763224117

Cept its not made Linux (in the absence of it).

At any point prior to the final output it can garner huge starting point bias from ingested reference material. This can be up to and including whole solutions to the original prompt minus some derivations. This is effectively akin to cheating for humans as we cant bring notes to the exam. Since we do not have a complete picture of where every part of the output comes from we are at a loss to explain if it indeed invented it or not. The onus is and should be on the applicant to ensure that the output wasn't copied (show your work), not on the graders to prove that it wasn't copied. No less than what would be required if it was a human. Ultimately it boils down to what it means to 'know' something, whether a photographic memory is, in fact, knowing something, or rather derivations based on other messy forms of symbolism. It is nevertheless a huge argument as both sides have a mountain of bias in either directions.

jstummbillig · 2025-11-15T18:26:08 1763231168

> Cept its not made Linux (in the absence of it).

Neither did you (or I). Did you create anything that you are certain your peers would recognize as more "novel" than anything a LLM could produce?

snickerbockers · 2025-11-15T21:17:21 1763241441

>Neither did you (or I).

Not that specifically but I certainly have the capability to create my own OS without having to refer to the source code of existing operating systems. Literally "creating a linux" is a bit on the impossible side because it implies compatibility with an existing kernel despite the constraints prohibiting me from referring to the source of that existing kernel (maybe possible if i had some clean-room RE team that would read through the source and create a list of requirements without including any source).

If we're all on the same page regarding the origins of human intelligence (ie, that it does not begin with satan tricking adam and eve into eating the fruit of a tree they were specifically instructed not to touch) then it necessarily follows that any idea or concept was new at some point and had to be developed by somebody who didn't already have an entire library of books explaining the solution at his disposal.

For the Linux thought-experiment you could maybe argue that Linux isn't totally novel since its creator was intentionally mimicking behavior of an existing well-known operating system (also iirc he had access to the minix source) and maybe you could even argue that those predecessors stood on the shoulders of their own proverbial giants, but if we keep kicking the ball down the road eventually we reach a point where somebody had an idea which was not in any way inspired by somebody else's existing idea.

The argument I want to make is not that humans never create derivative or unoriginal works (that obviously cannot be true) but that humans have the capability to create new things. I'm not convinced that LLMs have that same capability; maybe I'm wrong but I'm still waiting to see evidence of them discovering something new. As I said in another post, this could easily be demonstrated with a controlled experiment in which the model is bootstrapped with a basic yet intentionally-limited "education" and then tasked with discovering something already known to the experimenters which was not in its training set.

>Did you create anything that you are certain your peers would recognize as more "novel" than anything a LLM could produce?

Yes, I have definitely created things without first reading every book in the library and memorizing thousands of existing functionally-equivalent solutions to the same problem. So have you so long as I'm not actually debating an LLM right now.

baq · 2025-11-15T10:09:00 1763201340

If the model can map an unseen problem to something in its latent space, solve it there, map back and deliver an ultimately correct solution, is it novel? Genuine question, ‘novel’ doesn’t seem to have a universally accepted definition here

gf000 · 2025-11-15T17:21:21 1763227281

Good question, though I would say that there may be different grades of novelty.

One grade might be your example, while something like Gödel's incompleteness theorems or Einstein's relativity could go into a different grade.

visarga · 2025-11-15T06:53:54 1763189634

> For example, I can't wrap my head around how a) a human could come up with a piece of writing that inarguably reads "novel" writing, while b) an AI could be guaranteed to not be able to do the same, under the same standard.

The secret ingredient is the world outside, and past experiences from the world, which are unique for each human. We stumble onto novelty in the environment. But AI can do that too - move 37 AlphaGo is an example, much stumbling around leads to discoveries even for AI. The environment is the key.

QuadmasterXLII · 2025-11-15T02:12:07 1763172727

A system of humans creates bona fide novel writing. We don’t know which human is responsible for the novelty in homoerotic fanfiction of the Odyssey, but it wasn’t a lizard. LLMs don’t have this system-of-thinkers bootstrapping effect yet, or if they do it requires an absolutely enormous boost to get going

testaccount28 · 2025-11-14T23:43:15 1763163795

why would you admit on the internet that you fail the reverse turing test?

mikestorrent · 2025-11-15T02:00:41 1763172041

Didn't some fake AI country song just get on the top 100? How novel is novel? A lot of human artists aren't producing anything _novel_.

magicalist · 2025-11-15T03:16:11 1763176571

> Didn't some fake AI country song just get on the top 100?

No

Edit: to be less snarky, it topped the Billboard Country Digital Song Sales Chart, which is a measure of sales of the individual song, not streaming listens. It's estimated it takes a few thousand sales to top that particular chart and it's widely believed to be commonly manipulated by coordinated purchases.

terminalshort · 2025-11-15T03:31:40 1763177500

It was a real AI country song, not a fake one, but yes.

CamperBob2 · 2025-11-15T00:07:37 1763165257

You have no idea if you're talking to an LLM or a human, yourself, so ... uh, wait, neither do I.

greygoo222 · 2025-11-15T00:42:23 1763167343

Because I'm an LLM and you are too

fragmede · 2025-11-14T23:46:50 1763164010

Because not everyone here has a raging ego and no humility?

kazinator · 2025-11-15T03:06:55 1763176015

Because we know that the human only read, say, fifty books since they were born, and watched a few thousand videos, and there is nothing in them which resembles what they wrote.