I'm a little puzzled by the salience being given to the Apple conclusions on #LLM #reasoning when we have plenty of prior art. For example: LLMs cannot correctly infer "B is A" if their corpora only contain "A is B". #Paper: arxiv.org/abs/2309.12288
The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"
We expose a surprising failure of generalization in auto-regressive large language models (LLMs). If a model is trained on a sentence of the form "A is B", it will not automatically generalize to the reverse direction "B is A". (arXiv.org)
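To make the failure mode concrete, a minimal probe might look like the sketch below. `query_model` is a hypothetical placeholder for whatever LLM client you use, and the Tom Cruise / Mary Lee Pfeiffer pair is one of the celebrity-parent examples discussed in the paper.

```python
# Minimal sketch of a Reversal Curse probe. `query_model` is a
# hypothetical stand-in; swap in your own LLM client.
def query_model(prompt: str) -> str:
    """Placeholder for an actual LLM call; returns a dummy answer."""
    return "<model answer>"

# Forward direction: the fact is likely stated this way in the corpus.
forward = query_model("Who is Tom Cruise's mother?")

# Reverse direction: the same fact with subject and object swapped;
# this is where the paper reports models frequently failing.
reverse = query_model("Who is Mary Lee Pfeiffer's son?")

print("forward:", forward)  # expected: Mary Lee Pfeiffer
print("reverse:", reverse)  # per the paper, often wrong or unknown
```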
modulux, in reply to mnl:
I tend to think so too. I suppose it shouldn't really surprise me, but I expected a bit more critical engagement.
There's lots of evidence on the limits of LLM reasoning, as well as on pretty basic self-reference and so on ("how many letters 'l' does this statement include?"). And yes, "reasoning" is being used rather technically here, in the sense of deductive inference.
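For what it's worth, the ground truth for that self-reference probe is trivial to compute outside the model; a couple of lines of plain Python (nothing model-specific, just the sentence quoted above) settle it:

```python
# Count the letter 'l' in the self-referential question itself.
statement = "How many letters l does this statement include?"
print(statement.lower().count("l"))  # 3 for this sentence
```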