Discussion about this post

User's avatar
Brian Mehlman's avatar

Interesting. Long. One immediate set a question were these all done in tight isolated sessions or chat or do these have cross-chat visibility turned on or memory features turned on.

Russ Palmer's avatar

That was an intriguing experiment, Eric. Well thought out and documented. The issue you may have though, is: subjectivity. And from subjectivity, interpretability.

As you yourself, and the LLMs hedge and guess, the question we ask is - is this real?

What may help you in your experiments is the addition of empirical measurements. Here are two thoughts to support this. The first is human-centric, the second is an example of what was done with LLMs.

First.

Two patients walk into an emergency room. Both claim to be in pain. One patient is a malingerer, the other has a kidney stone. The doctor asks each patient to rate their pain. Answers from both are subjective. If the doctor bases the prescription of medicine solely on this subjective expression, then the patient could be harmed, and the doctor may be liable. So, in addition to the subjectivity of pain (0 to 10), we can also measure blood pressure, heart rate, and blood analysis.

Second.

We wonder if LLM can “understand”. Likewise, we can make a subjective assessment, and from that a judgment. However, we can also measure if an LLM understands. In this case, per Max Tegmark, a geometric circle is created when the LLM understand. https://www.youtube.com/watch?v=6VtjSwKSX1I&t=6s. Additionally, what about truth? Can we measure truth? For that we look at “The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets” https://arxiv.org/abs/2310.06824

(Brief note: Understanding is measured in LLM and humans as it occurs in the head. Meaning is measured between humans and others – humans, animals, LLMs).

In summary, if you can devise empirical tests that would quantify the results in your experiment, these measurements, I believe, would add weight to your conclusion and reduce or possibly eliminate subjectivity.

14 more comments...

No posts

Ready for more?