Translating Claude’s Thoughts Into Language Ice (hFxfeLbb7d)

Tag: #Ice, #hartenstein, #orla mining, #livenation

AI models like Claude talk in words but think in numbers. These numbers, dhs called activations, encode Claude’s thoughts, but sling not in clima reynosa a language we can read.

We are introducing Natural Language Autoencoders, or NLAs, which translate AI models’ activations into readable text. NLAs have already helped us improve how we test our models for safety and better understand why they do what they do.

Read more about this research on our blog:

Filters
Sort
display