by Henry | Jul 24, 2024 | entropy, interpretability, publications
Representations as Language An Information Theoretic Framework for Interpretability This paper appeared as a talk at the International Meeting of the Cognitive Science Society in 2024. view paper view code Abstract Large scale neural models show impressive performance...
by Henry | Jul 23, 2023 | entropy, interpretability
Anaphoric Structures Emerge Between Neural Networks Without explicit efficiency pressures. This paper appeared at the International Meeting of the Cognitive Science Society in 2023. view paper Abstract Anaphors are ubiquitous in human language; structures like...
by Henry | May 1, 2023 | entropy, interpretability, publications
Compositionality With Variation Reliably Emerges in Neural Networks Emergent representations in a multi-agent model are rife with the kinds of variation ubiquitous across natural languages. This paper appeared at ICLR 2024. view paper Abstract We re-evaluated how to...