KI News: Kurz und klar.

Anmelden

Resurrecting the Salmon: Rethinking Mechanistic Interpretability with Domain-Specific Sparse Autoencoders

arXiv – cs.LG • 14.08.2025 05:00 • Original

#Sparse-Autoencoder #Großsprachmodell #latente Merkmale #Medizinischer Text #JumpReLU #Interpretierbarkeit #Rekonstruktionsgenauigkeit

Anzeige

Ähnliche Artikel

arXiv – cs.LG • 07.11.2025 05:00

MetaTree: Skalierbares Meta-Lernen von Entscheidungsbäumen mit synthetischen Daten

arXiv – cs.LG • 05.11.2025 05:00

ProtoTSNet: Interpretable Multivariate Time Series Classification With Prototypical Parts

arXiv – cs.AI • 03.11.2025 05:00

Validity Is What You Need

VentureBeat – AI • 29.10.2025 17:00

Anthropic scientists hacked Claude’s brain — and it noticed. Here’s why that’s huge

arXiv – cs.LG • 29.10.2025 04:00

Learning Interpretable Features in Audio Latent Spaces via Sparse Autoencoders

arXiv – cs.AI • 29.10.2025 04:00

Latent Chain-of-Thought for Visual Reasoning