KI News: Kurz und klar.

Anmelden

IMPQ: Interaction-Aware Layerwise Mixed Precision Quantization for LLMs

arXiv – cs.LG • 22.09.2025 05:00 • Original

#LLM #mixed-precision quantization #Shapley #Interaction-aware #PTQ #LLaMA-3 #Gemma-2 #Qwen-3

Anzeige

Ähnliche Artikel

arXiv – cs.LG • 13.01.2026 05:00

Kommunikation im latenten Raum durch K‑V‑Cache‑Ausrichtung

arXiv – cs.LG • 03.09.2025 05:00

ZeroQAT: Quantisierung ohne Backpropagation – effizient und präzise

arXiv – cs.LG • 03.02.2026 05:00

ELLMPEG: Lokale KI-gestützte Videobearbeitung ohne Cloud-API

arXiv – cs.AI • 03.02.2026 05:00

Neues Tool PCBSchemaGen: LLM-gesteuertes PCB-Schemadesign mit Constraints

arXiv – cs.LG • 03.02.2026 05:00

RAPTOR: Neue Ridge-Logistikprobe verbessert Konzept-Analyse in LLMs

arXiv – cs.AI • 03.02.2026 05:00

LLMs im Pokerspiel: Noch weit von Profis entfernt – ToolPoker setzt neue Maßstäbe