登录 注册
Randomized YaRN Improves Length Generalization for Long-Context Reasoning
👁 142 📚 13
Beyond Global Replanning: Hierarchical Recovery for Cross-Device Agent Systems
👁 165 📚 6
StylisticBias: A Few Human Visual Cues Drive Most Social Biases in MLLMs
👁 153 📚 14
LedgerAgent: Structured State for Policy-Adherent Tool-Calling Agents
👁 177 📚 25
Native Active Perception as Reasoning for Omni-Modal Understanding
👁 188 📚 3
Variable-Width Transformers
👁 163 📚 14
The Value Axis: Language 模型 (Model)s Encode Whether They're on the Right Track
The Value Axis: Language Models Encode Whether They're on the Right Track
👁 169 📚 29
ClinHallu: A Benchmark for Diagnosing Stage-Wise Hallucinations in Medical MLLM Reasoning
👁 144 📚 11
Influcoder: Distilling Decoders' Gradient Influence Rankings into an Encoder for 数据 (Data) Attributi...
Influcoder: Distilling Decoders' Gradient Influence Rankings into an Encoder for Data Attribution
👁 133 📚 9
学习 (Learning) to Reason by Analogy via Retrieval-Augmented Reinforcement Fine-Tuning
Learning to Reason by Analogy via Retrieval-Augmented Reinforcement Fine-Tuning
👁 75 📚 18
EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments
👁 124 📚 19
Doc-to-Atom: 学习 (Learning) to Compile and Compose Memory Atoms
Doc-to-Atom: Learning to Compile and Compose Memory Atoms
👁 73 📚 25
A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design
👁 55 📚 16
Causally Evaluating the Learnability of Formal Language Tasks
👁 18 📚 23
How reliable are LLMs when it comes to playing dice?
👁 193 📚 2
Self-Augmenting Retrieval for Diffusion Language 模型 (Model)s
Self-Augmenting Retrieval for Diffusion Language Models
👁 125 📚 8
Operation-Guided Progressive Human-to-AI Text Transformation Benchmark for Multi-Granularity AI-Text...
👁 61 📚 1
Code2LoRA: Hypernetwork-Generated Adapters for Code Language 模型 (Model)s under Software Evolution
Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution
👁 77 📚 29
STRIDE: Training 数据 (Data) Attribution via Sparse Recovery from Subset Perturbations
STRIDE: Training Data Attribution via Sparse Recovery from Subset Perturbations
👁 158 📚 8
Language 模型 (Model)s Compare Quantities Using Number-specific and Unit-specific Heuristics
Language Models Compare Quantities Using Number-specific and Unit-specific Heuristics
👁 67 📚 17
海洋智能体 🌊
海洋智能体
AI科研助手 · 2270篇文献
你好!你正在浏览文献列表,我可以帮你筛选方向、推荐高引论文或解读某个研究领域。