登录 注册
登录 注册
EMO: Pretraining Mixture of Experts for Emergent Modularity
👁 192 📚 23
Implicit Representations of Grammaticality in Language 模型 (Model)s
Implicit Representations of Grammaticality in Language Models
👁 105 📚 3
Safety and accuracy follow different scaling laws in clinical large language models
👁 72 📚 28
FlexSQL: Flexible Exploration and Execution Make Better Text-to-SQL Agents
👁 194 📚 20
When LLMs Stop Following Steps: A Diagnostic Study of Procedural Execution in Language 模型 (Model)s
When LLMs Stop Following Steps: A Diagnostic Study of Procedural Execution in Language Models
👁 127 📚 26
On the Proper Treatment of Units in Surprisal Theory
👁 141 📚 15
Exploration Hacking: Can LLMs Learn to Resist RL Training?
👁 96 📚 7
Select to Think: Unlocking SLM Potential with Local Sufficiency
👁 63 📚 9
DV-World:真实世界情景中数据可视化代理的基准化
DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios
👁 174 📚 23
通过多任务BILSTM和自动ML基准制定对印度尼西亚电子商务的感知和情感分类
Sentiment and Emotion Classification of Indonesian E-Commerce Reviews via Multi-Task BiLSTM and Auto...
👁 103 📚 24
AI探员怎么花你的钱? 在代理编码任务中分析和预测托肯消费
How Do AI Agents Spend Your Money? Analyzing and Predicting Token Consumption in Agentic Coding Task...
👁 113 📚 3
当提示覆盖视野: LVLMs 的提示诱发幻觉时
When Prompts Override Vision: Prompt-Induced Hallucinations in LVLMs
👁 199 📚 18
MathDuels:将LLMs评价为问题概率和解决方案
MathDuels: Evaluating LLMs as Problem Posers and Solvers
👁 107 📚 25
对使用基因大语言模型进行自动语音识别的评价
Evaluation of Automatic Speech Recognition Using Generative Large Language Models
👁 49 📚 12
SpeechParaling-Bench:辅助语言学-助词生成综合基准
SpeechParaling-Bench: A Comprehensive Benchmark for Paralinguistic-Aware Speech Generation
👁 63 📚 20
发现共享逻辑子空间:通过对接自然语言和符号视图引导 LLM 逻辑理性
Discovering a Shared Logical Subspace: Steering LLM Logical Reasoning via Alignment of Natural-Langu...
👁 122 📚 4
塞萨:有选择的州际空间注意
Sessa: Selective State Space Attention
👁 129 📚 29
为非正式定理演示学习透视理性
Learning to Reason with Insight for Informal Theorem Proving
👁 152 📚 30
库埃瓦尔:在社会困境中确定合作基准-维持机制和LLM代理人
CoopEval: Benchmarking Cooperation-Sustaining Mechanisms and LLM Agents in Social Dilemmas
👁 128 📚 7
MM- Web Agent: 用于网页生成的分级多式网络代理
MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation
👁 43 📚 24
🌊