登录 注册
登录 注册
用信号时空逻辑进行强化学习
Stratifying Reinforcement Learning with Signal Temporal Logic
👁 159 📚 3
与晚期世界模式的分级规划
Hierarchical Planning with Latent World Models
👁 117 📚 11
平滑地平面:通过扩散去化目标进行原因结构学习
Smoothing the Landscape: Causal Structure Learning via Diffusion Denoising Objectives
👁 155 📚 26
B. 背景强化:高效理性任务扩展法
Batched Contextual Reinforcement: A Task-Scaling Law for Efficient Reasoning
👁 112 📚 27
食谱比厨房更重要:AI天气预测管道的数学基础
The Recipe Matters More Than the Kitchen:Mathematical Foundations of the AI Weather Prediction Pipel...
👁 100 📚 9
在具有短范围互动的上下文敏感随机语言模型上进行阶段过渡
Phase transition on a context-sensitive random language model with short range interactions
👁 185 📚 5
结构化知识和数据:财务应用的统一框架
Bridging Structured Knowledge and Data: A Unified Framework with Finance Applications
👁 82 📚 5
实例-最佳花纹凸起优化:我们能够改进样本平均和坚固的花纹凸起近似性吗?
Instance-optimal stochastic convex optimization: Can we improve upon sample-average and robust stoch...
👁 155 📚 27
无需硬负值:概念百分点学习导致组成,而不降低相冲突模型的零射能
No Hard Negatives Required: Concept Centric Learning Leads to Compositionality without Degrading Zer...
👁 200 📚 19
应请求VISion:通过稀少、动态选择和视觉语言互动,提高VLLM的效率
VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language inter...
👁 222 📚 21
统一调教和低级教育的端到端培训
End-to-End Training for Unified Tokenization and Latent Denoising
👁 93 📚 30
加权网络的结构浓度:一类地形学-智能索引
Structural Concentration in Weighted Networks: A Class of Topology-Aware Indices
👁 119 📚 13
CRPS-Optimal Binning for Conformal 回归 (Regression)
CRPS-Optimal Binning for Conformal Regression
👁 87 📚 3
🌊