登录 注册
登录 注册
以模型为基础的加固学习
Focal plane wavefront control with model-based reinforcement learning
👁 10 📚 0
结构化知识和数据:财务应用的统一框架
Bridging Structured Knowledge and Data: A Unified Framework with Finance Applications
👁 10 📚 0
电话用户是否尊重你的隐私?
Do Phone-Use Agents Respect Your Privacy?
👁 12 📚 0
基于流动的政策与分布式强化学习 轨迹优化
Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization
👁 9 📚 0
在正加权限制的Boltzmann机器中快速混合
Rapid mixing in positively weighted restricted Boltzmann machines
👁 12 📚 0
机器人操纵混合框架:整合强化学习和大语言模式
Hybrid Framework for Robotic Manipulation: Integrating Reinforcement Learning and Large Language Mod...
👁 13 📚 0
使用以变换器为基础的源代码表示符自动识别可平行循环
Automatic Identification of Parallelizable Loops Using Transformer-Based Source Code Representations
👁 11 📚 0
相通、正统或冲突:何时可以安全优化思维链?
Aligned, Orthogonal or In-conflict: When can we safely optimize Chain-of-Thought?
👁 11 📚 0
塔克关注:大致关注机制的概括
Tucker Attention: A generalization of approximate attention mechanisms
👁 12 📚 0
三角认知架构:通过斯帕提奥-时空和Epistemic Friction来构建自主行动
The Triadic Cognitive Architecture: Bounding Autonomous Action via Spatio-Temporal and Epistemic Fri...
👁 10 📚 0
颗粒计数中的不显眼模糊:RNA-seq数据的案例
Non-ignorable fuzziness in granular counts: the case of RNA-seq data
👁 133 📚 10
HeMiTo动力学中的米相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相相...
Analytical characterisation of the Mi- and To-phases in HeMiTo dynamics: exponential growth and logi...
👁 103 📚 21
无限地平线优化控制有延迟的前进-后起倒置伏地平线方程式
Infinite Horizon Optimal Control of Forward-Backward Stochastic Volterra Equations with Delay
👁 157 📚 14
在具有短范围互动的上下文敏感随机语言模型上进行阶段过渡
Phase transition on a context-sensitive random language model with short range interactions
👁 188 📚 5
结构化知识和数据:财务应用的统一框架
Bridging Structured Knowledge and Data: A Unified Framework with Finance Applications
👁 85 📚 5
OmniRoam:通过长视全景视频生成世界漫游
OmniRoam: World Wandering via Long-Horizon Panoramic Video Generation
👁 66 📚 14
基于奖励的在线LLM 通过神经UCB运行
Reward-Based Online LLM Routing via NeuralUCB
👁 72 📚 27
超越Beta Lorenz曲线:贫穷和不平等估计的新参数家庭
Beyond the Beta Lorenz Curve: A New Parametric Family for Poverty and Inequality Estimation
👁 94 📚 15
从多元无知到共同的社会保障合同知识
From Pluralistic Ignorance to Common Knowledge with Social Assurance Contracts
👁 141 📚 22
自动市场制造器的选项定价
Option Pricing on Automated Market Maker Tokens
👁 73 📚 13
🌊