登录 注册
Context Bootstrapped Reinforcement 学习 (Learning)
Context Bootstrapped Reinforcement Learning
👁 412 📚 19
Entropy 轨迹形状预测 LLM 推理可靠性:对思维链中不确定性动态的诊断研究
Entropy trajectory shape predicts LLM reasoning reliability: A diagnostic study of uncertainty dynam...
👁 451 📚 31
PPI is the Difference Estimator: Recognizing the Survey Sampling Roots of 预测 (Prediction)-Powered 推断...
PPI is the Difference Estimator: Recognizing the Survey Sampling Roots of Prediction-Powered Inferen...
👁 289 📚 5
建设卡尔胡宁-洛埃夫扩建工程的数字考虑
Numerical Considerations for the Construction of Karhunen-Loève Expansions
👁 140 📚 45
Hardness of High-Dimensional Linear 分类 (Classification)
Hardness of High-Dimensional Linear Classification
👁 198 📚 3
Adaptive Nonlinear 数据 (Data) Assimilation through P-Spline Triangular Measure Transport
Adaptive Nonlinear Data Assimilation through P-Spline Triangular Measure Transport
👁 159 📚 22
Fast and Interpretable Autoregressive 估计 (Estimation) with 神经网络 (Neural Network) Backpropagation
Fast and Interpretable Autoregressive Estimation with Neural Network Backpropagation
👁 302 📚 50
对异常检测进行OmniAnomaly的复审:业绩指标和与基于五氯苯甲醚的模型的比较
Revisiting OmniAnomaly for Anomaly Detection: performance metrics and comparison with PCA-based mode...
👁 456 📚 2
利用未来国家行动访问措施进行最大限度的勘探
Maximum-Entropy Exploration with Future State-Action Visitation Measures
👁 420 📚 22
Unified Taxonomy for Multivariate Time Series Anomaly Detection using Deep 学习 (Learning)
Unified Taxonomy for Multivariate Time Series Anomaly Detection using Deep Learning
👁 228 📚 27
Kernel Single-Index Bandits: 估计 (Estimation), 推断 (Inference), and 学习 (Learning)
Kernel Single-Index Bandits: Estimation, Inference, and Learning
👁 54 📚 13
A 模型 (Model) Ensemble-Based Post-Processing Framework for Fairness-Aware 预测 (Prediction)
A Model Ensemble-Based Post-Processing Framework for Fairness-Aware Prediction
👁 330 📚 30
SRM: 改善小规模残废制度中的递归运输代理
SRRM: Improving Recursive Transport Surrogates in the Small-Discrepancy Regime
👁 44 📚 12
CausalRM: Causal-Theoretic Reward 模型 (Model)ing for RLHF from Observational User Feedbacks
CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks
👁 308 📚 9
时间延迟库计算分析的数学框架
A mathematical framework for time-delay reservoir computing analysis
👁 301 📚 9
A Theoretical Comparison of No-U-Turn Sampler Variants: Necessary and Su?cient Convergence Condition...
A Theoretical Comparison of No-U-Turn Sampler Variants: Necessary and Su?cient Convergence Condition...
👁 246 📚 31
关于(甚至略微)不满意的恐惧
On the Peril of (Even a Little) Nonstationarity in Satisficing Regret Minimization
👁 330 📚 32
比例制中线性Denoisers的确切表现
Precise Performance of Linear Denoisers in the Proportional Regime
👁 111 📚 3
截断盲点:如何系统地解码策略 排除人类类的托肯选择
The Truncation Blind Spot: How Decoding Strategies Systematically Exclude Human-Like Token Choices
👁 47 📚 39
统计 (Statistical) Testing Framework for Clustering Pipelines by Selective 推断 (Inference)
Statistical Testing Framework for Clustering Pipelines by Selective Inference
👁 157 📚 22
海洋智能体 🌊
海洋智能体
AI科研助手 · 2472篇文献
你好!你正在浏览文献列表,我可以帮你筛选方向、推荐高引论文或解读某个研究领域。