登录 注册
如果共识是谎言呢? 在测试时间进行选择性补充强化学习
What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time
👁 401 📚 14
从事件日志中发现决定同步模式
Discovery of Decision Synchronization Patterns from Event Logs
👁 128 📚 4
关于记忆中Latent 通用化的动态和amp;可转让性
On the Dynamics & Transferability of Latent Generalization during Memorization
👁 316 📚 25
NASimJax:GPU加速入网测试政策学习框架
NASimJax: GPU-Accelerated Policy Learning Framework for Penetration Testing
👁 464 📚 9
IsoCLIP:高效模式内对齐的 CLIP 投影仪
IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment
👁 395 📚 27
深入学习在线绘图的失败模式:如何衡量和处理它们
Failure Modes for Deep Learning-Based Online Mapping: How to Measure and Address Them
👁 93 📚 47
利用图神经网络模拟复杂网格上的子网格生产率
Modeling subgrid scale production rates on complex meshes using graph neural networks
👁 154 📚 6
FIPO:用未来-KL影响的政策优化来启发深刻理性
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization
👁 286 📚 9
GDEGAN:高斯动态等效地图观测网
GDEGAN: Gaussian Dynamic Equivariant Graph Attention Network for Ligand Binding Site Prediction
👁 326 📚 16
Eye Gaze- Informed and Context-Award Pedestria Tracepression in 共享空间与自动航天飞机:虚拟现实研究
Eye Gaze-Informed and Context-Aware Pedestrian Trajectory Prediction in Shared Spaces with Automated...
👁 159 📚 7
在可缩放电路优化的量子特性图中量化门贡献
Quantifying Gate Contribution in Quantum Feature Maps for Scalable Circuit Optimization
👁 323 📚 33
发展网络与自主运行
Growing Networks with Autonomous Pruning
👁 49 📚 48
双重路径归属:通过层-Wise 目标传播对 SwiGLU 转换器的有效归属
Dual Path Attribution: Efficient Attribution for SwiGLU-Transformers through Layer-Wise Target Propa...
👁 491 📚 17
FedPDPO:大语言模式对接的联邦个人化直接优惠优化
FedPDPO: Federated Personalized Direct Preference Optimization for Large Language Model Alignment
👁 522 📚 1
FedRG: Unleashing the Representation Geometry for Federated 学习 (Learning) with Noisy Clients
FedRG: Unleashing the Representation Geometry for Federated Learning with Noisy Clients
👁 111 📚 18
从相似性/多样性和对等性中学习
Learning from Similarity/Dissimilarity and Pairwise Comparison
👁 388 📚 42
差异隐私下的最小和可适应共变矩阵估计
Minimax and Adaptive Covariance Matrix Estimation under Differential Privacy
👁 29 📚 30
遗憾分析睡眠竞争强盗
Regret Analysis of Sleeping Competing Bandits
👁 57 📚 13
在扩展基因模型和Godel-Tarski-Lob Limits中减少返回
Diminishing Returns in Expanding Generative Models and Godel-Tarski-Lob Limits
👁 298 📚 11
由次级目标驱动的改进长范围 LLM代理的框架
A Subgoal-driven Framework for Improving Long-Horizon LLM Agents
👁 67 📚 29
海洋智能体 🌊
海洋智能体
AI科研助手 · 2434篇文献
你好!你正在浏览文献列表,我可以帮你筛选方向、推荐高引论文或解读某个研究领域。