登录 注册
Gen-Searcher: 强化代理搜索图像生成
Gen-Searcher: Reinforcing Agentic Search for Image Generation
👁 18 📚 0
手X: 放大双人动作和交互生成
HandX: Scaling Bimanual Motion and Interaction Generation
👁 18 📚 0
PoseDreamer:可伸缩和相片现实化的人类数据生成管道与分流模型
PoseDreamer: Scalable and Photorealistic Human Data Generation Pipeline with Diffusion Models
👁 16 📚 0
扩散变形器中丰富多样性背景空间的飞上反推
On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers
👁 17 📚 0
SHOW3D:在野外捕捉到3D手和物体的画面
SHOW3D: Capturing Scenes of 3D Hands and Objects in the Wild
👁 12 📚 0
FlowIt:全球对光学流的匹配与信心指导的完善
FlowIt: Global Matching for Optical Flow with Confidence-Guided Refinement
👁 11 📚 0
SonoWorld:从一幅图像到三维视听场景
SonoWorld: From One Image to a 3D Audio-Visual Scene
👁 14 📚 0
Pandora: 由 Egocentral Vision 绘制的 三维场景图
Pandora: Articulated 3D Scene Graphs from Egocentric Vision
👁 11 📚 0
SOLE-R1: 视频-语言学因子作为On-Robot强化学习的单一奖励
SOLE-R1: Video-Language Reasoning as the Sole Reward for On-Robot Reinforcement Learning
👁 13 📚 0
关于流动匹配模式的 GRPO 逐步信用分配
Stepwise Credit Assignment for GRPO on Flow-Matching Models
👁 11 📚 0
DreamLite:一个轻量级的图像生成和编辑统一模型
DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing
👁 10 📚 0
适应Token:基于 Entropy 的适应Token 选择 MLLM 长视频理解
AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding
👁 10 📚 0
为什么总体准确性不足以评估执法部门承认系统中的公平性
Why Aggregate Accuracy is Inadequate for Evaluating Fairness in Law Enforcement Facial Recognition S...
👁 13 📚 0
利用合成数据进行Sim-to-Real水果检测:定量评价和与Isaac Sim嵌入式部署
Sim-to-Real Fruit Detection Using Synthetic Data: Quantitative Evaluation and Embedded Deployment wi...
👁 11 📚 0
工业3D:工业基础设施地面LiDAR点云数据集和跨Paradigm基准
Industrial3D: A Terrestrial LiDAR Point Cloud Dataset and CrossParadigm Benchmark for Industrial Inf...
👁 12 📚 0
ParaSpeechCLAP:富士语-声优预训的双编码语音-文本模型
ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining
👁 11 📚 0
RAD-AI:重新思考AI增强生态系统的结构文件
RAD-AI: Rethinking Architecture Documentation for AI-Augmented Ecosystems
👁 12 📚 0
SAGAI-MID: 动态运行时互操作性的基因化 AI-Driven 中间软件
SAGAI-MID: A Generative AI-Driven Middleware for Dynamic Runtime Interoperability
👁 10 📚 0
动能双相驱动能力库
Dynamic Dual-Granularity Skill Bank for Agentic RL
👁 11 📚 0
A Convex 通向热力学的路径:学习内能和消散
A Convex Route to Thermomechanics: Learning Internal Energy and Dissipation
👁 13 📚 0
🌊