登录 注册
找到 12 个结果

Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech

Personalizing Automatic Speech Recognition (ASR) for non-normative speech remains challenging because data collection is labor-intensive and model training is technically complex. To address these lim...

👤 Niclas Pokel|Yiming Zhao|Pehuén Moure|Yi... 📰 arXiv 📅 2026 👁 178 📚 41

Integrating Heterogeneous Information in Randomized Experiments: A Unified Calibration Framework

In modern randomized experiments, large-scale data collection increasingly yields rich baseline covariates and auxiliary information from multiple sources. Such information offers opportunities for mo...

👤 Wei Ma|Zeqi Wu|Zheng Zhang 📰 arXiv 📅 2026 👁 294 📚 28

When do trajectories matter? Identifiability analysis for stochastic transport phenomena

Stochastic models of diffusion are routinely used to study dispersal of populations, including populations of animals, plants, seeds and cells. Advances in imaging and field measurement technologies m...

👤 Matthew J Simpson | Michael J Plank 📰 arXiv 📅 2026 👁 61 📚 28

Building informative materials datasets beyond targeted objectives

Materials science data collection can be expensive, making the reuse and long-term utility of datasets critical important for future discovery campaigns. In practice, researchers prioritize a subset o...

👤 Rafael Espinosa Castañeda | Ashley Dale ... 📰 arXiv 📅 2026 👁 195 📚 26

Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments

Autonomous aerial vehicles (AAVs) empower sixth-generation (6G) Internet-of-Things (IoT) networks through mobility-driven data collection. However, conventional reward-driven reinforcement learning fo...

👤 Xiucheng Wang|Zhenye Chen|Nan Cheng 📰 arXiv 📅 2026 👁 255 📚 24

Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models

On-policy distillation (OPD) trains student models under their own induced distribution while leveraging supervision from stronger teachers. We identify a failure mode of OPD: as training progresses, ...

👤 Feng Luo | Yu-Neng Chuang | Guanchu Wang... 📰 arXiv 📅 2026 👁 84 📚 20

AssayBench: An Assay-Level Virtual Cell Benchmark for LLMs and Agents

Recent advances in machine learning and large-scale biological data collections have revived the prospect of building a virtual cell, a computational model of cellular behavior that could accelerate b...

👤 Edward De Brouwer | Carl Edwards | Alexa... 📰 arXiv 📅 2026 👁 176 📚 19

ClinHallu: A Benchmark for Diagnosing Stage-Wise Hallucinations in Medical MLLM Reasoning

Building trustworthy medical multimodal large language models (MLLMs) is critical for reliable clinical decision support. Existing medical hallucination benchmarks mainly focus on data collection, but...

👤 Sicheng Yang | Hangjie Yuan | Wenjun Zha... 📰 arXiv 📅 2026 👁 144 📚 11

123D: Unifying Multi-Modal Autonomous Driving Data at Scale

The pursuit of autonomous driving has produced one of the richest sensor data collections in all of robotics. However, its scale and diversity remain largely untapped. Each dataset adopts different 2D...

👤 Daniel Dauner | Valentin Charraut | Bast... 📰 arXiv 📅 2026 👁 201 📚 10

Exploring ocean data: comprehensive approaches to data collection and the role of public databases in facilitating the interconnection between ocean and human health.

Since the year 2000, oceanic research has seen a surge in data collection, with approximately 500,000 sets of measurements for a single variable (e.g., temperature) recorded annually. Yet, further adv...

👤 Muratore Anna, Notargiacomo Lorenza, Mat... 📰 未知期刊 📅 2026 👁 18 📚 8

Hidden in Plain Sight: Decades of Industrial-Scale Fishing in the Ocean's Twilight Zone.

There is a common misconception among ocean scientists and policy makers that mesopelagic (200-1000 m) food webs are an unexploited "final frontier" of living marine resources. It is true that there a...

👤 Arostegui Martin C, Thorrold Simon R, Br... 📰 Global Change Biology 📅 2026 👁 46 📚 4

AutoDex: An Automated Real-World System for Dexterous Grasping Data Collection

Learning robust dexterous grasping requires real-world data that records the physical outcomes of grasp attempts. Such data is hard to obtain at scale: teleoperation yields valid physical outcomes but...

👤 Mingi Choi | Gunhee Kim | Jisoo Kim | Ta... 📰 arXiv 📅 2026 👁 64 📚 2
海洋智能体 🌊
海洋智能体
AI科研助手 · 2270篇文献
你在高级搜索页面,告诉我你想找什么方向的文献,我来帮你定位。