Found 367 results

On the Promises and Limits of Multi-omics Integration for Deconvolution: The HADACA3 Benchmark

Understanding the cellular composition of complex tissues, such as tumors, is a key challenge in biology and medicine. A common approach, known as deconvolution, aims to estimate the cellular composit...

👤 Hugo Barbot | Elise Amblard | Nicolas Ho... 📰 arXiv 📅 2026 👁 102 📚 25

S2A3: Thompson Sampling and Stochastic Exposure Control for High-Stakes CATs

High-stakes computerized adaptive tests (CATs) require a continuous supply of calibrated items, yet traditional item piloting is slow, expensive, and operationally hazardous. We introduce the S2A3 fra...

👤 James Sharpnack | Alexander Tsigler | J.... 📰 arXiv 📅 2026 👁 83 📚 25

A Practical Guide to Instrumental Variables Methods with Heterogeneous Treatment Effects

Instrumental variables (IV) methods are central to applied microeconomics. While classical approaches assume linear models with constant effects, recent literature has shifted toward the local average...

👤 Tymon Słoczyński | Liyang Sun | S. Derya... 📰 arXiv 📅 2026 👁 57 📚 25

PolicySim: An LLM-Based Agent Social Simulation Sandbox for Proactive Policy Optimization

Social platforms serve as central hubs for information exchange, where user behaviors and platform interventions jointly shape opinions. However, intervention policies like recommendation and content ...

👤 Renhong Huang | Ning Tang | Jiarong Xu | Yuxua... 📰 arXiv 📅 2026 👁 572 📚 24

Ranking Reasoning LLMs under Test-Time Scaling

Test-time scaling evaluates reasoning LLMs by sampling multiple outputs per prompt, but ranking models in this regime remains underexplored. We formalize dense benchmark ranking under test-time scalin...

👤 Mohsen Hariri | Michael Hinczewski | Jing Ma... 📰 arXiv 📅 2026 👁 439 📚 24

CoverageBench: Evaluating Information Coverage across Tasks and Domains

We wish to measure the information coverage of an ad hoc retrieval algorithm, that is, how much of the range of available relevant information is covered by the search results. Information coverage is...

👤 Saron Samuel | Andrew Yates | Dawn Lawrie | Ia... 📰 arXiv 📅 2026 👁 434 📚 24

Machine Learning Based Mesh Movement for Non-Hydrostatic Tsunami Simulation

This study investigates the use of machine learning based mesh adaptivity, specifically mesh movement methods (UM2N), with depth integrated non-hydrostatic shallow water models. Motivation for this co...

👤 Yezhang Li | Stephan C. Kramer | Matthew D. ... 📰 arXiv 📅 2026 👁 196 📚 24

Bitcoin's Power Law: Weak Structure, Strong Forecasts

Bitcoin's price has been described as following a power law (PL) in time, $P \sim t^{\beta}$ with $\hat{\beta} \approx 5.7$ over 2010-2026. We test this claim using the Clauset-Shalizi-Newman protocol applied to B...

👤 Carlos Baquero | Raquel Menezes 📰 arXiv 📅 2026 👁 184 📚 24

MedObvious: Exposing the Medical Moravec's Paradox in VLMs via Clinical Triage

Vision Language Models (VLMs) are increasingly used for tasks like medical report generation and visual question answering. However, fluent diagnostic text does not guarantee safe visual understanding...

👤 Ufaq Khan | Umair Nawaz | L D M S S Teja... 📰 arXiv 📅 2026 👁 131 📚 24

Simpler and Improved Replacement Path Coverings

An important tool in the design of fault-tolerant graph data structures are $(L,f)$-replacement path coverings (RPCs). An RPC is a family $\mathcal{G}$ of subgraphs of a given graph $G$ such that, for...

👤 Davide Bilò | Shiri Chechik | Keerti Cho... 📰 arXiv 📅 2026 👁 127 📚 24

Continuous Hidden Markov Models for Equity Returns: Heavy-Tail Emission Families and Regime-Conditional Value-at-Risk

Synthetic generators of daily equity returns let practitioners stress test, backtest, and design scenarios that a single realized market history cannot supply, but only if the generator reproduces the...

👤 Abdulrahman Alswaidan | Cade Jin | Jeffr... 📰 arXiv 📅 2026 👁 89 📚 24

Testing Preferential Sampling

Geostatistics aims to infer a spatially continuous phenomenon from observations collected at a finite number of locations, frequently measured with error. Whenever there is stochastic dependence betwe...

👤 Isabel Natario | Andreia Monteiro 📰 arXiv 📅 2026 👁 82 📚 24

UniPool: A Globally Shared Expert Pool for Mixture-of-Experts

Modern Mixture-of-Experts (MoE) architectures allocate expert capacity through a rigid per-layer rule: each transformer layer owns a separate expert set. This convention couples depth scaling with lin...

👤 Minbin Huang | Han Shi | Chuanyang Zheng... 📰 arXiv 📅 2026 👁 72 📚 24

Generating Financial Time Series by Matching Random Convolutional Features

Generating realistic financial time series is challenging as training data is often limited to a single historical path. With such scarce data, overfitting is hard to avoid, especially under adversari...

👤 Konrad J. Mueller | Nikita Zozoulenko | ... 📰 arXiv 📅 2026 👁 57 📚 24

Power Analysis for Prediction-Powered Inference

Modern studies increasingly leverage outcomes predicted by machine learning and artificial intelligence (AI/ML) models, and recent work, such as prediction-powered inference (PPI), has developed valid...

👤 Yiqun T. Chen | Moran Guo | Shengy Li 📰 arXiv 📅 2026 👁 47 📚 24

A Job I Like or a Job I Can Get: Designing Job Recommender Systems Using Field Experiments

Recommendation systems (RSs) are increasingly used to guide job seekers on online platforms, yet the algorithms currently deployed are typically optimized for predictive objectives such as clicks, app...

👤 Guillaume Bied | Philippe Caillou | Brun... 📰 arXiv 📅 2026 👁 222 📚 23

Efficacy of Scalable Airline-led Contrail Avoidance

Contrails account for a large portion of aviation's contribution to anthropogenic climate change. Navigational contrail avoidance is a promising solution to mitigate the warming caused by contrails. P...

👤 Tharun Sankar | Thomas Dean | Tristan Abbott... 📰 arXiv 📅 2026 👁 212 📚 23

How Transparent is DiffusionGemma?

LLM reasoning transparency is a critical affordance for understanding model decisions, mitigating misuse and misalignment, and debugging surprising model behaviors. However, DiffusionGemma performs a ...

👤 Joshua Engels | Callum McDougall | Bilal... 📰 arXiv 📅 2026 👁 194 📚 23

DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios

Real-world data visualization (DV) requires native environmental grounding, cross-platform evolution, and proactive intent alignment. Yet, existing benchmarks often suffer from code-sandbox confinemen...

👤 Jinxiang Meng | Shaoping Huang | Fangyu ... 📰 arXiv 📅 2026 👁 186 📚 23

Sometimes nonparametrics beat parametrics, even when the model is right

A basic issue in both teaching of and practice of statistics is the interplay between modelling assumptions and inference performance. The general message conveyed is that stronger assumptions lead to...

👤 Morten Byholt | Nils Lid Hjort 📰 arXiv 📅 2026 👁 155 📚 23