登录 注册
找到 367 个结果

Two Localization Strategies for Sequential MCMC Data Assimilation with Applications to Nonlinear Non-Gaussian Geophysical Models

We present a localized data assimilation (DA) scheme based on the sequential Markov Chain Monte Carlo (SMCMC) technique [Ruzayqat et al., 2024], a provably convergent method for filtering high-dimensi...

👤 Hamza Ruzayqat|Hristo G. Chipilski|Omar ... 📰 arXiv 📅 2026 👁 405 📚 13

QMutBench: A Dataset of Quantum Circuit Mutants

Quantum software testing has attracted interest in recent years, prompting the development of various techniques to automate the testing of quantum software. These techniques generate test cases that ...

👤 Eñaut Mendiluze Usandizaga | Thomas Laur... 📰 arXiv 📅 2026 👁 188 📚 13

Synthetic Data, Information, and Prior Knowledge: Why Synthetic Data Augmentation to Boost Sample Doesn't Work for Statistical Inference

The use of synthetic data to deidentify data and to improve predictive models is well-attested to. The augmentation of datasets using synthetically generated data is an alluring proposition: in the be...

👤 Reid Dale, Jordan Rodu, Mike Baiocchi 📰 arXiv 📅 2026 👁 161 📚 13

Option Pricing on Automated Market Maker Tokens

We derive the stochastic price process for tokens whose sole price discovery mechanism is a constant-product automated market maker (AMM). When the net flow into the pool follows a diffusion, the toke...

👤 Philip Z. Maymin 📰 arXiv 📅 2026 👁 80 📚 13

Large Language Models Outperform Humans in Fraud Detection and Resistance to Motivated Investor Pressure

Large language models trained on human feedback may suppress fraud warnings when investors arrive already persuaded of a fraudulent opportunity. We tested this in a preregistered experiment across sev...

👤 Nattavudh Powdthavee 📰 arXiv 📅 2026 👁 73 📚 13

FutureSim: Replaying World Events to Evaluate Adaptive Agents

AI agents are being increasingly deployed in dynamic, open-ended environments that require adapting to new information as it arrives. To efficiently measure this capability for realistic use-cases, we...

👤 Shashwat Goel | Nikhil Chandak | Arvindh... 📰 arXiv 📅 2026 👁 67 📚 13

Flexible modeling of bimodal distributions via skewed-$t$ mixtures

We propose a mixture of location-scale skewed-$t$ distributions to fit bimodal, skewed and heavy-tailed data. In particular, the mixture is based on the skewed-$t$ distribution by Fernández and Steel ...

👤 Marco Bee | Flavio Santi 📰 arXiv 📅 2026 👁 58 📚 13

Automated selection of r for stationary and nonstationary models for r largest order statistics

In generalized extreme value model for the r largest order statistics, denoted by rGEV, the selection of r is critical. The existing entropy difference test for selecting r is applicable to large samp...

👤 Yire Shin|Jihong Park|Jeong-Soo Park 📰 arXiv 📅 2026 👁 49 📚 13

Trojan horse hunt in deep forecasting models: Insights from the European Space Agency competition

Forecasting plays a crucial role in modern safety-critical applications, such as space operations. However, the increasing use of deep forecasting models introduces a new security risk of trojan horse...

👤 Krzysztof Kotowski|Ramez Shendy|Jakub Na... 📰 arXiv 📅 2026 👁 489 📚 12

Behavioral Fingerprints for LLM Endpoint Stability and Identity

The consistency of AI-native applications depends on the behavioral consistency of the model endpoints that power them. Traditional reliability metrics such as uptime, latency and throughput do not ca...

👤 Jonah Leshin, Manish Shah, Ian Timmis, D... 📰 arXiv 📅 2026 👁 472 📚 12

CATEKAPPA: An R Shiny Application for Design and Analysis of Consistency Tests Based on the Kappa Statistic for Categorical Responses

The kappa statistic is the most widely used measure of inter-rater agreement for categorical data. Despite its popularity, applied researchers often encounter two major hurdles: (i) determining the sa...

👤 Zheng Gai | Li Xincheng | Jiang Wangying... 📰 arXiv 📅 2026 👁 158 📚 12

SceneCritic: A Symbolic Evaluator for 3D Indoor Scene Synthesis

Large Language Models (LLMs) and Vision-Language Models (VLMs) increasingly generate indoor scenes through intermediate structures such as layouts and scene graphs, yet evaluation still relies on LLM ...

👤 Kathakoli Sengupta | Kai Ao | Paola Casc... 📰 arXiv 📅 2026 👁 127 📚 12

Functional CLT for general sample covariance matrices

This paper studies the central limit theorems (CLTs) for linear spectral statistics (LSSs) of general sample covariance matrices, when the test functions belong to $C^3$, the class of functions with c...

👤 Jian Cui|Zhijun Liu|Jiang Hu|Zhidong Bai 📰 arXiv 📅 2026 👁 328 📚 11

HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction

Pathology reports are structured, multi-granular documents encoding diagnostic conclusions, histological grades, and ancillary test results across one or more anatomical sites; yet existing pathology ...

👤 Ruicheng Yuan|Zhenxuan Zhang|Anbang Wang... 📰 arXiv 📅 2026 👁 271 📚 11

Ideological Bias in LLMs' Economic Causal Reasoning

Do large language models (LLMs) exhibit systematic ideological bias when reasoning about economic causal effects? As LLMs are increasingly used in policy analysis and economic reporting, where directi...

👤 Donggyu Lee | Hyeok Yun | Jungwon Kim | ... 📰 arXiv 📅 2026 👁 207 📚 11

From Manipulation to Mistrust: Explaining Diverse Micro-Video Misinformation for Robust Debunking in the Wild

The rise of micro-videos has reshaped how misinformation spreads, amplifying its speed, reach, and impact on public trust. Existing benchmarks typically focus on a single deception type, overlooking t...

👤 Zhi Zeng | Yifei Yang | Jiaying Wu | Xul... 📰 arXiv 📅 2026 👁 159 📚 11

ClinHallu: A Benchmark for Diagnosing Stage-Wise Hallucinations in Medical MLLM Reasoning

Building trustworthy medical multimodal large language models (MLLMs) is critical for reliable clinical decision support. Existing medical hallucination benchmarks mainly focus on data collection, but...

👤 Sicheng Yang | Hangjie Yuan | Wenjun Zha... 📰 arXiv 📅 2026 👁 144 📚 11

Path-Explosive Behaviour in Economic Time Series: A Realization-Centred Exploratory Framework

We propose a descriptive, realization-centred framework for detecting and characterising explosive and co-explosive behaviour in economic time series, which we term path-explosive behaviour. Departing...

👤 José Francisco Perles-Ribes 📰 arXiv 📅 2026 👁 126 📚 11

EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

In this paper, we propose EEVEE, the first multi-dataset test-time prompt learning framework for LLM agents, enabling test-time prompt learning under real-world task streams. Existing methods are larg...

👤 Weixian Xu | Shilong Liu | Mengdi Wang 📰 arXiv 📅 2026 👁 101 📚 11

Derivative-Informed Operator Learning for Finance: On-the-Fly Greeks, Surfaces, Hedging, and Control

Financial decision systems require fast surrogate models for pricing, calibration, hedging, XVA, stress testing, and portfolio optimization. Standard neural surrogates reproduce prices or risk quantit...

👤 Miquel Noguer I Alonso 📰 arXiv 📅 2026 👁 59 📚 11
海洋智能体 🌊
海洋智能体
AI科研助手 · 2300篇文献
你在高级搜索页面,告诉我你想找什么方向的文献,我来帮你定位。