登录 注册
找到 367 个结果

Variance Estimation with Dependence and Heterogeneous Means

This paper considers the problem of estimating the variance of a sum of a triangular array of random vectors with heterogeneous means. When random vectors exhibit two-way cluster dependence or weak de...

👤 Luther Yap 📰 arXiv 📅 2026 👁 365 📚 50

FinTradeBench: A Financial Reasoning Benchmark for LLMs

Real-world financial decision-making is a challenging problem that requires reasoning over heterogeneous signals, including company fundamentals derived from regulatory filings and trading signals com...

👤 Yogesh Agrawal, Aniruddha Dutta, Md Maha... 📰 arXiv 📅 2026 👁 311 📚 50

On the Ability of Transformers to Verify Plans

Transformers have shown inconsistent success in AI planning tasks, and theoretical understanding of when generalization should be expected has been limited. We take important steps towards addressing ...

👤 Yash Sarrof|Yupei Du|Katharina Stein|Ale... 📰 arXiv 📅 2026 👁 236 📚 50

Authority-Level Priors: An Under-Specified Constraint in Hierarchical Predictive Processing

Hierarchical predictive processing explains adaptive behaviour through precision-weighted inference. Explicit belief revision often fails to produce corresponding changes in stress reactivity or auton...

👤 Marcela Palejova 📰 arXiv 📅 2026 👁 359 📚 49

Pretrained Multilingual Transformers Reveal Quantitative Distance Between Human Languages

Understanding the distance between human languages is central to linguistics, anthropology, and tracing human evolutionary history. Yet, while linguistics has long provided rich qualitative accounts o...

👤 Yue Zhao, Jiatao Gu, Paloma Jeretič, Wei... 📰 arXiv 📅 2026 👁 174 📚 49

RXNRECer Enables Fine-grained Enzymatic Function Annotation through Active Learning and Protein Language Models

A key challenge in enzyme annotation is identifying the biochemical reactions catalyzed by proteins. Most existing methods rely on Enzyme Commission (EC) numbers as intermediaries: they first predict ...

👤 Zhenkun Shi|Jun Zhu|Dehang Wang|BoYu Che... 📰 arXiv 📅 2026 👁 514 📚 48

I Can't Believe It's Corrupt: Evaluating Corruption in Multi-Agent Governance Systems

Large language models are increasingly proposed as autonomous agents for high-stakes public workflows, yet we lack systematic evidence about whether they would follow institutional rules when granted ...

👤 Vedanta S P, Ponnurangam Kumaraguru 📰 arXiv 📅 2026 👁 322 📚 48

P vs NP Problem in Portfolio Optimization: Integrating the Markowitz-CAPM Framework with Cardinality Constraints and Black-Scholes Derivative Pricing

This paper makes the Millennium Prize problem P vs NP operational in quantitative finance by studying cardinality-constrained portfolio selection. Starting from the convex Markowitz mean-variance prog...

👤 Davit Gondauri 📰 arXiv 📅 2026 👁 320 📚 48

LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families

Large language models (LLMs) have driven substantial advances in speech language models (SpeechLMs), yielding strong performance in automatic speech recognition (ASR) under high-resource conditions. H...

👤 Jianan Chen|Xiaoxue Gao|Tatsuya Kawahara... 📰 arXiv 📅 2026 👁 383 📚 47

ClawTrap: A MITM-Based Red-Teaming Framework for Real-World OpenClaw Security Evaluation

Autonomous web agents such as \textbf{OpenClaw} are rapidly moving into high-impact real-world workflows, but their security robustness under live network threats remains insufficiently evaluated. Exi...

👤 Haochen Zhao|Shaoyang Cui 📰 arXiv 📅 2026 👁 342 📚 47

Beyond Binary Success: Sample-Efficient and Statistically Rigorous Robot Policy Comparison

Generalist robot manipulation policies are becoming increasingly capable, but are limited in evaluation to a small number of hardware rollouts. This strong resource constraint in real-world testing ne...

👤 David Snyder|Apurva Badithela|Nikolai Ma... 📰 arXiv 📅 2026 👁 313 📚 47

LOO-PIT predictive model checking

We consider predictive checking for Bayesian model assessment using leave-one-out probability integral transform (LOO-PIT). LOO-PIT values are conditional cumulative predictive probabilities given LOO...

👤 Herman Tesso|Aki Vehtari 📰 arXiv 📅 2026 👁 259 📚 47

ATHENA: Adaptive Test-Time Steering for Improving Count Fidelity in Diffusion Models

Text-to-image diffusion models achieve high visual fidelity but surprisingly exhibit systematic failures in numerical control when prompts specify explicit object counts. To address this limitation, w...

👤 Mohammad Shahab Sepehri|Asal Mehradfar|B... 📰 arXiv 📅 2026 👁 138 📚 47

Multivariate normality test based on the uniform distribution on the Stiefel manifold

This study presents a new procedure for necessary tests of multivariate normality based on the uniform distribution on the Stiefel manifold. We demonstrate that the test statistic, which is formed by ...

👤 Koki Shimizu|Toshiya Iwashita 📰 arXiv 📅 2026 👁 65 📚 46

CarbonBench: A Global Benchmark for Upscaling of Carbon Fluxes Using Zero-Shot Learning

Accurately quantifying terrestrial carbon exchange is essential for climate policy and carbon accounting, yet models must generalize to ecosystems underrepresented in sparse eddy covariance observatio...

👤 Aleksei Rozanov|Arvind Renganathan|Yimen... 📰 arXiv 📅 2026 👁 335 📚 45

Causal Attribution of Coastal Water Clarity Degradation to Nickel Processing Expansion at the Indonesia Morowali Industrial Park, Sulawesi

Indonesia's nickel ore export ban has driven rapid expansion of smelting and hydrometallurgical processing capacity at the Indonesia Morowali Industrial Park (IMIP), now the world's largest integrated...

👤 Sandy Hardian Susanto Herho|Alfita Puspa... 📰 arXiv 📅 2026 👁 319 📚 45

ForeComp: An R Package for Comparing Predictive Accuracy Using Fixed-Smoothing Asymptotics

We introduce ForeComp, an R package for comparing predictive accuracy using Diebold-Mariano type tests of equal predictive ability with standard and fixed smoothing inference. The package provides a c...

👤 Minchul Shin|Nathan Schor 📰 arXiv 📅 2026 👁 287 📚 45

Bayesian-guided inverse design of hyperelastic microstructures: Application to stochastic metamaterials

From a given pool of all feasible design variants, our aim is to identify a structure that achieves a target macroscopic stress response. For each candidate design, the response is obtained from a hig...

👤 Hooman Danesh|Henning Wessels 📰 arXiv 📅 2026 👁 276 📚 45

Minimizing Type 2 Errors in an Experiment-Rich Regime via Optimal Resource Allocation

Randomized experiments (often known as "A/B tests") are widely used to evaluate product and service innovations. We study how to allocate limited experimentation resources across M concurrent experime...

👤 Fenghua Yang, Dae Woong Ham, Stefanus Ja... 📰 arXiv 📅 2026 👁 179 📚 45

Numerical Considerations for the Construction of Karhunen-Loève Expansions

This report examines numerical aspects of constructing Karhunen-Loève expansions (KLEs) for second-order stochastic processes. The KLE relies on the spectral decomposition of the covariance operator v...

👤 Cosmin Safta, Habib N. Najm 📰 arXiv 📅 2026 👁 137 📚 45
海洋智能体 🌊
海洋智能体
AI科研助手 · 2300篇文献
你在高级搜索页面,告诉我你想找什么方向的文献,我来帮你定位。