Sparse Subspace-to-Expert Sharing for Task-Agnostic Continual 学习 (Learning)
Sparse Subspace-to-Expert Sharing for Task-Agnostic Continual Learning

作者
Authors Fatema Siddika | Md Anwar Hossen | Tanwi Mallick | Ali Jannesari

期刊
Journal arXiv

年份
Year 2026

分类
Category 数据分析
Data Analysis

国家
Country -

🔗 访问原文
🔗 Access Paper

📝 摘要
Abstract

Continual learning in Large Language Models (LLMs) is hindered by the plasticity-stability dilemma, where acquiring new capabilities often leads to catastrophic forgetting of previous knowledge. Existing methods typically treat parameters uniformly, failing to distinguish between specific task knowledge and shared capabilities. We introduce Mixture of Sparse Experts for Task Agnostic Continual Learning (SETA), a framework that resolves the plasticity-stability conflict through adaptive sparse subspace decomposition into task-specific expert modules. Unlike standard updates, where tasks compete for the same parameters, SETA separates knowledge into unique experts, designed to isolate task-specific patterns, and shared experts, responsible for capturing common features. This structure is maintained through adaptive elastic anchoring and a routing-aware regularization that jointly protect shared knowledge at both the weight and routing levels and enable a unified gating network to automatically retrieve the correct expert combination during inference. Extensive experiments across diverse domain-specific benchmarks demonstrate that SETA achieves competitive or superior overall performance relative to state-of-the-art continual learning baselines, with particularly strong retention of early-task knowledge and improved backward transfer on LLaMA-2 7B and Qwen3-4B.

📊 文章统计
Article Statistics

基础数据
Basic Stats

42 浏览
Views

0 下载
Downloads

10 引用
Citations

引用趋势
Citation Trend

阅读国家分布
Country Distribution

阅读机构分布
Institution Distribution

月度浏览趋势
Monthly Views

影响因子分析
Impact Analysis

2.70 综合评分
Overall Score

引用影响力
Citation Impact

浏览热度
View Popularity

下载频次
Download Frequency

Sparse Subspace-to-Expert Sharing for Task-Agnostic Continual 学习 (Learning)
Sparse Subspace-to-Expert Sharing for Task-Agnostic Continual Learning

📝 摘要
Abstract

📊 文章统计
Article Statistics

基础数据
Basic Stats

引用趋势
Citation Trend

阅读国家分布
Country Distribution

阅读机构分布
Institution Distribution

月度浏览趋势
Monthly Views

相关关键词
Related Keywords

影响因子分析
Impact Analysis

📄 相关文章
Related Articles

Sparse Subspace-to-Expert Sharing for Task-Agnostic Continual 学习 (Learning)Sparse Subspace-to-Expert Sharing for Task-Agnostic Continual Learning

📝 摘要Abstract

📊 文章统计Article Statistics

基础数据Basic Stats

引用趋势Citation Trend

阅读国家分布Country Distribution

阅读机构分布Institution Distribution

月度浏览趋势Monthly Views

相关关键词Related Keywords

影响因子分析Impact Analysis

📄 相关文章Related Articles

海洋智能分析Ocean AI Analysis

Sparse Subspace-to-Expert Sharing for Task-Agnostic Continual 学习 (Learning)
Sparse Subspace-to-Expert Sharing for Task-Agnostic Continual Learning

📝 摘要
Abstract

📊 文章统计
Article Statistics

基础数据
Basic Stats

引用趋势
Citation Trend

阅读国家分布
Country Distribution

阅读机构分布
Institution Distribution

月度浏览趋势
Monthly Views

相关关键词
Related Keywords

影响因子分析
Impact Analysis

📄 相关文章
Related Articles