LLM tuning: neural language persistence through adaptive mixture

Abstract

This paper presents an architectural framework that addresses knowledge degradation in large language models during continual fine-tuning. The framework takes a Mixture-of-Experts-style approach, integrating multiple low-rank adapters governed by a learned routing mechanism. By freezing the core model parameters and dynamically allocating task-specific adapters, the method preserves the model's inherent world knowledge while improving performance across diverse downstream applications. The proposed Dynamic LoRA-Experts with Prototype-Ensemble Matching (DLEPM) framework demonstrates superior performance on sequential NLP benchmarks, achieving 89.2% average accuracy with only 5.4% forgetting, outperforming existing continual learning methods. Empirical evaluations confirm that the framework maintains model fidelity during continuous adaptation.
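
The sketch below illustrates the general idea described in the abstract: a frozen base projection augmented by several low-rank (LoRA) adapters whose outputs are combined by a learned router. This is a minimal PyTorch sketch under assumed design choices; the class names, adapter rank, and softmax routing are illustrative and are not taken from the paper's implementation.

import torch
import torch.nn as nn
import torch.nn.functional as F

class LoRAExpert(nn.Module):
    # One low-rank adapter: delta_W = B @ A, with rank r much smaller than d.
    def __init__(self, d_in, d_out, rank=8, alpha=16.0):
        super().__init__()
        self.A = nn.Parameter(torch.randn(rank, d_in) * 0.01)
        self.B = nn.Parameter(torch.zeros(d_out, rank))
        self.scale = alpha / rank

    def forward(self, x):
        return (x @ self.A.T) @ self.B.T * self.scale

class MixtureOfLoRALayer(nn.Module):
    # Frozen base linear layer plus a routed mixture of LoRA experts
    # (illustrative stand-in for the adapter/routing design described above).
    def __init__(self, d_in, d_out, num_experts=4, rank=8):
        super().__init__()
        self.base = nn.Linear(d_in, d_out)
        self.base.weight.requires_grad_(False)  # core parameters stay frozen
        self.base.bias.requires_grad_(False)
        self.experts = nn.ModuleList(
            LoRAExpert(d_in, d_out, rank) for _ in range(num_experts)
        )
        self.router = nn.Linear(d_in, num_experts)  # learned routing over experts

    def forward(self, x):
        gates = F.softmax(self.router(x), dim=-1)                        # (..., E)
        expert_out = torch.stack([e(x) for e in self.experts], dim=-1)   # (..., d_out, E)
        mixed = (expert_out * gates.unsqueeze(-2)).sum(dim=-1)           # weighted combination
        return self.base(x) + mixed

# Usage: only the adapters and the router receive gradients during fine-tuning.
layer = MixtureOfLoRALayer(d_in=768, d_out=768)
x = torch.randn(2, 16, 768)
print(layer(x).shape)  # torch.Size([2, 16, 768])
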

Subject

continual learning
catastrophic forgetting
parameter-efficient fine-tuning
large language models
low-rank adaptation
