Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Next Generation of Sequence Modeling Architectures

The Role of State Matrix Initialization in SSMs: A Perspective on the Approximation-Estimation Tradeoff

Fusheng Liu · Qianxiao Li


Abstract:

State space models (SSMs) have shown great potential in sequence modeling across various ap- plications. Current initialization schemes for the SSM state matrix primarily rely on the HiPPO framework that is built on function approximation. In this paper, we aim to understand the role of the state matrix initialization in a different angle which can be used to explain the ben- efit of complex-valued SSMs. Furthermore, for complex-valued initialization, we uncover an approximation-estimation tradeoff when training SSMs with a specific class of target functions.

Chat is not available.