Skip to content

elephantmipt/real_lru

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Real LRU

This repository contains an implementation of the Linear Recurrent Unit using real numbers for computation.

Motivation

In today's world, training large models requires a significant amount of computation. We can reduce training time by employing AMP training. To encourage research in finding alternative architectures for Large Language Models, it is essential to speed up training and bring LRU on par with the training times of transformers.

Experiments

Results obtained by running bench.py.

speed_figure

Plan

  • Mixed Precision computation
  • Materializing $\Lambda$ through real numbers
  • Simple LM with LRU
  • Wikitext trained model

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages