← Back to feed
2026-05-25datacode

Length Generalization with Log-Depth Recurrent Units

Charles Pert, Dalal Alrajeh, Alessandra Russo

PDF preview unavailable
Read on arXiv →

Key claim

MLP-LDRU achieves 100% accuracy on 18 regular-language tasks.

The paper presents MLP-LDRU, a novel architecture that effectively addresses length generalization in neural networks. It achieves outstanding accuracy on various regular-language tasks, outperforming existing recurrent and attention-based models. This advancement could lead to improved performance in tasks requiring understanding of sequence length.

In plain English

The paper presents MLP-LDRU, a novel architecture that effectively addresses length generalization in neural networks. It achieves outstanding accuracy on various regular-language tasks, outperforming existing recurrent and attention-based models. This advancement could lead to improved performance in tasks requiring understanding of sequence length.

Novelty
8.0/10

The introduction of MLP-LDRU presents a significant new method for addressing length generalization in neural networks.

Reliability
8.0/10

The paper evaluates MLP-LDRU on multiple tasks with strong performance metrics, indicating solid experimental validation.

Deep reliability assessment

The methodology supports the claim that MLP-LDRU achieves high out-of-distribution accuracy on regular language tasks, but the generalization to more complex settings remains less substantiated. The paper may overclaim the robustness of the model without extensive testing across diverse language structures.

Reproducibility

Yes, the authors mention that code will be released upon publication.

Discussion questions

  1. How does the assumption of associativity bias in operators hold up in more complex language tasks beyond regular languages?
  2. What are the practical implications of using MLP-LDRU for real-world sequence processing tasks in industries like NLP or robotics?
  3. What specific conditions or changes in the training data would lead to a failure in achieving the reported out-of-distribution accuracy?

Key figure

Figure 1 illustrates the prefix language P2,2, showing the states and transitions of the Moore machine used for testing long-range dependency handling.

Benchmark results

Regular tasksOOD accuracy: 100vs RNN+22.6%SOTA
Regular tasksOOD accuracy: 100vs Gated DeltaNet+23.5%SOTA
Regular tasksOOD accuracy: 100vs LSTM+0.0%SOTA
Codelink
1Code will be released upon publication.Official