The Sequence Knowledge #874: Transformers or Not'

Posted by Alumni from Substack

June 10, 2026

The Transformer is currently the reference architecture for serious AI. Not because it is obviously the most brain-like, elegant, or efficient design, but because it has the best scaling story. You add data, parameters, compute, context length, better training recipes, better post-training, and the model gets better in a surprisingly smooth way. That is rare. In deep learning, many ideas are clever. Few are industrial. The Transformer's superpower is attention. Every token can look at every other token and decide what matters. This is an incredibly general operation. It works for language, code, images, audio, video, protein sequences, robotics tokens, and tool traces. The architecture is simple enough to scale, parallel enough to train efficiently, and expressive enough to absorb huge datasets. But it has an obvious tax: attention is expensive. Full self-attention scales badly with sequence length. In autoregressive generation, the model accumulates a key-value cache, which grows... learn more

Expertise

Find out how we connect targeted research expertise in academia to your business requirements. Discover how we accelerate business innovation and take care of the paperwork (hourly fees, fixed price, IP acquisition, seed funding)

Learn more about our events, organized by our ambassadors. Discover events organized by circle, university, metro area, and more.

Connect with Unicircles members at the universities and schools in our network.

Investors

Discover the opportunities for investors.

Find out how we facilitate investments with startups

Learn more about the opportunity behind startup investments

Corporates

Discover the opportunities for corporates.

Find out more about methodology behind how we facilitate collaboration between startups and corporates.

Learn more about the services tailored to corporates.

Check out our case studies.

Community

A global ecosystem of innovators empowering other innovators.

A global ecosystem of innovators empowering other innovators.

Find out more about partner opportunities

Check out our global events.

Unicircles

The marketplace for academic expertise and innovation.

Our story and expertise.

Send us a message, we will get back ASAP.

Join our team.

Company news, case studies, articles and more.

The Sequence Knowledge #874: Transformers or Not'

JOIN UNICIRCLES The leading marketplace for advanced expertise and funding. learn more

JOIN UNICIRCLES
The leading marketplace for advanced expertise and funding. learn more