Skip to content

Commit

Permalink
Add some info about the model
Browse files Browse the repository at this point in the history
  • Loading branch information
Rishit-dagli committed Apr 13, 2021
1 parent 2c931d8 commit 3eab441
Showing 1 changed file with 9 additions and 1 deletion.
10 changes: 9 additions & 1 deletion README.md
Expand Up @@ -38,9 +38,17 @@ cd perceiver
pip install -e .[dev]
```

## A bit about Perceiver

The Perceiver model aims to deal with arbitrary configurations of different modalities using a single transformer-based architecture. Transformers are often flexible and make few assumptions about their inputs, but that also scale quadratically with the number of inputs in terms of both memory and computation. This model proposes a mechanism that makes it possible to deal with high-dimensional inputs, while retaining the expressivity and flexibility to deal with arbitrary input configurations.

![](images/architecture.PNG)

The idea here is to introduce a small set of latent units that forms an attention bottleneck through which the inputs must pass. This avoids the quadratic scaling problem of all-to-all attention of a classical transformer. The model can be seen as performing a fully end-to-end clustering of the inputs, with the latent units as the cluster centres, leveraging a highly asymmetric crossattention layer. For spatial information the authors compensate for the lack of explicit grid structures in our model by associating Fourier feature encodings.

## Usage

```
```python
from perceiver import Perceiver
import tensorflow as tf

Expand Down

0 comments on commit 3eab441

Please sign in to comment.