Search Results for author: Franz A. Heinsen

Found 5 papers, 5 papers with code

Softmax Attention with Constant Cost per Token

1 code implementation • 8 Apr 2024 • Franz A. Heinsen

We propose a simple modification to the conventional attention mechanism applied by Transformers: Instead of quantifying pairwise query-key similarity with scaled dot-products, we quantify it with the logarithms of scaled dot-products of exponentials.

Efficient Parallelization of a Ubiquitous Sequential Computation

1 code implementation • 27 Oct 2023 • Franz A. Heinsen

We find a succinct expression for computing the sequence $x_t = a_t x_{t-1} + b_t$ in parallel with two prefix sums, given $t = (1, 2, \dots, n)$, $a_t \in \mathbb{R}^n$, $b_t \in \mathbb{R}^n$, and initial value $x_0 \in \mathbb{R}$.
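To make the two-prefix-sum idea concrete, here is a minimal sketch, not the author's implementation. It assumes scalar coefficients $a_t \neq 0$, works directly in the linear domain (a numerically robust version would more likely work with logarithms), and uses hypothetical function names. Writing $A_t = \prod_{k \le t} a_k$, the recurrence unrolls to $x_t = A_t \left( x_0 + \sum_{k \le t} b_k / A_k \right)$, so one prefix scan yields $A_t$ and a second yields the running sum. Both scans are parallelizable in principle; `itertools.accumulate` runs them sequentially here and only illustrates the algebra.

```python
from itertools import accumulate
import operator


def linear_recurrence_sequential(a, b, x0):
    """Reference: evaluate x_t = a_t * x_{t-1} + b_t one step at a time."""
    xs = []
    x = x0
    for a_t, b_t in zip(a, b):
        x = a_t * x + b_t
        xs.append(x)
    return xs


def linear_recurrence_prefix(a, b, x0):
    """Same sequence via two prefix scans (assumes every a_t is nonzero)."""
    # First scan: cumulative products A_t = a_1 * a_2 * ... * a_t.
    A = list(accumulate(a, operator.mul))
    # Second scan: cumulative sums S_t = sum over k <= t of b_k / A_k.
    S = list(accumulate(b_k / A_k for b_k, A_k in zip(b, A)))
    # Combine elementwise: x_t = A_t * (x_0 + S_t).
    return [A_t * (x0 + S_t) for A_t, S_t in zip(A, S)]


if __name__ == "__main__":
    a = [0.5, 2.0, -1.5, 0.25]
    b = [1.0, -2.0, 0.5, 3.0]
    print(linear_recurrence_sequential(a, b, 1.0))
    print(linear_recurrence_prefix(a, b, 1.0))
```

The two functions should agree up to floating-point rounding. Dividing by $A_t$ can overflow or underflow on long sequences, which is one reason a log-domain formulation may be preferable in practice.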

An Algorithm for Routing Vectors in Sequences

1 code implementation • 20 Nov 2022 • Franz A. Heinsen

We propose a routing algorithm that takes a sequence of vectors and computes a new sequence with specified length and vector size.

Image Classification • Sentiment Analysis • +1

Tree Methods for Hierarchical Classification in Parallel

2 code implementations • 21 Sep 2022 • Franz A. Heinsen

We propose methods that enable efficient hierarchical classification in parallel.

Classification

An Algorithm for Routing Capsules in All Domains

1 code implementation • 2 Nov 2019 • Franz A. Heinsen

Building on recent work on capsule networks, we propose a new, general-purpose form of "routing by agreement" that activates output capsules in a layer as a function of their net benefit to use and net cost to ignore input capsules from earlier layers.

Image Classification • Sentiment Analysis
