no code implementations • 19 May 2021 • M. Lautaro Hickmann, Fabian Wurzberger, Megi Hoxhalli, Arne Lochner, Jessica Töllich, Ansgar Scherp
We observe a high correlation between the attention weights and this reference metric, especially on the the later decoding layers of the transformer architecture.
2 code implementations • 11 Feb 2021 • Ishwar Venugopal, Jessica Töllich, Michael Fairbank, Ansgar Scherp
In contrast to existing studies, we evaluate our models' performance at different stages of a process, determined by quartiles of the number of events and normalized quarters of the case duration.