1 code implementation • 19 Jul 2022 • Houkun Zhu, Dominik Scheinert, Lauritz Thamsen, Kordian Gontarska, Odej Kao
Distributed file systems are widely used, yet their default configurations are often suboptimal.
1 code implementation • 27 Aug 2021 • Dominik Scheinert, Houkun Zhu, Lauritz Thamsen, Morgan K. Geldenhuys, Jonathan Will, Alexander Acker, Odej Kao
Distributed dataflow systems like Spark and Flink enable the use of clusters for scalable data analytics.
1 code implementation • 29 Jul 2021 • Dominik Scheinert, Lauritz Thamsen, Houkun Zhu, Jonathan Will, Alexander Acker, Thorsten Wittkopp, Odej Kao
First, a general model is trained on all the available data for a specific scalable analytics algorithm, thereby incorporating data from different contexts.