no code implementations • 11 Dec 2023 • Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel
To do so, we investigate a simple self-training method based on expectation-maximization, which we call ReST$^{EM}$, where we (1) generate samples from the model and filter them using binary feedback, (2) fine-tune the model on these samples, and (3) repeat this process a few times.
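The generate–filter–fine-tune loop described above can be sketched as a toy EM-style iteration. Here the "model" is reduced to a single probability `p` of producing an answer that passes the binary filter, and "fine-tuning" just nudges `p` toward the filtered data; this is a deliberately simplified illustration of the loop's structure, not the paper's actual ReST$^{EM}$ implementation.

```python
import random

def sample_from_model(p, n):
    """Draw n binary samples; 1 means the sample passes the binary filter."""
    return [1 if random.random() < p else 0 for _ in range(n)]

def rest_em(p, n_iters=3, n_samples=100, lr=0.5):
    """Toy ReST^EM loop: (1) generate, (2) filter, (3) fine-tune, repeat."""
    for _ in range(n_iters):
        samples = sample_from_model(p, n_samples)     # (1) generate samples
        kept = [s for s in samples if s == 1]         # (2) binary-feedback filter
        if kept:
            # (3) "fine-tune" on the kept (all-correct) samples: move p
            # toward correctness, proportionally to how much data survived.
            p = p + lr * (len(kept) / n_samples) * (1.0 - p)
    return p

random.seed(0)
print(round(rest_em(0.2), 3))  # p grows over the iterations
```

Because only filter-passing samples are kept, each iteration can only push the toy model toward producing correct outputs, mirroring why repeating the process a few times helps.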
no code implementations • 18 Feb 2023 • Bradley Butcher, Miri Zilka, Darren Cook, Jiri Hron, Adrian Weller
We argue for the utility of a human-in-the-loop approach in applications where high precision is required, but purely manual extraction is infeasible.
no code implementations • 27 Jun 2022 • Jiri Hron, Karl Krauth, Michael I. Jordan, Niki Kilbertus, Sarah Dean
To this end, we propose tools for numerically finding equilibria in exposure games, and illustrate results of an audit on the MovieLens and LastFM datasets.
no code implementations • 15 Jun 2022 • Jiri Hron, Roman Novak, Jeffrey Pennington, Jascha Sohl-Dickstein
We introduce repriorisation, a data-dependent reparameterisation which transforms a Bayesian neural network (BNN) posterior to a distribution whose KL divergence to the BNN prior vanishes as layer widths grow.
no code implementations • 28 Jun 2021 • Jiri Hron, Karl Krauth, Michael I. Jordan, Niki Kilbertus
Thanks to their scalability, two-stage recommenders are used by many of today's largest online platforms, including YouTube, LinkedIn, and Pinterest.
no code implementations • 1 Sep 2020 • Jiri Hron, Karl Krauth, Michael I. Jordan, Niki Kilbertus

In this work, we focus on the complementary issue of exploration.
1 code implementation • 18 Jun 2020 • Jiri Hron, Yasaman Bahri, Roman Novak, Jeffrey Pennington, Jascha Sohl-Dickstein
Recent work has shown that the prior over functions induced by a deep Bayesian neural network (BNN) behaves as a Gaussian process (GP) as the width of all layers becomes large.
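The wide-width Gaussian behaviour mentioned above can be checked numerically with a toy one-hidden-layer network: for a fixed input, the output is a sum of width-many i.i.d. terms, so under 1/√width scaling the central limit theorem makes it approximately Gaussian as the width grows. This small stand-alone sketch is illustrative only and is not the paper's construction.

```python
import random, math

def random_net_output(x, width):
    """One forward pass with fresh N(0,1) weights and 1/sqrt(width) output scaling."""
    hidden = [math.tanh(random.gauss(0, 1) * x) for _ in range(width)]
    return sum(random.gauss(0, 1) * h for h in hidden) / math.sqrt(width)

def sample_outputs(x=0.7, width=256, n=2000):
    """Sample the network's output at a fixed input over many weight draws."""
    return [random_net_output(x, width) for _ in range(n)]

random.seed(0)
outs = sample_outputs()
mean = sum(outs) / len(outs)
var = sum((o - mean) ** 2 for o in outs) / len(outs)
print(round(mean, 2), round(var, 2))  # sample mean near 0, stable variance
```

Repeating this with larger widths leaves the mean and variance essentially unchanged while the output histogram looks increasingly Gaussian, which is the GP limit in its simplest single-input form.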
1 code implementation • ICML 2020 • Jiri Hron, Yasaman Bahri, Jascha Sohl-Dickstein, Roman Novak
There is a growing body of literature on the relationship between wide neural networks (NNs) and Gaussian processes (GPs), identifying an equivalence between the two for a variety of NN architectures.
3 code implementations • ICLR 2020 • Roman Novak, Lechao Xiao, Jiri Hron, Jaehoon Lee, Alexander A. Alemi, Jascha Sohl-Dickstein, Samuel S. Schoenholz
Neural Tangents is a library designed to enable research into infinite-width neural networks.
no code implementations • 9 Mar 2019 • Mark Rowland, Jiri Hron, Yunhao Tang, Krzysztof Choromanski, Tamas Sarlos, Adrian Weller
Wasserstein distances are increasingly used in a wide variety of applications in machine learning.
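In one dimension, the Wasserstein distance has a particularly simple form: for two empirical distributions with the same number of samples, the 1-Wasserstein distance is the mean absolute difference between the sorted samples. This is a standard fact, sketched here as a self-contained helper (not code from the paper):

```python
def wasserstein_1d(xs, ys):
    """W_1 between empirical distributions of two equal-size 1-D samples."""
    assert len(xs) == len(ys), "equal sample sizes assumed for this formula"
    xs, ys = sorted(xs), sorted(ys)
    # Optimal transport in 1-D matches the i-th smallest point of xs
    # to the i-th smallest point of ys.
    return sum(abs(a - b) for a, b in zip(xs, ys)) / len(xs)

print(wasserstein_1d([0.0, 1.0, 2.0], [1.0, 2.0, 3.0]))  # shift by 1 gives 1.0
```

The sorting trick is what makes 1-D Wasserstein cheap; in higher dimensions the distance requires solving an optimal-transport problem, which is why approximations are an active research topic.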
2 code implementations • NeurIPS 2019 • David Janz, Jiri Hron, Przemysław Mazur, Katja Hofmann, José Miguel Hernández-Lobato, Sebastian Tschiatschek
Posterior sampling for reinforcement learning (PSRL) is an effective method for balancing exploration and exploitation in reinforcement learning.
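The exploration/exploitation idea behind PSRL is easiest to see in its bandit reduction: maintain a Beta posterior over each arm's success probability, sample one plausible model from the posterior, act greedily for that sample, and update with the observed reward. Full PSRL samples an entire MDP per episode; the bandit version below is only an illustrative sketch.

```python
import random

def psrl_bandit(true_probs, n_rounds=2000, seed=0):
    """Posterior-sampling (Thompson) loop for a Bernoulli bandit."""
    rng = random.Random(seed)
    k = len(true_probs)
    alpha, beta = [1] * k, [1] * k          # Beta(1, 1) priors per arm
    pulls = [0] * k
    for _ in range(n_rounds):
        # Sample one plausible model from the current posterior...
        sampled = [rng.betavariate(alpha[i], beta[i]) for i in range(k)]
        # ...and act greedily with respect to that sampled model.
        arm = max(range(k), key=lambda i: sampled[i])
        reward = 1 if rng.random() < true_probs[arm] else 0
        alpha[arm] += reward                 # posterior update
        beta[arm] += 1 - reward
        pulls[arm] += 1
    return pulls

print(psrl_bandit([0.3, 0.7]))  # the better arm accumulates most pulls
```

Early on, wide posteriors make the sampled models disagree, so all arms get tried (exploration); as evidence accumulates the posteriors concentrate and the loop settles on the best arm (exploitation).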
no code implementations • 11 Oct 2018 • Roman Novak, Lechao Xiao, Jaehoon Lee, Yasaman Bahri, Greg Yang, Jiri Hron, Daniel A. Abolafia, Jeffrey Pennington, Jascha Sohl-Dickstein
There is a previously identified equivalence between wide fully connected neural networks (FCNs) and Gaussian processes (GPs).
no code implementations • ICML 2018 • Jiri Hron, Alexander G. de G. Matthews, Zoubin Ghahramani
Dropout, a stochastic regularisation technique for training of neural networks, has recently been reinterpreted as a specific type of approximate inference algorithm for Bayesian neural networks.
2 code implementations • ICLR 2018 • Alexander G. de G. Matthews, Mark Rowland, Jiri Hron, Richard E. Turner, Zoubin Ghahramani
Whilst deep neural networks have shown great empirical success, there is still much work to be done to understand their theoretical properties.
no code implementations • 8 Nov 2017 • Jiri Hron, Alexander G. de G. Matthews, Zoubin Ghahramani
Gaussian multiplicative noise is commonly used as a stochastic regularisation technique in training of deterministic neural networks.
5 code implementations • NeurIPS 2017 • Yarin Gal, Jiri Hron, Alex Kendall
Dropout is used as a practical tool to obtain uncertainty estimates in large vision models and reinforcement learning (RL) tasks.
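Dropout-based uncertainty works by keeping dropout active at test time, running several stochastic forward passes, and reading off the spread of the predictions. The tiny fixed-weight "network" below is purely illustrative; Concrete Dropout additionally learns the dropout probability, which this toy does not do.

```python
import random, math

WEIGHTS = [0.5, -1.2, 0.8, 2.0, -0.3]  # one linear "layer", fixed for the demo

def forward(x, p_drop, rng):
    """Single stochastic pass: drop each unit with probability p_drop."""
    kept = [w for w in WEIGHTS if rng.random() >= p_drop]
    return x * sum(kept) / (1.0 - p_drop)  # inverted-dropout rescaling

def mc_dropout_predict(x, p_drop=0.2, n_passes=500, seed=0):
    """MC-dropout prediction: mean of stochastic passes, std as uncertainty."""
    rng = random.Random(seed)
    preds = [forward(x, p_drop, rng) for _ in range(n_passes)]
    mean = sum(preds) / n_passes
    std = math.sqrt(sum((p - mean) ** 2 for p in preds) / n_passes)
    return mean, std

mean, std = mc_dropout_predict(1.0)
print(round(mean, 2), round(std, 2))  # predictive mean and its spread
```

The predictive mean stays close to the deterministic output (the sum of the weights), while the standard deviation across passes serves as the uncertainty estimate that the vision and RL applications above rely on.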