no code implementations • 20 Jul 2021 • Diego Pino, Javier García, Fernando Fernández, Svitlana S Vyetrenko
Regarding the second one, this paper uses Probabilistic Policy Reuse to balance the exploitation/exploration in the learning of a new financial MDP according to the similarity of the previous financial MDPs whose knowledge is reused.
no code implementations • 9 Mar 2021 • Jingyi Wang, Guglielmo Mastroserio, Erin Kara, Javier García, Adam Ingram, Riley Connors, Michiel van der Klis, Thomas Dauser, James Steiner, Douglas Buisson, Jeroen Homan, Matteo Lucchini, Andrew Fabian, Joe Bright, Rob Fender, Edward Cackett, Ron Remillard
We find the corona expansion (as probed by reverberation) precedes a radio flare by ~5 days, which may suggest that the hard-to-soft transition is marked by the corona expanding vertically and launching a jet knot that propagates along the jet stream at relativistic velocities.
High Energy Astrophysical Phenomena
no code implementations • 8 Mar 2021 • Álvaro Visús, Javier García, Fernando Fernández
Although the notion of task similarity is potentially interesting in a wide range of areas such as curriculum learning or automated planning, it has mostly been tied to transfer learning.
no code implementations • 12 Feb 2021 • Rubén Majadas, Javier García, Fernando Fernández
Reinforcement Learning (RL) algorithms have led to recent successes in solving complex games, such as Atari or Starcraft, and to a huge impact in real-world applications, such as cybersecurity or autonomous driving.