Search results
Enabling high throughput deep reinforcement learning with first principles to investigate catalytic...
Nature· 2 days agoWe model the evolution of the chemical reaction path as a Markov decision process (MDP)28, defined by a state space S, an action space A, a probability transition (P({s}_{ ...