RL is more information inefficient than you thought
https://www.dwarkesh.com/p/bits-per-sample
#HackerNews #RL #information #inefficiency #AI #research #machinelearning #dataanalysis
RL is more information inefficient than you thought
https://www.dwarkesh.com/p/bits-per-sample
#HackerNews #RL #information #inefficiency #AI #research #machinelearning #dataanalysis