RL is more information inefficient than you thought
https://www.dwarkesh.com/p/bits-per-sample
#HackerNews #RL #information #inefficiency #AI #research #machinelearning #dataanalysis
#Tag
RL is more information inefficient than you thought
https://www.dwarkesh.com/p/bits-per-sample
#HackerNews #RL #information #inefficiency #AI #research #machinelearning #dataanalysis
A space for Bonfire maintainers and contributors to communicate