R.A. Fisher wrote that the purpose of statisticians was "constructing a hypothetical infinite population of which the actual data are regarded as constituting a random sample" (p. 311). In "The Zeroth Problem," Colin Mallows wrote "As Fisher pointed out, statisticians earn their living by using two basic tricks-they regard data as being realizations of random variables, and they assume that they know an appropriate specification for these random variables."
Some of the pathological beliefs we attribute to techbros were already present in this view of statistics that started forming over a century ago. Our writing is just data; the real, important object is the “hypothetical infinite population” reflected in a large language model, which at base is a random variable. Stable Diffusion, the image generator, is called that because it is based on latent diffusion models, which are a way of representing complicated distribution functions--the hypothetical infinite populations--of things like digital images. Your art is just data; it’s the latent diffusion model that’s the real deal. The entities that are able to identify the distribution functions (in this case tech companies) are the ones who should be rewarded, not the data generators (you and me).
So much of the dysfunction in today’s machine learning and AI points to how problematic it is to give statistical methods a privileged place that they don’t merit. We really ought to be calling out Fisher for his trickery and seeing it as such.
#AI #GenAI #GenerativeAI #LLM #StableDiffusion #statistics #StatisticalMethods #DiffusionModels #MachineLearning #ML
Japan has requested that OpenAI seek approval in advance to prevent copyright infringement on its Sora 2 short-form video app, after a deluge of Japanese anime characters flooded the platform. https://www.japantimes.co.jp/business/2025/10/16/companies/japan-opt-in-model-sora2/?utm_medium=Social&utm_source=mastodon #business #companies #samaltman #openai #masaakitaira #minorukiuchi #ai #tech #sora2
@thejapantimes It's like watching people walk casually through a museum dragging a paintbrush across every artifact. I wish more people were disgusted by products like #Sora.
So all those "wasteful" research funding grants to fruit fly research motivated and led to the biggest discovery fueling the whole of the modern "AI" boom. One never knows where basic research will lead; it's impossible to predict. Hence basic research is not at all wasteful. On the contrary, it's essential: it's the foundation of a rich, wealthy, creative society. And also very cheap, comparatively: https://albert.rierol.net/tell/20160601_Unintended_consequences_of_untimely_research.html
Search also for the returns on the human genome project, or on the humble origins of DNA sequencing, to name just two among many.
@emma @jaredwhite I use locally hosted Stable Diffusion to improve my visual work, an area where I am not really talented. I think there is a way to incorporate these tools creatively. I mostly just make backgrounds for YouTube thumbnails or restore old family photos.
I think there is a meaningful difference between "let the AI do the work for you" and "use AI tools to improve the way you express yourself", a distinction that the "anti-AI" label does not allow for.