Open sourcing Dicer: Databricks's auto-sharder
https://www.databricks.com/blog/open-sourcing-dicer-databricks-auto-sharder
#HackerNews #OpenSourcing #Dicer #Databricks #AutoSharder #DataEngineering #BigData
“Analysis of Biological Data with Python” started today for our biology bachelor students 🧬🐍
If you are interested in learning Python for data analysis, the course slides are freely available here:
https://github.com/bpucker/teaching/tree/master/WBIO-A-07
#Python #DataAnalysis #Biology #OpenEducation #OpenScience #Teaching #Bioinformatics #BigData #DataScience
@PuckerLab
I would really, really like a book-length overview like Smith's "Software Build Systems" https://isbnsearch.org/isbn/9780132171939 but for package management and package managers - something that compares and contrasts existing examples at a design/architectural/needs level and provides a framework for thinking about the entire class of problem.
@gvwilson Until this happens, the Package Management devroom track at the upcoming #FOSDEM26 might be useful, as well as some of the talks at the #HPC, #BigData, and #DataScience one:
https://fosdem.org/2026/schedule/track/package-management/
https://fosdem.org/2026/schedule/track/hpc-big-data-data-science/
Two further results from the project "Human.Machine.Culture" (https://mmk.sbb.berlin/?lang=en) at @stabi_berlin published in Open Access
Guidelines for the Documentation of Ethical, Legal and Social Issues (ELSI) in Cultural Data
https://doi.org/10.5281/zenodo.16418345
Guidelines for the Publication of Cultural Data for AI Research
https://doi.org/10.5281/zenodo.15878097
Feedback on these publications is most welcome!
#bigdata #ML #culturalheritage #ELSI #digitalculturalheritage
Absolutely, 100%, no way, in hell.
I’ll take a 1960s Roper refrigerator over this overpriced, tech-ridden garbage. The sales guy approached me as I was laughing at it and mentioned Samsung has decided to start running ads on them. Folks who fell for the scam can now have ads magically appear in their kitchens.
#InternetOfShit #AIshit #aigarbage #bigtech #bigdata #surveillancecapitalism
Once this infrastructure exists, mission creep is inevitable.
- What starts as ‘voluntary’ becomes mandatory
- A system that is just for workers expands to everyone, including children
https://action.openrightsgroup.org/tell-your-mp-attend-debate-digital-ids
#PoliceState #SurveillanceCapitalism #ToxicLabour #DigitalID #BigData #Cyberattack #hacking #security #privacy #Orwellian #LabourLies #KierStalin #Starmer
[2/2]
The billion-dollar business of user data | c’t uplink
Companies collect clicks, likes, and location data and derive psychological profiles from them. The c’t experts explain how this works in c’t uplink.
#BigData #ct #ctuplink #Datenschutz #IT #Journal #KünstlicheIntelligenz #Netzpolitik #Werbebranche #Tracking #Überwachung #Verbraucherschutz #news
Highly recommended
👇🏻
Teresa Numerico
Big data e algoritmi - Prospettive critiche
Carocci editore
#MastoLibri #Letture #Algoritmo #BigData
Deadline for submissions for the 11th #HPC, #BigData, and #DataScience devroom at #FOSDEM26 (Brussels, Sat-Sun 31 Jan + 1 Feb 2026) is Mon 1 Dec 2025. Please see details at the link below. Looking forward to another dynamic, exciting, packed session! https://hpc-bigdata-fosdem26.github.io/
How Europol is cozying up to Microsoft, Palantir, Clearview & Co.
Statewatch criticizes an unholy alliance between Europol and US tech companies that, it says, brings massive conflicts of interest and transparency problems.
#BigData #Datenschutz #KünstlicheIntelligenz #Microsoft #Netzpolitik #Palantir #Überwachung #news
We’re going to be digging out of this data mess for decades! 👨‍🌾
Once you put trash into your AI 🤖, you have a trash AI. So it stands to reason that anything involving science and medical research would be the same. And we know the current US government isn’t reasonable.
The problem is that it’s really hard to identify where the trash is once the model’s running. We’re going down the slipperiest of slopes.
More tech folks feel free to expand.
#ai #bigtech #uspol #bigdata #energy #MAGA
One of my #universityofnebraska #lincolnne undergraduates speaking up to save our #statistics department. Department website: http://saveourstats.com/
Public comment form: https://apc.unl.edu/fall-2025-budget-reduction-feedback-form/.
Variable assessment: https://srvanderplas.github.io/2025-stat-apc-report/metrics-analysis (it features a possibly Cauchy distributed variable that they then made into a Z-score).
It's insane to eliminate #stats in the age of #datascience, #bigdata, and #ai