G'morning 💕, guys, plz wish me luck 🥹
Today I have an interview for a scholarship to a data science training program with Heal Palestine, and I'm currently preparing for it. If anyone has any advice or tips, please share them with me.
@aral @fabio my legends in tech, do you have any advice or info I should know before the interview?
#datascience #lm #machinelearning #technology
The unreasonable effectiveness of pattern matching
https://arxiv.org/abs/2601.11432
#HackerNews #unreasonableeffectiveness #patternmatching #machinelearning #datascience #arxiv #research
Command-line Tools can be 235x Faster than your Hadoop Cluster (2014)
https://adamdrake.com/command-line-tools-can-be-235x-faster-than-your-hadoop-cluster.html
#HackerNews #CommandLineTools #HadoopCluster #Performance #BigData #TechInsights #DataScience
Counterfactual evaluation for recommendation systems
https://eugeneyan.com/writing/counterfactual-evaluation/
#HackerNews #CounterfactualEvaluation #RecommendationSystems #MachineLearning #DataScience #ArtificialIntelligence
Hi #Segeln (sailing) & #CCC! ⛵️💻
I'm a sailor and want to solve the "denominator problem" of #Orca attacks. We have the events, but we lack the #AIS statistics.
I'm starting OAIC to correlate incident data with AIS base populations. Replacing myths with math!
Looking for IT nerds for NMEA parsing & statistics. The vision is set; who will deliver code?
Details: github.com
Please boost! 🚀
#DataScience #Python #PostGIS #OpenSource #Orcas #Hacking
A new preprint from my group is available: "A Century of Migration (1830–1939): 735,000 Enriched Records from Bremen’s Ship Passenger Lists"
New article by @kiru outlines a training course in bibliographic data science, bridging library science, digital humanities, and data science.
It argues that bibliographic data analysis should become a core methodological skill for understanding large-scale cultural and historical patterns.
The piece makes a strong case for structured training programs to close this gap in DH and library education.
#DigitalHumanities #BibliographicData #DataScience
https://bibliodata.substack.com/p/an-outline-of-an-imagined-training
An inspiring essay from @kiru setting out a potential curriculum for a bibliographic #DataScience training course. "...for those who already have some knowledge in one of the relevant fields (e.g., library science, cultural history, literary sociology, information technology)".
An outline of an imagined training course on #bibliographic #data science https://bibliodata.substack.com/p/an-outline-of-an-imagined-training #DigitalLibraries #metadata #DigitalHumanities
“Analysis of Biological Data with Python” started today for our biology bachelor students 🧬🐍
If you are interested in learning Python for data analysis, the course slides are freely available here:
https://github.com/bpucker/teaching/tree/master/WBIO-A-07
#Python #DataAnalysis #Biology #OpenEducation #OpenScience #Teaching #Bioinformatics #BigData #DataScience
@PuckerLab
rtopy: an R to Python bridge – novelties
https://thierrymoudiki.github.io/blog/2026/01/08/r/python/rtopy
Various shape regularization algorithms
https://github.com/nickponline/shreg
#HackerNews #shapeRegularization #algorithms #machineLearning #dataScience #GitHub #research
Hello Mastodon!
I’m a Data Analyst taking my next step into Data Science by beginning my Master’s degree at CU Boulder this year.
I’m excited to connect with you all and am hoping to hear some insights and learn from you!
If you have tips, resources, or just want to connect and muse about interesting topics, please reach out!
Repli-scooping some of what I find in a soon-to-be-finished paper about correlations and effects between reflective reasoning and philosophical thought experiments across multiple participant samples, the Brauer lab finds:
- mTurk workers offered lower-quality and lower-value data than Prolific workers, students, and even CloudResearch's approved mTurk workers
- Qualtrics panels had the least value but moderate quality
- Students seemed to offer the highest value
I forgot to share the #mTurk data quality result that got scooped:
“In late 2020…. Participants from the United States were recruited from Amazon Mechanical Turk, #CloudResearch, #Prolific, and a #university. One participant source yielded up to 18 times as many low-quality respondents as the other three.”
https://doi.org/10.1093/analys/anaf015
#psychology #philosophy #surveyMethods #quantMethods #dataScience #qualityControl
Reflection-philosophy order effects and correlations across samples
Do you have data that drives you crazy?
Questions about data access, privacy, ethics, or governance you’ve never had the right space to ask?
The EconData Workshop creates exactly that space.
Organised by BERD@NFDI with DRA and IAB, this unconference brings together people who work with data to tackle real challenges, from reproducibility and data governance to AI, LLMs, and unstructured data.
26–27 Feb 2026 📍 Nuremberg
Free registration: https://eveeno.com/econ-data-workshop
#OpenScience #DataScience