alcinnz
alcinnz boosted

New server, new #introduction:

👋 Hi! I'm Eric! I'm a scientific programmer & educator who writes mostly rstats code. I currently work part time at #UniversityOfArizona where I help researchers by making R packages, #Shiny apps, automated data workflows, and training them on #ReproducibleResearch practices. I also mentor data scientists for @Posit Academy. I am #OpenToWork as a contractor if any of those skills sound useful to you.

My background is in plant chemical ecology and population ecology. For my PhD I studied #tea and did field work in China. I love tea (although I've been drinking coffee more and more lately) and practice #GongFuCha when I can. I love #foraging and tasting new things.

I currently live in #Tucson but the #BayArea will always be my home.

#rstats #rse #datascience #dataviz #ecology #chemistry #statistics

Want to really understand how RAG, vector search & chunking work?

Then stop reading theory and build your own chatbot.

This guide shows you how to create a local PDF chatbot using:

☕ LangChain

☕ FAISS (vector DB)

☕ Mistral via Ollama

☕ Python & Streamlit

Step-by-step, from environment setup to deployment. Ideal for learning how Retrieval-Augmented Generation works in practice.

👉 https://medium.com/data-science-collective/rag-in-action-build-your-own-local-pdf-chatbot-as-a-beginner-96c2833869ff

Comment “WANT” if you need the friend link to the article because you don’t have a paid Medium membership.

#rag #tech #Technology #chatbot #AI #ki #python #vector #langchain #datascience #DataScientist #streamlit
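
For anyone who wants a feel for the chunking and vector search steps before opening the guide, here is a minimal sketch in plain Python. It is not the guide's code and does not use LangChain, FAISS, or Ollama: the word-count "embedding" is a toy stand-in for a real embedding model, and every function name (`chunk_text`, `embed`, `cosine`, `retrieve`, `build_prompt`) is hypothetical, chosen only for illustration.

```python
import math
import re
from collections import Counter


def chunk_text(text, size=8, overlap=2):
    """Split text into chunks of `size` words, each sharing `overlap` words with the previous one."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than size")
    words = text.split()
    step = size - overlap
    return [" ".join(words[i:i + size]) for i in range(0, len(words), step)]


def embed(text):
    """Toy embedding (assumption): a bag-of-words count vector instead of a learned model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(question, chunks, k=2):
    """Return the k chunks most similar to the question (brute-force vector search)."""
    q = embed(question)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]


def build_prompt(question, chunks, k=2):
    """Assemble the augmented prompt that a local LLM would receive."""
    context = "\n".join(retrieve(question, chunks, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"
```

In a real setup the text would come from a PDF, `embed` would call an embedding model, a vector index would replace the brute-force sort in `retrieve`, and the string from `build_prompt` would be sent to the model.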

Data manipulation within the US Federal Government [1]

👉Government datasets modified without notice.

▪️We gathered metadata from the US Department of Health and Human Services, CDC, and Veterans Affairs database harvest sources [...] that were modified between Jan 20 and March 25, 2025.

▪️We found that 114 (49%) of the 232 included datasets were substantially altered.

⭐SOME CHANGES⭐

- “Social determinants of health” to “non-medical factors”
- “Gender” to “sex”
- “female details” column deleted

▪️The agencies involved have not issued any statements confirming or explaining these changes [...]

▪️Despite calls for “radical transparency” from Robert F Kennedy Jr, Secretary of the Department of Health and Human Services, unlogged data manipulation moves away from meaningful transparency.

▪️Only 15 (13%) of the 114 altered datasets logged or otherwise indicated that the change had occurred.

[1] 🌐https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(25)01249-8/fulltext

@publichealth @psychology @sociology @datascience #publichealth #datascience #science #research #health #government #criticalthinking #hhs #cdc #va @bicmay
