Update. In response to this problem (previous post, this thread), some publishers are desk-rejecting papers based on open health datasets. The problem is not the quality of the data, but the absence of additional work to validate findings.
Two reports:
1. "Journals and publishers crack down on research from open health data sets," Science, Oct 8, 2025.
https://www.science.org/content/article/journals-and-publishers-crack-down-research-open-health-data-sets
2. "AI: Journals are automatically rejecting public health dataset papers to combat paper mills," BMJ, Oct 15, 2025.
https://www.bmj.com/content/391/bmj.r2170
( #paywalled)