New study: #ChatGPT is not very good at predicting the #reproducibility of a research article from its methods section.
https://link.springer.com/article/10.1007/s10115-025-02428-z
PS: Five years ago, I asked this question on Twitter/X: "If a successful replication boosts the credibility a research article, then does a prediction of a successful replication, from an honest prediction market, do the same, even to a small degree?"
https://x.com/petersuber/status/1259521012196167681
What if #LLMs eventually make these predictions better than prediction markets? Will research #assessment committees (notoriously inclined to resort to simplistic #metrics) start to rely on LLM replication or reproducibility predictions?
