STAT
A recent review by MIT researchers finds that “only about 23% of machine learning studies in health care used multiple datasets to establish their results, compared to 80% in the adjacent field of computer vision, and 58% in natural language processing,” writes Casey Ross for STAT. “If the performance results are not reproduced in clinical care to the standard that was used during [a study], then we risk approving algorithms that we can’t trust,” says graduate student Matthew McDermott. “They may actually end up worsening patient care.”