Skip to main content

Yoked learning in molecular data science

Publication ,  Journal Article
Li, Z; Xiang, Y; Wen, Y; Reker, D
Published in: Artificial Intelligence in the Life Sciences
June 1, 2024

Active machine learning is an established and increasingly popular experimental design technique where the machine learning model can request additional data to improve the model's predictive performance. It is generally assumed that this data is optimal for the machine learning model since it relies on the model's predictions or model architecture and therefore cannot be transferred to other models. Inspired by research in pedagogy, we here introduce the concept of yoked machine learning where a second machine learning model learns from the data selected by another model. We found that in 48% of the benchmarked combinations, yoked learning performed similar or better than active learning. We analyze distinct cases in which yoked learning can improve active learning performance. In particular, we prototype yoked deep learning (YoDeL) where a classic machine learning model provides data to a deep neural network, thereby mitigating challenges of active deep learning such as slow refitting time per learning iteration and poor performance on small datasets. In summary, we expect the new concept of yoked (deep) learning to provide a competitive option to boost the performance of active learning and benefit from distinct capabilities of multiple machine learning models during data acquisition, training, and deployment.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Artificial Intelligence in the Life Sciences

DOI

EISSN

2667-3185

Publication Date

June 1, 2024

Volume

5
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Li, Z., Xiang, Y., Wen, Y., & Reker, D. (2024). Yoked learning in molecular data science. Artificial Intelligence in the Life Sciences, 5. https://doi.org/10.1016/j.ailsci.2023.100089
Li, Z., Y. Xiang, Y. Wen, and D. Reker. “Yoked learning in molecular data science.” Artificial Intelligence in the Life Sciences 5 (June 1, 2024). https://doi.org/10.1016/j.ailsci.2023.100089.
Li Z, Xiang Y, Wen Y, Reker D. Yoked learning in molecular data science. Artificial Intelligence in the Life Sciences. 2024 Jun 1;5.
Li, Z., et al. “Yoked learning in molecular data science.” Artificial Intelligence in the Life Sciences, vol. 5, June 2024. Scopus, doi:10.1016/j.ailsci.2023.100089.
Li Z, Xiang Y, Wen Y, Reker D. Yoked learning in molecular data science. Artificial Intelligence in the Life Sciences. 2024 Jun 1;5.

Published In

Artificial Intelligence in the Life Sciences

DOI

EISSN

2667-3185

Publication Date

June 1, 2024

Volume

5