Skip to main content
Journal cover image

Wide and deep learning for automatic cell type identification.

Publication ,  Journal Article
Wilson, CM; Fridley, BL; Conejo-Garcia, JR; Wang, X; Yu, X
Published in: Comput Struct Biotechnol J
2021

Cell type classification is an important problem in cancer research, especially with the advent of single cell technologies. Correctly identifying cells within the tumor microenvironment can provide oncologists with a snapshot of how a patient's immune system reacts to the tumor. Wide and deep learning (WDL) is an approach to construct a cell-classification prediction model that can learn patterns within high-dimensional data (deep) and ensure that biologically relevant features (wide) remain in the final model. In this paper, we demonstrate that regularization can prevent overfitting and adding a wide component to a neural network can result in a model with better predictive performance. In particular, we observed that a combination of dropout and ℓ 2 regularization can lead to a validation loss function that does not depend on the number of training iterations and does not experience a significant decrease in prediction accuracy compared to models with ℓ 1 , dropout, or no regularization. Additionally, we show WDL can have superior classification accuracy when the training and testing of a model are completed data on that arise from the same cancer type but different platforms. More specifically, WDL compared to traditional deep learning models can substantially increase the overall cell type prediction accuracy (36.5 to 86.9%) and T cell subtypes (CD4: 2.4 to 59.1%, and CD8: 19.5 to 96.1%) when the models were trained using melanoma data obtained from the 10X platform and tested on basal cell carcinoma data obtained using SMART-seq. WDL obtains higher accuracy when compared to state-of-the-art cell classification algorithms CHETAH (70.36%) and SingleR (70.59%).

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Comput Struct Biotechnol J

DOI

ISSN

2001-0370

Publication Date

2021

Volume

19

Start / End Page

1052 / 1062

Location

Netherlands

Related Subject Headings

  • 4601 Applied computing
  • 3101 Biochemistry and cell biology
  • 0802 Computation Theory and Mathematics
  • 0103 Numerical and Computational Mathematics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Wilson, C. M., Fridley, B. L., Conejo-Garcia, J. R., Wang, X., & Yu, X. (2021). Wide and deep learning for automatic cell type identification. Comput Struct Biotechnol J, 19, 1052–1062. https://doi.org/10.1016/j.csbj.2021.01.027
Wilson, Christopher M., Brooke L. Fridley, José R. Conejo-Garcia, Xuefeng Wang, and Xiaoqing Yu. “Wide and deep learning for automatic cell type identification.Comput Struct Biotechnol J 19 (2021): 1052–62. https://doi.org/10.1016/j.csbj.2021.01.027.
Wilson CM, Fridley BL, Conejo-Garcia JR, Wang X, Yu X. Wide and deep learning for automatic cell type identification. Comput Struct Biotechnol J. 2021;19:1052–62.
Wilson, Christopher M., et al. “Wide and deep learning for automatic cell type identification.Comput Struct Biotechnol J, vol. 19, 2021, pp. 1052–62. Pubmed, doi:10.1016/j.csbj.2021.01.027.
Wilson CM, Fridley BL, Conejo-Garcia JR, Wang X, Yu X. Wide and deep learning for automatic cell type identification. Comput Struct Biotechnol J. 2021;19:1052–1062.
Journal cover image

Published In

Comput Struct Biotechnol J

DOI

ISSN

2001-0370

Publication Date

2021

Volume

19

Start / End Page

1052 / 1062

Location

Netherlands

Related Subject Headings

  • 4601 Applied computing
  • 3101 Biochemistry and cell biology
  • 0802 Computation Theory and Mathematics
  • 0103 Numerical and Computational Mathematics