A Data-Centric Analysis of the Impact of Training Data Quality vs. Quantity on P300 Brain-Computer Interface Performance (Student Abstract)
The current standard for training brain-computer interface (BCI) machine learning models is user-specific calibration. There is strong interest in developing generic models trained on data from other users to minimize BCI calibration time; however, this approach is limited by noisy, non-stationary brain signals and high inter-user variability. We investigate the trade-off between training data quality and quantity for P300 BCI performance in individuals with amyotrophic lateral sclerosis (ALS), using a representative traditional machine learning model (stepwise linear discriminant analysis, SWLDA) and a deep learning model (EEGNet). Results show that data quality and domain alignment matter more than dataset size: user-specific models trained on substantially less data outperformed generic models; generic models trained on ALS data outperformed those trained on non-ALS data; block-averaging of features was mostly detrimental to EEGNet but beneficial to SWLDA; and accounting for inter-stimulus interval differences between ALS and non-ALS data had minimal effect. Our findings highlight the importance of individualized model tuning for reliable P300 BCIs.