Skip to main content

A Data-Centric Analysis of the Impact of Training Data Quality vs. Quantity on P300 Brain-Computer Interface Performance (Student Abstract)

Publication ,  Conference
Gupta, A; Liu, A; Haines, E; Alghamdi, R; Kota, AS; Collins, LM; Mainsah, BO
Published in: Proceedings of the Aaai Conference on Artificial Intelligence
January 1, 2026

The current standard for training brain-computer interface (BCI) machine learning models is user-specific. There is a high interest in developing generic models that are trained on data from other users to minimize BCI calibration time; however, this is limited by noisy, non-stationary brain signals and high inter-user variability. We investigate the trade-off between training data quality and quantity on P300 BCI performance in individuals with amyotrophic lateral sclerosis (ALS) with representative traditional machine learning (stepwise linear discriminant analysis, SWLDA) and deep learning (EEGNet) models. Results show that data quality and domain alignment are more critical than dataset size: user-specific models trained on significantly less data outperformed generic models; generic models trained on ALS data outperformed models trained on non-ALS data; block-averaging of features was mostly detrimental to EEGNet but beneficial to SWLDA; and accounting for inter-stimulus interval differences between ALS and non-ALS data had minimal effect. Our findings highlight the importance of individualized model tuning for reliable P300 BCIs.

Duke Scholars

Published In

Proceedings of the Aaai Conference on Artificial Intelligence

DOI

EISSN

2374-3468

ISSN

2159-5399

Publication Date

January 1, 2026

Volume

40

Issue

48

Start / End Page

41217 / 41219
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Gupta, A., Liu, A., Haines, E., Alghamdi, R., Kota, A. S., Collins, L. M., & Mainsah, B. O. (2026). A Data-Centric Analysis of the Impact of Training Data Quality vs. Quantity on P300 Brain-Computer Interface Performance (Student Abstract). In Proceedings of the Aaai Conference on Artificial Intelligence (Vol. 40, pp. 41217–41219). https://doi.org/10.1609/aaai.v40i48.42219
Gupta, A., A. Liu, E. Haines, R. Alghamdi, A. S. Kota, L. M. Collins, and B. O. Mainsah. “A Data-Centric Analysis of the Impact of Training Data Quality vs. Quantity on P300 Brain-Computer Interface Performance (Student Abstract).” In Proceedings of the Aaai Conference on Artificial Intelligence, 40:41217–19, 2026. https://doi.org/10.1609/aaai.v40i48.42219.
Gupta A, Liu A, Haines E, Alghamdi R, Kota AS, Collins LM, et al. A Data-Centric Analysis of the Impact of Training Data Quality vs. Quantity on P300 Brain-Computer Interface Performance (Student Abstract). In: Proceedings of the Aaai Conference on Artificial Intelligence. 2026. p. 41217–9.
Gupta, A., et al. “A Data-Centric Analysis of the Impact of Training Data Quality vs. Quantity on P300 Brain-Computer Interface Performance (Student Abstract).” Proceedings of the Aaai Conference on Artificial Intelligence, vol. 40, no. 48, 2026, pp. 41217–19. Scopus, doi:10.1609/aaai.v40i48.42219.
Gupta A, Liu A, Haines E, Alghamdi R, Kota AS, Collins LM, Mainsah BO. A Data-Centric Analysis of the Impact of Training Data Quality vs. Quantity on P300 Brain-Computer Interface Performance (Student Abstract). Proceedings of the Aaai Conference on Artificial Intelligence. 2026. p. 41217–41219.

Published In

Proceedings of the Aaai Conference on Artificial Intelligence

DOI

EISSN

2374-3468

ISSN

2159-5399

Publication Date

January 1, 2026

Volume

40

Issue

48

Start / End Page

41217 / 41219