Skip to main content

OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection and Discovery

Publication ,  Conference
Inkawhich, M; Inkawhich, N; Yang, H; Zhang, J; Linderman, R; Chen, Y
Published in: Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024
January 1, 2024

An object detector's ability to detect and flag novel objects during open-world deployments is critical for many real-world applications. Unfortunately, much of the work in open object detection today is disjointed and fails to adequately address applications that prioritize unknown object recall in addition to known-class accuracy. To close this gap, we present a new task called Open-Set Object Detection and Discovery (OSODD) and as a solution propose the Open-Set Regions with ViT features (OSR-ViT) detection framework. OSR-ViT combines a class-agnostic proposal network with a powerful ViT-based classifier. Its modular design simplifies optimization and allows users to easily swap proposal solutions and feature extractors to best suit their application. Using our multifaceted evaluation protocol, we show that OSR-ViT obtains performance levels that far exceed state-of-the-art supervised methods. Our method also excels in low-data settings, outperforming supervised baselines using a fraction of the training data.

Duke Scholars

Published In

Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024

DOI

Publication Date

January 1, 2024

Start / End Page

928 / 937
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Inkawhich, M., Inkawhich, N., Yang, H., Zhang, J., Linderman, R., & Chen, Y. (2024). OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection and Discovery. In Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024 (pp. 928–937). https://doi.org/10.1109/BigData62323.2024.10826036
Inkawhich, M., N. Inkawhich, H. Yang, J. Zhang, R. Linderman, and Y. Chen. “OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection and Discovery.” In Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024, 928–37, 2024. https://doi.org/10.1109/BigData62323.2024.10826036.
Inkawhich M, Inkawhich N, Yang H, Zhang J, Linderman R, Chen Y. OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection and Discovery. In: Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024. 2024. p. 928–37.
Inkawhich, M., et al. “OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection and Discovery.” Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024, 2024, pp. 928–37. Scopus, doi:10.1109/BigData62323.2024.10826036.
Inkawhich M, Inkawhich N, Yang H, Zhang J, Linderman R, Chen Y. OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection and Discovery. Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024. 2024. p. 928–937.

Published In

Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024

DOI

Publication Date

January 1, 2024

Start / End Page

928 / 937