Siamese BERT for authorship verification
Publication
, Conference
Tyo, J; Dhingra, B; Lipton, Z
Published in: CEUR Workshop Proceedings
January 1, 2021
The PAN 2021 authorship verification (AV) challenge focuses on determining if two texts are written by the same author or not, specifically when faced with new, unseen, authors. In our approach, we construct a Siamese network initialized with pretrained BERT encoders, employing a learning objective that incentives the model to map texts written by the same author to nearby embeddings while mapping texts written by different authors to comparatively distant embeddings. Additionally, inspired by related work in computer vision, we attempt to incorporate triplet losses but are unable to realize any benefit. Our method results in a slight performance gain of 0.9% overall score over the baseline and an increase of 8% in F1 score.
Duke Scholars
Published In
CEUR Workshop Proceedings
ISSN
1613-0073
Publication Date
January 1, 2021
Volume
2936
Start / End Page
2169 / 2177
Related Subject Headings
- 4609 Information systems
Citation
APA
Chicago
ICMJE
MLA
NLM
Tyo, J., Dhingra, B., & Lipton, Z. (2021). Siamese BERT for authorship verification. In CEUR Workshop Proceedings (Vol. 2936, pp. 2169–2177).
Tyo, J., B. Dhingra, and Z. Lipton. “Siamese BERT for authorship verification.” In CEUR Workshop Proceedings, 2936:2169–77, 2021.
Tyo J, Dhingra B, Lipton Z. Siamese BERT for authorship verification. In: CEUR Workshop Proceedings. 2021. p. 2169–77.
Tyo, J., et al. “Siamese BERT for authorship verification.” CEUR Workshop Proceedings, vol. 2936, 2021, pp. 2169–77.
Tyo J, Dhingra B, Lipton Z. Siamese BERT for authorship verification. CEUR Workshop Proceedings. 2021. p. 2169–2177.
Published In
CEUR Workshop Proceedings
ISSN
1613-0073
Publication Date
January 1, 2021
Volume
2936
Start / End Page
2169 / 2177
Related Subject Headings
- 4609 Information systems