Siamese BERT for authorship verification

Publication, Conference
Tyo, J; Dhingra, B; Lipton, Z
Published in: CEUR Workshop Proceedings
January 1, 2021

The PAN 2021 authorship verification (AV) challenge focuses on determining whether two texts were written by the same author, specifically when faced with new, unseen authors. In our approach, we construct a Siamese network initialized with pretrained BERT encoders, employing a learning objective that incentivizes the model to map texts written by the same author to nearby embeddings while mapping texts written by different authors to comparatively distant embeddings. Additionally, inspired by related work in computer vision, we attempt to incorporate triplet losses but are unable to realize any benefit. Our method results in a slight performance gain of 0.9% in overall score over the baseline and an increase of 8% in F1 score.
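The abstract does not give the exact form of the learning objective, so the following is only a minimal sketch of the general idea, assuming a standard margin-based contrastive loss and a standard triplet loss over Euclidean distance. The function names, the `margin` and `threshold` values, and the use of plain Python lists in place of BERT embeddings are all illustrative assumptions, not the authors' implementation.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def contrastive_loss(emb_a, emb_b, same_author, margin=1.0):
    """Assumed margin-based contrastive loss (not taken from the paper).

    Same-author pairs are penalized by their squared distance, which pulls
    them together. Different-author pairs are penalized only when they are
    closer than `margin`, which pushes them apart.
    """
    d = euclidean(emb_a, emb_b)
    if same_author:
        return d ** 2
    return max(0.0, margin - d) ** 2


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Assumed standard triplet loss: the anchor should be closer to the
    positive (same author) than to the negative (different author) by at
    least `margin`."""
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


def predict_same_author(emb_a, emb_b, threshold=0.5):
    """Hypothetical decision rule: call two texts same-author when their
    embeddings are closer than `threshold`."""
    return euclidean(emb_a, emb_b) < threshold
```

In a Siamese setup, both texts would pass through the same encoder (shared weights) to produce `emb_a` and `emb_b`; the losses above would then be averaged over a batch of pairs or triplets.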

Published In: CEUR Workshop Proceedings
ISSN: 1613-0073
Publication Date: January 1, 2021
Volume: 2936
Start / End Page: 2169 / 2177
Related Subject Headings: 4609 Information systems
 

Citation

APA: Tyo, J., Dhingra, B., & Lipton, Z. (2021). Siamese BERT for authorship verification. In CEUR Workshop Proceedings (Vol. 2936, pp. 2169–2177).
Chicago: Tyo, J., B. Dhingra, and Z. Lipton. “Siamese BERT for authorship verification.” In CEUR Workshop Proceedings, 2936:2169–77, 2021.
ICMJE: Tyo J, Dhingra B, Lipton Z. Siamese BERT for authorship verification. In: CEUR Workshop Proceedings. 2021. p. 2169–77.
MLA: Tyo, J., et al. “Siamese BERT for authorship verification.” CEUR Workshop Proceedings, vol. 2936, 2021, pp. 2169–77.
NLM: Tyo J, Dhingra B, Lipton Z. Siamese BERT for authorship verification. CEUR Workshop Proceedings. 2021. p. 2169–2177.
