Scholars@Duke publication: Siamese BERT for authorship verification

Siamese BERT for authorship verification

Publication , Conference

Tyo, J; Dhingra, B; Lipton, Z

Published in: Ceur Workshop Proceedings

January 1, 2021

The PAN 2021 authorship verification (AV) challenge focuses on determining if two texts are written by the same author or not, specifically when faced with new, unseen, authors. In our approach, we construct a Siamese network initialized with pretrained BERT encoders, employing a learning objective that incentives the model to map texts written by the same author to nearby embeddings while mapping texts written by different authors to comparatively distant embeddings. Additionally, inspired by related work in computer vision, we attempt to incorporate triplet losses but are unable to realize any benefit. Our method results in a slight performance gain of 0.9% overall score over the baseline and an increase of 8% in F1 score.

Duke Scholars

Author Bhuwan Dhingra Computer Science

Published In

Ceur Workshop Proceedings

ISSN

1613-0073

Publication Date

January 1, 2021

Volume

2936

Start / End Page

2169 / 2177

Related Subject Headings

4609 Information systems

Citation

APA

Chicago

ICMJE

MLA

NLM

Tyo, J., Dhingra, B., & Lipton, Z. (2021). Siamese BERT for authorship verification. In Ceur Workshop Proceedings (Vol. 2936, pp. 2169–2177).

Tyo, J., B. Dhingra, and Z. Lipton. “Siamese BERT for authorship verification.” In Ceur Workshop Proceedings, 2936:2169–77, 2021.

Tyo J, Dhingra B, Lipton Z. Siamese BERT for authorship verification. In: Ceur Workshop Proceedings. 2021. p. 2169–77.

Tyo, J., et al. “Siamese BERT for authorship verification.” Ceur Workshop Proceedings, vol. 2936, 2021, pp. 2169–77.

Tyo J, Dhingra B, Lipton Z. Siamese BERT for authorship verification. Ceur Workshop Proceedings. 2021. p. 2169–2177.

Published In

Ceur Workshop Proceedings

ISSN

1613-0073

Publication Date

January 1, 2021

Volume

2936

Start / End Page

2169 / 2177

Related Subject Headings

4609 Information systems