Layered Insights: Generalizable Analysis of Authorial Style by Leveraging All Transformer Layers
Abstract
Leveraging multiple layers of pre-trained transformer models improves the robustness of authorship attribution, especially in out-of-domain settings.
We propose a new approach to the authorship attribution task that leverages the distinct linguistic representations learned at different layers of pre-trained transformer-based models. We evaluate our approach on three datasets, comparing it to a state-of-the-art baseline in both in-domain and out-of-domain scenarios. We find that utilizing multiple transformer layers improves the robustness of authorship attribution models when tested on out-of-domain data, yielding new state-of-the-art results. Our analysis gives further insight into how the model's different layers specialize in representing certain stylistic features, which benefits the model when tested out of domain.
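As a rough illustration of the core idea, the sketch below (not the authors' released code) extracts one embedding per transformer layer with the Hugging Face `transformers` library; the choice of `roberta-base`, the mean pooling over tokens, and how these per-layer embeddings would feed an attribution classifier are all illustrative assumptions.

```python
# Minimal sketch: collect one representation per transformer layer so that
# all layers, not just the last, can inform authorship attribution.
import torch
from transformers import AutoModel, AutoTokenizer

MODEL_NAME = "roberta-base"  # assumption: any pre-trained encoder would do
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
encoder = AutoModel.from_pretrained(MODEL_NAME, output_hidden_states=True)

def layerwise_embeddings(text: str) -> torch.Tensor:
    """Return one mean-pooled embedding per layer, shape (num_layers + 1, hidden)."""
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        outputs = encoder(**inputs)
    # outputs.hidden_states is a tuple of (1, seq_len, hidden) tensors,
    # one per layer including the embedding layer; pool each over tokens.
    return torch.stack([h.mean(dim=1).squeeze(0) for h in outputs.hidden_states])

embs = layerwise_embeddings("An example document whose author we want to identify.")
print(embs.shape)  # torch.Size([13, 768]) for a 12-layer base encoder
```

These per-layer embeddings could then be compared or classified separately per layer, which is one plausible way to probe the layer specialization the abstract describes.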