T5-REF-CORRUPT-EN: Automatic Error Correction of Academic Referencing According to Institutional Guidelines of the Center for Translation Studies (CTS) of University of Vienna

Objective: This model corrects errors in academic referencing. For example:

Input (wrong sentence): According to Smith & Peterson 2016 56, the translation reveals patterns that suggest underlying semantic shifts

Output (clean sentence): According to Smith and Peterson (2016: 56), the translation reveals patterns that suggest underlying semantic shifts.

Model Details:

Model name: T5-REF-CORRUPT-EN
Base model: T5-base
Language: English
Training data: Synthetically generated using LLMs and synthetically corrupted real student sentences.

Usage Cases: Error correction of academic references according to CTS guidelines.

Example

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_id = "elizaveta-dev/T5-REF-CORRUPT-EN"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

text = "According to Smith & Peterson 2016 56, the translation reveals patterns that suggest underlying semantic shifts."
inputs = tokenizer(text, return_tensors="pt")

outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Use-Cases

The model can perform automatic corrections of various referencing errors, including:

1. Incorrect Citation Type (Parenthetical vs. Narrative)

Example of mistake: (Lopez 2018; Chen 2012) found that cultural context strongly influences translation strategies.

Example of correction: Lopez (2018) and Chen (2012) found that cultural context strongly influences translation strategies.

Example of mistake: This topic has been widely researched Baker (2006).

Example of correction: This topic has been widely researched (Baker 2006).

2. Incorrect Citation for Two Authors

Example of mistake: The concept of functional equivalence was analyzed by Baker & Green (2007).

Example of correction: The concept of functional equivalence was analyzed by Baker and Green (2007).

Example of mistake: Previous research (Müller, Schmidt 2001) highlights challenges in literary translation.

Example of correction: Previous research (Müller & Schmidt 2001) highlights challenges in literary translation.

3. Incorrect Placement of Citations

Example of mistake: According to Williams, translation theory continues to evolve (2011: 77).

Example of correction: According to Williams (2011: 77), translation theory continues to evolve.

4. Redundant Entities

Example of mistake: As Lee (2009) explains, equivalence is central in translation (Lee 2009).

Example of correction: As Lee (2009) explains, equivalence is central in translation.

Downloads last month: 54

Safetensors

Model size

0.2B params

Tensor type

F32