naist-nlp
/

IMPARA-GED

Text Classification

Model card Files Files and versions

IMPARA-GED / README.md

yusuke1997's picture

Update README.md

85524f8 verified 3 months ago

|

history blame contribute delete

3.42 kB

	---
	language:
	- en
	base_model:
	- answerdotai/ModernBERT-large
	pipeline_tag: text-classification
	tags:
	- GEC
	---

	# IMPARA-GED

	This is the repository of the IMPARA-GED, a quality estimator for the grammatical error correction evaluation.

	---
	## Abstract
	From [IMPARA-GED: Grammatical Error Detection is Boosting Reference-free Grammatical Error Quality Estimator](https://aclanthology.org/2025.findings-acl.1315/) (Findings of ACL2025)

	> We propose IMPARA-GED, a novel reference-free automatic grammatical error correction (GEC) evaluation method with grammatical error detection (GED) capabilities. We focus on the quality estimator of IMPARA, an existing automatic GEC evaluation method, and construct that of IMPARA-GED using a pre-trained language model with enhanced GED capabilities. Experimental results on SEEDA, a meta-evaluation dataset for automatic GEC evaluation methods, demonstrate that IMPARA-GED achieves the highest correlation with human sentence-level evaluations.

	---
	## Usage

	STEP 1:
	```
	pip install gec-metric
	```

	STEP 2:
	```
	from gec_metrics import get_metric, get_meta_eval
	from pprint import pprint
	c = get_metric('impara')
	metric = c(c.Config(
	model_qe='naist-nlp/IMPARA-GED',
	threshold=-999.0, # Ignore similarity scores
	pooling='mean' # Our model was trained with mean pooling
	))
	print(metric.config)
	c = get_meta_eval('seeda')
	meta = c(c.Config('base'))

	print('=== System-level meta-evaluation ===')
	system_results = meta.corr_system(metric, aggregation='trueskill')
	print('SEEDA-S')
	pprint(system_results.ts_sent)
	print('SEEDA-E')
	pprint(system_results.ts_edit)

	print('=== Sentence-level meta-evaluation ===')
	sentence_results = meta.corr_sentence(metric)
	print('SEEDA-S')
	pprint(sentence_results.sent)
	print('SEEDA-E')
	pprint(sentence_results.edit)
	```



	See details: [`gec-metric`](https://github.com/gotutiyan/gec-metrics).



	---
	## Citation

	Bibkey (For [anthology.bib](http://aclweb.org/anthology/anthology.bib)):

	```sakai-etal-2025-impara```

	Bibtex:
	```
	@inproceedings{sakai-etal-2025-impara,
	title = "{IMPARA}-{GED}: Grammatical Error Detection is Boosting Reference-free Grammatical Error Quality Estimator",
	author = "Sakai, Yusuke and
	Goto, Takumi and
	Watanabe, Taro",
	editor = "Che, Wanxiang and
	Nabende, Joyce and
	Shutova, Ekaterina and
	Pilehvar, Mohammad Taher",
	booktitle = "Findings of the Association for Computational Linguistics: ACL 2025",
	month = jul,
	year = "2025",
	address = "Vienna, Austria",
	publisher = "Association for Computational Linguistics",
	url = "https://aclanthology.org/2025.findings-acl.1315/",
	doi = "10.18653/v1/2025.findings-acl.1315",
	pages = "25647--25654",
	ISBN = "979-8-89176-256-5",
	abstract = "We propose IMPARA-GED, a novel reference-free automatic grammatical error correction (GEC) evaluation method with grammatical error detection (GED) capabilities. We focus on the quality estimator of IMPARA, an existing automatic GEC evaluation method, and construct that of IMPARA-GED using a pre-trained language model with enhanced GED capabilities. Experimental results on SEEDA, a meta-evaluation dataset for automatic GEC evaluation methods, demonstrate that IMPARA-GED achieves the highest correlation with human sentence-level evaluations."
	}
	```

	## Contact

	Yusuke Sakai ([@yusuke1997](https://huggingface.co/yusuke1997))