| # Key Information Extraction | |
| ## Overview | |
| The structure of the key information extraction dataset directory is organized as follows. | |
| ```text | |
| βββ wildreceipt | |
| βββ class_list.txt | |
| βββ dict.txt | |
| βββ image_files | |
| βββ openset_train.txt | |
| βββ openset_test.txt | |
| βββ test.txt | |
| βββ train.txt | |
| ``` | |
| ## Preparation Steps | |
| ### WildReceipt | |
| - Just download and extract [wildreceipt.tar](https://download.openmmlab.com/mmocr/data/wildreceipt.tar). | |
| ### WildReceiptOpenset | |
| - Step0: have [WildReceipt](#WildReceipt) prepared. | |
| - Step1: Convert annotation files to OpenSet format: | |
| ```bash | |
| # You may find more available arguments by running | |
| # python tools/data/kie/closeset_to_openset.py -h | |
| python tools/data/kie/closeset_to_openset.py data/wildreceipt/train.txt data/wildreceipt/openset_train.txt | |
| python tools/data/kie/closeset_to_openset.py data/wildreceipt/test.txt data/wildreceipt/openset_test.txt | |
| ``` | |
| :::{note} | |
| You can learn more about the key differences between CloseSet and OpenSet annotations in our [tutorial](../tutorials/kie_closeset_openset.md). | |
| ::: | |