Skip to content

Latest commit

 

History

History
25 lines (19 loc) · 997 Bytes

README.md

File metadata and controls

25 lines (19 loc) · 997 Bytes

NICE

The files used to create this dataset are downloaded from the World Intellectual Property Organization (WIPO).

Retrieved July 17, 2022

ncl-20220101-classification_top_structure-20210623.zip > ncl-20220101-classification_top_structure-20210623.xml as labels.xml

ncl-20220101-classification_texts-20210623.zip > ncl-20220101-en-classification_texts-20210623.xml as texts.xml

Setup

To create the dataset, put the downloaded files in the following structure:

.
├── NICE
├── NICE_binary
├── creation.ipynb
├── creation.py
├── README.md
└── source
    ├── labels.xml
    └── texts.xml

Now run the creation.py script or creation.ipynb notebook. This will create the NICE-45 and NICE-2 datasets.