Official Code of "Improving Chinese Named Entity Recognition by Search Engine Augmentation"
-
torch=1.10.2+cu113
-
transformers=4.17
-
running with CUDA 11.4 and Python 3.9.7
-
download BERT pretrained weights from huggingface to
prev_trained_model
directory. We use bert-base chinese -
modify config parameters in scripts/run_ner_crf_xxx.sh
-
run
sh scripts/run_ner_crf_xxx.sh
- Experiments were conducted on three public datasets (Chinese Resume / People's Daily / Weibo NER) and our self-built unconverntional NER datasets from social media BiliBili.
- "Unconventional" means named entities contains more polysemous words and grammatical ambiguities. Please contact me if you are interested about this dataset.
If you find FiD-NER interesting and helps your research, please consider citing our work
@article{mao2022improving,
title={Improving Chinese Named Entity Recognition by Search Engine Augmentation},
author={Mao, Qinghua and Li, Jiatong and Meng, Kui},
journal={arXiv preprint arXiv:2210.12662},
year={2022}
}