bert-base-cased-NER-reranker

A BERT model trained on the synthetic literary NER context retrieval dataset Amalvy et. al, 2023 (arXiv).

To use this model, construct a text of the form NER-sentence [SEP] context-sentence. The model should predict the positive class if context-sentence is useful to predict NER-sentence, and the negative class otherwise.

Performance Metrics

The model obtains 98.34 F1 on the synthetic test set. See Amalvy et. al, 2023 for details about NER performance gains when using this retriever model to assit a NER model at inference.

How to Reproduce Training

See the training script here.

Citation

If you use this model in your research, please cite:

@InProceedings{Amalvy2023,
  title	       = {Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset},
  author       = {Amalvy, A. and Labatut, V. and Dufour, R.},
  booktitle    = {2023 Conference on Empirical Methods in Natural Language Processing},
  year	       = {2023},
  doi	       = {10.18653/v1/2023.emnlp-main.642},
  pages	       = {10372-10382},
}

Downloads last month: 10

Safetensors

Model size

0.1B params

Tensor type

I64

F32

Inference Providers NEW

Text Ranking

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for compnet-renard/bert-base-cased-NER-reranker

Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset

Paper • 2310.10118 • Published Oct 16, 2023