Repo for LinkNER: Linking Local Named Entity Recognition Models to Large Language Models using Uncertainty
We provide a demo version of the code so it can be easily copied and run.
Currently, the supported LLMs include the Llama, Qwen, and GPT series. Additional models can be added as needed.
Run the following command to install the required dependencies:
pip install -r requirement.txt
We offer the following scripts to facilitate training and inference. The parameters that need to be modified are represented as placeholders in the scripts.
To train the SpanNER model, execute:
bash scripts/spanner_train.sh
To train the E-NER model, execute:
bash scripts/ener_train.sh
After obtaining the checkpoint from Step 1, run the following script for inference and save the results:
bash scripts/local_inference.sh
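For reference, the handoff between this step and the LLM refinement step is a file of local-model predictions with uncertainty scores. The sketch below shows one plausible shape; the file name `local_results.json` and the field names are illustrative assumptions, not the repo's actual output format.

```python
import json

# Hypothetical shape of the saved local-model output consumed by the LLM step.
predictions = [
    {
        "sentence": "Chicago beat Seattle 4-1.",
        "span": "Chicago",
        "label": "ORG",
        "uncertainty": 0.71,  # score produced by the local model's uncertainty estimate
    },
]

with open("local_results.json", "w") as f:  # illustrative file name
    json.dump(predictions, f, indent=2)
```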
Use the LLM to refine predictions whose uncertainty exceeds the specified threshold:
bash scripts/llm_inference.sh
Alternatively, you can have the local model and the LLM work together on each example by including LLM calls directly during inference; a sketch of this linking logic follows.
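The sketch below summarizes the linking step under assumed names (`local_model.predict`, `llm_refine`, and `threshold` are placeholders, not the repo's actual API): the local model labels every span, and only spans whose uncertainty exceeds the threshold are sent to the LLM for re-classification.

```python
def link_ner(example, local_model, llm_refine, threshold=0.5):
    """Run the local NER model first; defer to the LLM only when uncertain."""
    spans = local_model.predict(example)  # each predicted span carries an uncertainty score
    results = []
    for span in spans:
        if span.uncertainty > threshold:
            # Uncertain prediction: ask the LLM to re-classify this span.
            span = llm_refine(example, span)
        results.append(span)
    return results
```

Raising the threshold keeps more decisions local (cheaper and faster); lowering it routes more spans to the LLM, which is typically more robust on hard or unseen entities.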
The download links for the datasets used in this work are as follows:
We also provide several training and test sets based on CoNLL 2003 in the data/ directory.
For E-NER, we utilize BERT-base and BERT-large.
For the GPT series, we use the OpenAI API, specifically gpt-3.5-turbo (see the sketches below).
For Llama 3, we utilize Llama-3-8b-it.
For Qwen 2.5, we utilize Qwen/Qwen2.5-7B-Instruct.
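For the GPT series, a call via the OpenAI Python client (openai>=1.0) might look like the following minimal sketch; the prompt wording is illustrative, not the repo's actual prompt.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

prompt = (
    "In the sentence 'Chicago beat Seattle 4-1.', "
    "is the span 'Chicago' a PER, LOC, ORG, or MISC entity? Answer with one label."
)
response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": prompt}],
    temperature=0,
)
print(response.choices[0].message.content)
```

For the open-weight models, a minimal sketch using the Hugging Face transformers chat-template API (the example question is again illustrative):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-7B-Instruct"  # or the Llama-3-8b-it checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

messages = [{
    "role": "user",
    "content": "In the sentence 'Chicago beat Seattle 4-1.', "
               "is the span 'Chicago' a PER, LOC, ORG, or MISC entity?",
}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output_ids = model.generate(input_ids, max_new_tokens=64)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```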
When citing our work, please consider citing the original papers. The relevant citation information is listed below.
@inproceedings{10.1145/3589334.3645414,
  author    = {Zhang, Zhen and Zhao, Yuhua and Gao, Hang and Hu, Mengting},
  title     = {LinkNER: Linking Local Named Entity Recognition Models to Large Language Models using Uncertainty},
  year      = {2024},
  isbn      = {9798400701719},
  publisher = {Association for Computing Machinery},
  address   = {New York, NY, USA},
  url       = {https://doi.org/10.1145/3589334.3645414},
  doi       = {10.1145/3589334.3645414},
  booktitle = {Proceedings of the ACM Web Conference 2024},
  pages     = {4047--4058},
  numpages  = {12},
  keywords  = {information extraction, large language models, robustness, uncertainty estimation},
  location  = {Singapore, Singapore},
  series    = {WWW '24}
}