Co-occurrence Rate Networks

This verision is used only for verifying the experimental results in the paper (Linear co-occurrence rate networks (L-CRNs) for sequence labeling, Zhemin Zhu, Djoerd Hiemstra, Peter Apers, Statistical Language and Speech Processing 2014, Springer. pp. 185-196). All rights of the datasets belong to their original authors.

Introduction

Co-occurrence rate networks are for sequence labeling tasks, such as named entity recognition, part-of-speech tagging … The applications of this software are similar to CRFs (http://crfpp.googlecode.com/svn/trunk/doc/index.html). But CRN can be trained much faster and obtain better or very competitive results.

System and Compiler

The Ubuntu 12.04 and gcc 4.7.3 are used for compiling the software. We do not know if this works on other systems. If your gcc is old version, you can update it using these steps to gcc 4.7.3:

sudo add-apt-repository ppa:ubuntu-toolchain-r/test
sudo apt-get update
sudo apt-get install gcc-4.7
sudo update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-4.6 60 --slave /usr/bin/g++ g++ /usr/bin/g++-4.6
sudo update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-4.7 40 --slave /usr/bin/g++ g++ /usr/bin/g++-4.7
sudo update-alternatives --config gcc

Installation

sudo make

Usage

./train train_file template_file model_folder
./decode model_folder test_file result_file

Using these commands to reproduce the experiments on NER datasets in the submitted paper:

./train ./data/ner_en.train ./data/ner.template ./model/
./decode ./model/ ./data/ner_en.testa testa.result
./decode ./model/ ./data/ner_en.testb testb.result

Data Format

Training and decoding data has the same format, see examples in the ./data.

Template Format

See example in the ./data for the template format.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
data		data
liblinear		liblinear
model		model
CPT.cpp		CPT.cpp
CPT.h		CPT.h
CR.cpp		CR.cpp
CR.h		CR.h
Corpus.cpp		Corpus.cpp
Corpus.h		Corpus.h
DecodeModel.cpp		DecodeModel.cpp
DecodeModel.h		DecodeModel.h
IndexedCorpus.cpp		IndexedCorpus.cpp
IndexedCorpus.h		IndexedCorpus.h
Indices.cpp		Indices.cpp
Indices.h		Indices.h
Makefile		Makefile
README.md		README.md
Result.cpp		Result.cpp
Result.h		Result.h
SVRLinearModel.cpp		SVRLinearModel.cpp
SVRLinearModel.h		SVRLinearModel.h
Template.cpp		Template.cpp
Template.h		Template.h
TrainModel.cpp		TrainModel.cpp
TrainModel.h		TrainModel.h
Utils.cpp		Utils.cpp
Utils.h		Utils.h
ViterbiDecoder.cpp		ViterbiDecoder.cpp
ViterbiDecoder.h		ViterbiDecoder.h
decode.cpp		decode.cpp
liblinear.h		liblinear.h
pmi-crf.cpp		pmi-crf.cpp
testa.result		testa.result
testb.result		testb.result
train.cpp		train.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Co-occurrence Rate Networks

Introduction

System and Compiler

Installation

Usage

Using these commands to reproduce the experiments on NER datasets in the submitted paper:

Data Format

Template Format

About

Releases

Packages

Languages

zheminzhu/Co-occurrence-Rate-Networks

Folders and files

Latest commit

History

Repository files navigation

Co-occurrence Rate Networks

Introduction

System and Compiler

Installation

Usage

Using these commands to reproduce the experiments on NER datasets in the submitted paper:

Data Format

Template Format

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages