You will need to grab my arsenal repository and stick it on your python path.
Example data for citation segmentation is included as well as very simple feature extraction (not a serious feature set).
-
Regularization
-
L-BFGS optimization
-
Parameter averaging for sgd and perceptron
The example dataset tagged_references.txt
is due to Andrew McCallum. It is
available here.