An R package for text embedding regression.

petershan1119/conText

About

An R package for estimating text embedding regression models as described in Rodriguez, Spirling and Stewart (2021).

How to Install

devtools::install_github("prodriguezsosa/conText")

Datasets

To use conText you will need three datasets:

  1. A corpus with the text and corresponding metadata you want to evaluate.
  2. A set of pre-trained embeddings (a V by D matrix) used to embed context words.
  3. A transformation matrix (D by D) specific to the pre-trained embeddings.
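
To make the shapes of these three inputs concrete, here is a minimal base R sketch. It does not use conText functions or the real datasets: the vocabulary, the random numbers, the identity transformation matrix, and the helper `embed_context` are all hypothetical stand-ins. It assumes that a context is embedded by averaging the pre-trained embeddings of its words and then applying the transformation matrix.

```r
# Toy illustration only: random numbers stand in for real pre-trained embeddings.
set.seed(42)

vocab <- c("immigration", "border", "policy", "reform", "economy")
V <- length(vocab)  # vocabulary size
D <- 4              # embedding dimension

# (1) A corpus with text and metadata (hypothetical columns).
corpus <- data.frame(
  text = c("border policy reform", "economy policy"),
  party = c("A", "B"),
  stringsAsFactors = FALSE
)

# (2) Pre-trained embeddings: a V by D matrix, one row per vocabulary word.
embeddings <- matrix(rnorm(V * D), nrow = V, ncol = D,
                     dimnames = list(vocab, NULL))

# (3) Transformation matrix: D by D. The identity is only a placeholder here.
transform_matrix <- diag(D)

# Hypothetical helper: average the embeddings of the context words,
# then multiply by the transformation matrix.
embed_context <- function(text, embeddings, transform_matrix) {
  tokens <- strsplit(text, " ", fixed = TRUE)[[1]]
  tokens <- tokens[tokens %in% rownames(embeddings)]  # drop out-of-vocabulary words
  avg <- colMeans(embeddings[tokens, , drop = FALSE])
  as.vector(transform_matrix %*% avg)
}

# One D-dimensional embedding per document: an n by D matrix.
context_embeddings <- t(sapply(corpus$text, embed_context,
                               embeddings = embeddings,
                               transform_matrix = transform_matrix,
                               USE.NAMES = FALSE))
dim(context_embeddings)
```

The row names of the embeddings matrix do the lookup work here, which is why the corpus tokens and the embedding vocabulary need to match.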

In this Dropbox folder (see the /data folder) we have included the three datasets we use in the Quick Start Guide along with their documentation. Due to memory constraints we could not include them directly in the package. We'll be adding other useful datasets to this folder in the near future.

Quick Start Guide

Check out this Quick Start Guide to get going with conText. If it makes sense to estimate your own embeddings and transformation matrix, also check out this Quick Start Guide - Local Transform.
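
As a rough idea of what estimating a transformation matrix can involve, the sketch below fits a D by D matrix by ordinary least squares so that it maps each word's average context embedding onto that word's own embedding. This is an assumption about the general approach, not the procedure documented in the guide. All names (`avg_context`, `word_vecs`, `true_A`, `A_hat`) and the simulated data are hypothetical.

```r
# Toy illustration only: simulated data, not real embeddings.
set.seed(1)
V <- 50  # number of words
D <- 3   # embedding dimension

# Hypothetical inputs, one row per word:
# avg_context[w, ] is the average embedding of the contexts in which word w appears.
avg_context <- matrix(rnorm(V * D), nrow = V, ncol = D)

# A known matrix used to simulate noise-free word embeddings.
true_A <- diag(c(2, 1, 0.5))
word_vecs <- avg_context %*% t(true_A)  # row w equals true_A %*% avg_context[w, ]

# Least squares: choose A to minimise the sum over words of
# || word_vecs[w, ] - A %*% avg_context[w, ] ||^2.
# In row form word_vecs is approximately avg_context %*% t(A),
# so t(A) solves the normal equations.
A_hat <- t(solve(crossprod(avg_context), crossprod(avg_context, word_vecs)))

# With noise-free toy data the estimate recovers true_A up to rounding.
all.equal(A_hat, true_A)
```

Because the fitted matrix depends on the embeddings it was estimated from, a transformation matrix estimated on one set of embeddings should not be reused with another.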
