Stars
100+ Chinese Word Vectors 上百种预训练中文词向量
Code for using and evaluating SpanBERT.
Four word embedding models implemented in Python. Supporting arbitrary context features
This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic p…
Tevatron - A flexible toolkit for neural retrieval research and development.
A tool for holistic analysis of language generations systems
KnowBert -- Knowledge Enhanced Contextual Word Representations
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
Neural network toolkit for sentence pair modeling.
Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)
ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor
PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT
Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023
K-NRM: End-to-End Neural Ad-hoc Ranking with Kernel Pooling
Python scripts preprocessing Penn Treebank and Chinese Treebank
A Greek edition of BERT pre-trained language model
This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022
An instruction-based benchmark for text improvements.
Neural Module Network for Reasoning over Text, ICLR 2020
Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".
A simple Pytorch implementation of Gated Graph Neural Networks
PyTorch implementation of A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text (EMNLP 2019)
Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.
NAACL 2019 "Structured Minimally Supervised Learning for Neural Relation Extraction"
Graph revised convolutional network (ECML-PKDD 2020)