Stars
Train transformer language models with reinforcement learning.
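A minimal supervised fine-tuning sketch with TRL's SFTTrainer; the model and dataset names are placeholders, and argument names have shifted between TRL releases:

    from datasets import load_dataset
    from trl import SFTConfig, SFTTrainer

    # placeholder model/dataset; recent TRL versions accept a model id string directly
    dataset = load_dataset("stanfordnlp/imdb", split="train")
    trainer = SFTTrainer(
        "facebook/opt-350m",
        train_dataset=dataset,
        args=SFTConfig(output_dir="/tmp/sft-opt"),
    )
    trainer.train()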
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
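A typical sparse-retrieval call with Pyserini, assuming the prebuilt MS MARCO passage index name used in its documentation:

    from pyserini.search.lucene import LuceneSearcher

    # downloads and caches a prebuilt BM25 index on first use
    searcher = LuceneSearcher.from_prebuilt_index('msmarco-v1-passage')
    hits = searcher.search('what is a lobster roll?', k=10)
    for i, hit in enumerate(hits):
        print(f'{i + 1:2} {hit.docid:20} {hit.score:.5f}')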
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
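A sketch of the unified rerankers API, assuming a cross-encoder checkpoint as the example model; the exact result-object helpers may differ between versions:

    from rerankers import Reranker

    # model name and type are illustrative; the same interface covers other reranker families
    ranker = Reranker("cross-encoder/ms-marco-MiniLM-L-6-v2", model_type="cross-encoder")
    results = ranker.rank(
        query="What is the capital of France?",
        docs=["Paris is the capital of France.", "Berlin is the capital of Germany."],
    )
    print(results.top_k(1))  # best-scoring document(s)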
A machine learning benchmark of in-the-wild distribution shifts, with data loaders, evaluators, and default models.
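A data-loading sketch for WILDS, following the pattern in its documentation; the dataset name and transforms are illustrative:

    import torchvision.transforms as transforms
    from wilds import get_dataset
    from wilds.common.data_loaders import get_train_loader

    # download a benchmark dataset and build a standard (non-grouped) training loader
    dataset = get_dataset(dataset="iwildcam", download=True)
    train_data = dataset.get_subset(
        "train",
        transform=transforms.Compose([transforms.Resize((448, 448)), transforms.ToTensor()]),
    )
    train_loader = get_train_loader("standard", train_data, batch_size=16)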
Large Action Model framework to develop AI Web Agents
A dataset of atomic Wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contains ~43 million edits across 8 languages.
Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.
Reference implementation for DPO (Direct Preference Optimization)
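For context, the DPO objective is compact enough to sketch in a few lines of PyTorch; this is a paraphrase of the loss, not the reference repository's exact code:

    import torch.nn.functional as F

    def dpo_loss(policy_chosen_logps, policy_rejected_logps,
                 ref_chosen_logps, ref_rejected_logps, beta=0.1):
        # log-probability ratios of chosen vs. rejected completions
        pi_logratios = policy_chosen_logps - policy_rejected_logps
        ref_logratios = ref_chosen_logps - ref_rejected_logps
        # DPO: push the policy's log-ratio above the reference model's log-ratio
        losses = -F.logsigmoid(beta * (pi_logratios - ref_logratios))
        # implicit rewards, handy for logging preference margins
        chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps).detach()
        rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps).detach()
        return losses.mean(), chosen_rewards, rejected_rewards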
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
Generative Representational Instruction Tuning
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
Stanford NLP Python library for understanding and improving PyTorch models via interventions
Dataset of synthetic job ad sentences tagged with ESCO skills. From the paper Extreme Multi-Label Skill Extraction Training using Large Language Models.
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
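A small Stanza pipeline example; English models are assumed, and the processor list can be trimmed or extended:

    import stanza

    stanza.download("en")  # fetch the English models once
    nlp = stanza.Pipeline("en", processors="tokenize,pos,ner")
    doc = nlp("Barack Obama was born in Hawaii.")
    for sentence in doc.sentences:
        for ent in sentence.ents:
            print(ent.text, ent.type)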
Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.
SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings
The dataset used to evaluate JobBERT on the task of job title normalization.
KarelDO / wl-coref
Forked from vdobrovolskii/wl-coref. State-of-the-art efficient coreference. This repository contains the code for the CRAC-2023 paper "CAW-coref: Conjunction-Aware Word-level Coreference Resolution". Forked from the EMNLP-2021 paper …
Inspecting and Editing Knowledge Representations in Language Models
BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
High-speed download of LLaMA, Facebook's 65B-parameter GPT-style language model
QLoRA: Efficient Finetuning of Quantized LLMs
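A QLoRA-style setup sketch using Hugging Face Transformers and PEFT; the model name and quantization settings illustrate the paper's NF4 recipe and are not taken from this repository:

    import torch
    from transformers import AutoModelForCausalLM, BitsAndBytesConfig
    from peft import prepare_model_for_kbit_training

    # 4-bit NF4 quantization with double quantization, as in the QLoRA paper
    bnb_config = BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_use_double_quant=True,
        bnb_4bit_compute_dtype=torch.bfloat16,
    )
    model = AutoModelForCausalLM.from_pretrained(
        "huggyllama/llama-7b",
        quantization_config=bnb_config,
        device_map="auto",
    )
    model = prepare_model_for_kbit_training(model)  # then attach LoRA adapters via PEFT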
Home of StarCoder: fine-tuning & inference!
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
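A minimal PEFT example wrapping a base model with a LoRA adapter; the base model and hyperparameters are placeholders:

    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, TaskType, get_peft_model

    base = AutoModelForCausalLM.from_pretrained("gpt2")
    config = LoraConfig(task_type=TaskType.CAUSAL_LM, r=8, lora_alpha=32, lora_dropout=0.05)
    model = get_peft_model(base, config)
    model.print_trainable_parameters()  # only a small fraction of weights are trainable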
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of language models.