Skip to content
View louiseGAN514's full-sized avatar

Block or report louiseGAN514

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,568 1,192 Updated Feb 1, 2025

New dataset

Python 301 21 Updated Aug 31, 2021

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 21,746 1,904 Updated Jan 23, 2025

CHisIEC An Information Extraction Corpus for Ancient Chinese History

6 1 Updated Jun 20, 2024

An evaluation bentchmark for classical Chinese

Python 12 2 Updated Dec 13, 2023

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,879 111 Updated Jun 1, 2023

[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs

Python 80 5 Updated Nov 17, 2024

A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network

Python 284 19 Updated Sep 28, 2024

Language model Prompt And Query Archive

Shell 158 13 Updated May 11, 2021

An implementation of "Fair Attribute Completion on Graph with Missing Attributes" paper. Accepted TMLR

Jupyter Notebook 2 Updated Nov 9, 2024

Source code for the paper "CAT: Interpretable Concept-based Taylor Additive Models".

Jupyter Notebook 18 Updated Aug 26, 2024

[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct

Python 152 8 Updated Jan 16, 2025
Python 4 Updated Jan 6, 2025

The repo for the article: CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search

Jupyter Notebook 5 1 Updated Dec 12, 2024

Get your documents ready for gen AI

Python 20,493 1,120 Updated Feb 11, 2025

Text of the Dhammapadi (Pali language) with Latin translation

Jupyter Notebook 3 Updated Nov 14, 2023

Repository for Fine-grained Contrastive Learning for Relation Extraction

Python 4 1 Updated Apr 17, 2023

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

5,831 971 Updated Feb 15, 2023

[NAACL 2021] A Frustratingly Easy Approach for Entity and Relation Extraction https://arxiv.org/abs/2010.12812

Python 800 122 Updated Jul 7, 2022

This repository implements our EMNLP 2022 research paper A Dataset for Hyper-Relational Extraction and a Cube-Filling Approach.

Python 27 2 Updated Dec 13, 2022
Python 4 Updated Feb 8, 2024
Python 29 5 Updated Nov 16, 2022

The source code of KDD2024 paper RMR.

Python 4 Updated Sep 29, 2024

[KDD'2024] "LLM4Graph: A Survey of Large Language Models for Graphs"

278 12 Updated Sep 1, 2024

[KDD'2024] "HiGPT: Heterogenous Graph Language Models"

Python 119 6 Updated Jun 5, 2024

[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners

Python 113 10 Updated Sep 13, 2024

Official implementation of paper "Autonomous Data Selection with Language Models for Mathematical Texts" (As Huggingface Daily Papers: https://huggingface.co/papers/2402.07625)

Python 79 5 Updated Nov 4, 2024

PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning (ICML 2024)

Python 11 9 Updated Nov 15, 2024

Learning from Negative samples for Biomedical Generative Entity Linking

Python 17 Updated Sep 11, 2024
Next