Skip to content
View umbrellabeach's full-sized avatar
  • Zhejiang University
  • Hangzhou, China

Block or report umbrellabeach

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Retrieval and Retrieval-augmented LLMs

Python 8,318 609 Updated Jan 23, 2025

利用指针网络进行信息抽取,包含命名实体识别、关系抽取、事件抽取。

Python 124 18 Updated Apr 5, 2023

同义词表,反义词表,否定词表

525 204 Updated Oct 17, 2024
Python 1,128 407 Updated Nov 3, 2023

中文医学NLP公开资源整理:术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc

2,248 379 Updated Jan 17, 2024

收录NLP竞赛策略实现、各任务baseline、相关竞赛经验贴(当前赛事、往期赛事、训练赛)、NLP会议时间、常用自媒体、GPU推荐等,持续更新中

Python 2,193 254 Updated Aug 29, 2023

超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题

Python 125 29 Updated Oct 9, 2021

Entity and Relation Extraction Based on TensorFlow and BERT. 基于TensorFlow和BERT的管道式实体及关系抽取,2019语言与智能技术竞赛信息抽取任务解决方案。Schema based Knowledge Extraction, SKE 2019

Python 1,227 272 Updated Jun 1, 2020

Automatic Detection of Sexist Statements Commonly Used at the Workplace (PAKDD LDRC '20)

Python 8 9 Updated Dec 8, 2022

移动app知识图谱

54 1 Updated Jul 21, 2024

Source code for ACL 2021 finding paper: CasEE: A Joint Learning Framework with Cascade Decoding for Overlapping Event Extraction.

Python 80 18 Updated Dec 3, 2021

Creating class-based TF-IDF matrices

Python 82 19 Updated Oct 14, 2022

文本聚类(Kmeans、DBSCAN、LDA、Single-pass)

Python 337 89 Updated May 12, 2021
Python 278 79 Updated Apr 26, 2022

Pytorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021

Python 297 57 Updated Oct 23, 2023

科大讯飞2020事件抽取挑战赛第一名解决方案&完整事件抽取系统

Python 538 122 Updated Dec 29, 2020

Tensorflow implementation of "Language Modeling with Gated Convolutional Networks"

Python 271 98 Updated Jan 16, 2017

中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark

Python 751 130 Updated May 3, 2023

Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)

Python 2,186 247 Updated Jul 17, 2024

A curated list of network embedding techniques.

2,598 504 Updated Dec 8, 2020

Collections of Chinese NLP corpus

Python 887 209 Updated Dec 28, 2020

Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)

Python 2,133 429 Updated Mar 11, 2023

Multi-stage passage ranking: monoBERT + duoBERT

Python 112 15 Updated Nov 23, 2020

📙 中华新华字典数据库。包括歇后语,成语,词语,汉字。

Python 11,037 2,594 Updated Dec 26, 2023

transform multi-label classification as sentence pair task, with more training data and information

Python 178 29 Updated Dec 13, 2019

史上最大规模1.4亿中文知识图谱开源下载

Python 4,980 728 Updated Dec 6, 2023

YAGO is a large semantic knowledge base, derived from Wikipedia, WordNet, WikiData, GeoNames, and other data sources

Java 733 87 Updated Jul 5, 2022

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

Python 2,394 439 Updated Sep 3, 2024

Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard

Python 1,775 246 Updated Feb 18, 2023
Next