Skip to content
View WhiteJr's full-sized avatar

Block or report WhiteJr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

百度资讯爬虫

Python 5 Updated Jul 11, 2024

小红书数据采集,小红书逆向,小红书 x-s逆向,小红书爬虫

11 Updated Aug 29, 2024

小红书内容自动爬取,selenium+fiddler+微信小程序

Python 101 23 Updated Mar 13, 2022

Open source free capture HTTP(S) traffic software ProxyPin, supporting full platform systems

Dart 7,875 679 Updated Jan 18, 2025

A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

JavaScript 37,060 4,550 Updated Jan 21, 2025

a spider for tencent gongyi data

JavaScript 3 Updated Jun 9, 2023

一款入门级的人脸、视频、文字检测以及识别的项目.

Python 10,847 2,519 Updated Apr 16, 2020

A curated list of graph-based fraud, anomaly, and outlier detection papers & resources

1,488 261 Updated Nov 27, 2024

Source Code for 'Python Data Analytics, 2nd Edition' by Fabio Nelli

Jupyter Notebook 71 93 Updated Oct 2, 2018

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,562 1,550 Updated May 23, 2024

中文自然语言推理数据集(A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset)

426 45 Updated Feb 10, 2020

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,052 543 Updated May 23, 2024

An elegent pytorch implement of transformers

Python 1,268 161 Updated Jan 19, 2025

A python tool for evaluating the quality of sentence embeddings.

Python 2,091 309 Updated Mar 19, 2024

本项目在电网内网邮箱系统使用中记录的问答数据上,设计基于知识图谱的智能问答客服系统,主要涉及到的算法为无监督文本相似度算法:simCSE。

Python 7 2 Updated Sep 17, 2021

SimCSE中文语义相似度对比学习模型

Python 79 11 Updated Mar 7, 2022

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,487 521 Updated Oct 16, 2024

experiments of some semantic matching models and comparison of experimental results.

Python 161 15 Updated Jun 12, 2023

一个用于训练句子embedding的工具,支持Cosent以及Simcse

Python 17 1 Updated Nov 19, 2024

文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT

Python 73 12 Updated Nov 26, 2024

Tutorial notebook on SimCSE (Ja)

Jupyter Notebook 10 Updated Nov 9, 2023

mSimCSE: Multilingual SimCSE

Python 34 1 Updated Nov 14, 2022

句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。

Python 96 13 Updated Oct 29, 2022

This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and distributed training is possible with Amazon SageMaker.

Jupyter Notebook 23 7 Updated Oct 6, 2023

基于simcse的中文句向量生成

Python 15 4 Updated Jun 8, 2022

A simple implementation of SimCSE

Python 76 10 Updated Oct 31, 2022

中文数据集下SimCSE+ESimCSE的实现

Python 192 33 Updated May 21, 2022

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Jupyter Notebook 2,234 394 Updated Sep 29, 2023

Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"

Python 293 27 Updated Oct 27, 2022

中文无监督SimCSE Pytorch实现

Python 133 31 Updated Jul 8, 2021
Next