Stars
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
Diaphora, the most advanced Free and Open Source program diffing tool.
Binary Code Similarity Analysis (BCSA) Benchmark
Binary Code Similarity Analysis (BCSA) Tool
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。
An extremely fast implementation of Aho Corasick algorithm based on Double Array Trie.
Java implementation of the Aho-Corasick algorithm for efficient string matching
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
Implementation of triplet loss in TensorFlow
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
Build your neural network easy and fast, 莫烦Python中文教学
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …