` ^##\s -> #\s
^\s\s\d. -> \s\s- `
-
https://github.com/labmlai/annotated_deep_learning_paper_implementations
-
https://github.com/CLUEbenchmark/DataCLUE
- 不同与传统的以模型为中心的AI,该repo是关于以数据为中心的AI
-
https://github.com/IDEA-CCNL/Fengshenbang-LM
- 封神榜系列模型
- 官网:https://fengshenbang-lm.com
- huggingface web: https://huggingface.co/IDEA-CCNL?sort_models=downloads#models
-
Benchmark测试基准
- ZeroCLUE
- 零样本学习测评基准,中文版
- https://github.com/alibaba-research/ChineseBLUE
- https://github.com/CLUEbenchmark/CLUE
- GLUE:https://gluebenchmark.com/
- https://github.com/CLUEbenchmark/pCLUE
- 中文提示学习排行榜,将多个不同NLP任务转化成基于prompt格式,使用生成式模型进行预测
- https://github.com/clue-ai/PromptCLUE
- FewNLU
- ZeroCLUE
- https://github.com/DengBoCong/nlp-paper
- https://github.com/changwookjun/nlp-paper
- https://github.com/iwangjian/Paper-Reading
- https://github.com/sebastianruder/NLP-progress
- https://github.com/bplank/awesome-neural-adaptation-in-NLP
- https://github.com/zhpmatrix/nlp-competitions-list-review
- https://github.com/ivan-bilan/The-NLP-Pandect
- https://github.com/DA-southampton/Tech_Aarticle
- https://github.com/GanjinZero/awesome_Chinese_medical_NLP
- https://github.com/NiuTrans/ABigSurvey
- https://github.com/MLNLP-World/Top-AI-Conferences-Paper-with-Code
- https://github.com/crownpku/Awesome-Chinese-NLP
- https://github.com/fighting41love/funNLP
- https://github.com/xcfcode/What-I-Have-Read
- https://github.com/PaddlePaddle/awesome-DeepLearning
-
https://github.com/alibaba/AliceMind
- 集成了多种领域的预训练语言模型
-
chinese nlp pkgs
- https://github.com/dongrixinyu/JioNLP
- https://github.com/HIT-SCIR/ltp
- LTP(Language Technology Platform) 提供了一系列中文自然语言处理工具,用户可以使用这些工具对于中文文本进行分词、词性标注、句法分析等等工作。
- 分析transformer模型的参数量、计算量、中间激活、KV cache
- PyTorch显存分配原理——以BERT为例
- Self Attention 固定激活值显存分析与优化及PyTorch实现
- BertLarge 中间激活值分析
- [实践] Sequence Parallel
- https://pytorch.org/tutorials/intermediate/autograd_saved_tensors_hooks_tutorial.html
- torchinfo
- 使用 torchinfo 可以查看模型共有多少参数,比直接print(model)的信息更全,可视化效果更好
- 如果指定输入向量的shape,可以得到每一层输出的shape,总共占用多少显存
- https://github.com/PaddlePaddle/PaddleSpeech
- https://github.com/yeyupiaoling/PPASR
- https://github.com/nkrnrnk/BertPunc
- https://github.com/geekinglcq/CDCS.git
- https://github.com/EliotAndres/kaggle-past-solutions.git
- https://github.com/HuangCongQing/AI_competitions.git
- https://github.com/iphysresearch/DataSciComp.git
- https://github.com/Smilexuhc/Data-Competition-TopSolution.git
- https://github.com/yaleimeng/Final_word_Similarity
- https://github.com/goto456/stopwords
- 四种语言的相关停用词
- https://github.com/chatopera/Synonyms
- https://github.com/tonybeltramelli/pix2code
- 将图片转成html
- https://github.com/BYVoid/OpenCC
- 中文繁体转简体
- https://github.com/CamDavidsonPilon/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
- https://github.com/huyingxi/Synonyms.git
- 中文近义词工具包
- https://github.com/yaleimeng/Final_word_Similarity.git
- 中文近义词
-
JAVA
-
https://github.com/google-research/tuning_playbook
- 科学化调参
-
https://github.com/chehongshu/machine-learning-interview-chinese
- 传统机器学习算法
-
https://github.com/DA-southampton/NLP_ability
- NLP基础、NLP任务paper解析
-
https://paddlepedia.readthedocs.io/en/latest/index.html
- paddle深度学习基础内容和面试资源,ML、DL、CV、NLP
-
https://github.com/amusi/Deep-Learning-Interview-Book
- 面试问题,但是不一定包含问题解析
-
https://github.com/ShanghaiTechAIClub/DLInterview
- 简单基础
-
https://github.com/DarLiner/Algorithm_Interview_Notes-Chinese
-
百面机器学习、百面深度学习(书)
-
https://github.com/wangyuGithub01/Machine_Learning_Resources
-
https://github.com/scutan90/DeepLearning-500-questions.git
- 深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。
-
https://github.com/kmario23/deep-learning-drizzle.git
- 深度学习、强化学习、机器学习、计算机视觉、和NLP的讲座! !
-
C++
- https://github.com/TheAlgorithms/Python
- https://github.com/labuladong/fucking-algorithm
- https://github.com/youngyangyang04/leetcode-master
- https://github.com/MisterBooo/LeetCodeAnimation
- https://github.com/labuladong/fucking-algorithm
- Labelme标注工具 https://github.com/wkentaro/labelme
- Doccano https://github.com/doccano/doccano
- https://github.com/jiesutd/YEDDA
- https://github.com/SophonPlus/ChineseAnnotator
- 构想:中文文本标注工具(内附多个开源文本标注工具)
- https://github.com/heartexlabs/awesome-data-labeling
- https://github.com/mingx9527/Data_Label_Tools
- 超强大NLP标注工具——Prodigy
- https://github.com/yizhen20133868/Plot-Pictures-Tutorial-for-Paper
- https://mp.weixin.qq.com/s/v64js_pupc7VjjWNq9Kosw
- 神经网络作图
- https://www.heywhale.com/mw/project/63c274b8c421c0f295c1c268
- Matplotlib 最具价值的50个可视化项目
- https://github.com/soimort/you-get
- https://github.com/github/gitignore
- https://github.com/wistbean/learn_python3_spider