-
Naver corp.
- https://lovit.github.io
-
clustering4docs Public
Clustering algorithm library. Implements spherical k-means.
-
soynlp Public
A Python library for Korean natural language processing. Provides word extraction, tokenization, part-of-speech tagging, and preprocessing.
-
kmrd Public
Synthetic dataset for recommender system created from Naver Movie rating system
-
synthetic_dataset Public
Synthetic data generator for machine learning
-
python_upload_webserver Public
Flask, Waitress based file upload webserver
-
stanford_alpaca Public
Forked from tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Python, Apache License 2.0, Updated Mar 23, 2023
-
parallelformers Public
Forked from tunib-ai/parallelformers
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
Python, Apache License 2.0, Updated Jul 26, 2022
-
KR-WordRank Public
A library that automatically extracts words and keywords from Korean text using unsupervised learning.
-
tokenizers Public
Forked from huggingface/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
-
textrank Public
Implementation of TextRank and related utilities
-
naver_news_search_scraper Public
Python code that collects Naver News articles and their comments for a given search query
-
levenshtein_finder Public
Similar-string search using Levenshtein distance
-
text-dedup Public
Python package for memory-friendly text de-duplication
-
naver_movie_scraper Public
Collector for Naver Movie information and user-written movie reviews and ratings
-
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
-
namuwikitext Public
Wikitext-format dataset of Namuwiki (the most famous Korean wiki)
-
WordPieceModel Public
Lightweight Python version of the WordPiece model with tokenize/save/load functions
-
kwnlp-sql-parser Public
Forked from kensho-technologies/kwnlp-sql-parser
Utilities for parsing Wikipedia MySQL/MariaDB dumps.
Python, Apache License 2.0, Updated Oct 1, 2020
-
-
pycrfsuite_spacing Public
Korean spacing corrector built with python-crfsuite
-
huggingface_konlpy Public
Training Hugging Face Transformers with KoNLPy
-
wikiextractor Public
Forked from attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
Python, GNU Affero General Public License v3.0, Updated Aug 14, 2020
-