Youly172
  • pdfplumber Public

    Forked from jsvine/pdfplumber

    Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

    Python MIT License Updated Feb 5, 2020
  • Convert PDF to HTML without losing text or format.

    HTML Other Updated Feb 2, 2020
  • Kashgari Public

    Forked from BrikerMan/Kashgari

    Kashgari is a Production-ready NLP Transfer learning framework for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

    Python Apache License 2.0 Updated Jan 28, 2020
  • Official implementation of Character Region Awareness for Text Detection (CRAFT)

    Python MIT License Updated Jan 27, 2020
  • Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)

    Python Apache License 2.0 Updated Jan 21, 2020
  • models-1 Public

    Forked from PaddlePaddle/models

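    Pre-trained and Reproduced Deep Learning Models (the official PaddlePaddle model library, containing a variety of deep learning models from the academic frontier and validated in industrial scenarios)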

    Python Apache License 2.0 Updated Jan 18, 2020
  • kaldi Public

    Forked from kaldi-asr/kaldi

    This is the official location of the Kaldi project.

    Shell Other Updated Jan 18, 2020
  • Chinese machine reading comprehension datasets

    1 Updated Jan 15, 2020
  • funNLP Public

    Forked from fighting41love/funNLP

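    Chinese and English sensitive words, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, company name list, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …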

    Python Updated Jan 14, 2020
  • HanLP Public

    Forked from hankcs/HanLP

    Natural Language Processing for the next decade

    Python Apache License 2.0 Updated Jan 10, 2020
  • ALBERT Public

    Forked from google-research/albert

    ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

    Python Updated Jan 9, 2020
  • albert_zh Public

    Forked from brightmart/albert_zh

    A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, large-scale Chinese pre-trained ALBERT models

    Python Updated Jan 7, 2020
  • Speech recognition module for Python, supporting several engines and APIs, online and offline.

    Python BSD 3-Clause "New" or "Revised" License Updated Jan 4, 2020
  • rq Public

    Forked from rq/rq

    Simple job queues for Python

    Python Other Updated Jan 3, 2020
  • bert Public

    Forked from google-research/bert

    TensorFlow code and pre-trained models for BERT

    Python Apache License 2.0 Updated Jan 3, 2020
  • models Public

    Forked from tensorflow/models

    Models and examples built with TensorFlow

    Python Apache License 2.0 Updated Jan 2, 2020
  • 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.

    Python Apache License 2.0 Updated Jan 2, 2020
  • stanfordnlp Public

    Forked from stanfordnlp/stanza

    Official Stanford NLP Python Library for Many Human Languages

    Python Other Updated Dec 25, 2019
  • nlp-journey Public

    Forked from msgi/nlp-journey

    Documents, papers, and code related to NLP, including topic models, word embeddings, named entity recognition, text classification, text generation, text similarity computation, machine translation, and more, covering various …

    Python Updated Dec 23, 2019
  • Mapping a variable-length sentence to a fixed-length vector using BERT model

    Python MIT License Updated Dec 23, 2019
  • A relatively complete document analysis and recognition project

    Python Apache License 2.0 Updated Dec 11, 2019
  • pyecharts Public

    Forked from pyecharts/pyecharts

    🎨 Python Echarts Plotting Library

    Python MIT License Updated Nov 29, 2019
  • Latex_OCR

    Jupyter Notebook Updated Nov 24, 2019
  • A Deep-Learning-Based Chinese Speech Recognition System

    Python GNU General Public License v3.0 Updated Nov 15, 2019
  • Python Updated Nov 12, 2019
  • 📋 Python wrapper to grab text from images and save as text files using Tesseract Engine

    Python Updated Oct 25, 2019
  • Run pdf2htmlEX in a Docker container.

    Python Apache License 2.0 Updated Oct 21, 2019
  • chinese_ocr Public

    Forked from YCG09/chinese_ocr

    CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras

    Python Apache License 2.0 Updated Oct 8, 2019
  • This library provides common speech features for ASR including MFCCs and filterbank energies.

    Python MIT License Updated Sep 13, 2019
  • Data repository for LaTeX OCR

    Updated Aug 26, 2019