Starred repositories
The official home of the Presto distributed SQL query engine for big data
Apache Pulsar - distributed pub-sub messaging system
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running mat…
Tink is a multi-language, cross-platform, open source library that provides cryptographic APIs that are secure, easy to use correctly, and hard(er) to misuse.
⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Apache Beam is a unified programming model for Batch and Streaming data processing.
Alluxio, data orchestration for analytics and machine learning in the cloud
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
An Engine-Agnostic Deep Learning Framework in Java
Generates a unified GraphQL schema from gRPC microservices and other Protobuf sources
Deep Learning (Python, C, C++, Java, Scala, Go)
Apache Nutch is an extensible and scalable web crawler
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
基于canal 的 mysql 与 redis/memcached/mongodb 的 nosql 数据实时同步方案 案例 demo canal client
Live Transcribe is an Android application that provides real-time captioning for people who are deaf or hard of hearing. This repository contains the Android client libraries for communicating with…
Anserini is a Lucene toolkit for reproducible information retrieval research
Jcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extraction, key sentence extraction, summary extraction …
专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference
Apache Nutch Plugins for AJAX page fetch, parse, index
Stanford NLP framework wrapped with a REST API.
Live Transcribe is an Android application that provides real-time captioning for people who are deaf or hard of hearing. This repository contains the Android client libraries for communicating with…