Stars
System design patterns for machine learning
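PDF scientific paper translation with preserved formats: AI-based full-text bilingual translation of PDF documents that fully preserves the layout. Supports services such as Google, DeepL, Ollama, and OpenAI, and is available via CLI, GUI, Docker, and Zotero.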
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
This is a repo with links to everything you'd ever want to learn about data engineering
A collection of localized (Korean) AWS AI/ML workshop materials for hands-on labs.
A Korean-language tutorial based on the official LangChain documentation, the Cookbook, and other practical examples. Through this tutorial, you can learn how to use LangChain more easily and effectively.
SRT (Super Rapid Train: https://etk.srail.kr/) wrapper for Python
Provides a fluent query builder API around pymongo.
The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, it has since helped thousands of companies create production-…
Example application code for the python architecture book
[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside Docker containers. Also includes spark-notebook and HDFS FileBrowser.
My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline, based on the lambda architecture, that aggregates Twitter and US stock market data for user sentiment anal…
Roadmap to becoming a data engineer in 2021
Datascience-Interview-Questions for Korean
Vulnerability scanner and mitigation patch for Log4j2 CVE-2021-44228
Scrapy, a fast high-level web crawling & scraping framework for Python.
Deep Learning API and Server in C++14, with support for PyTorch, TensorRT, Dlib, NCNN, TensorFlow, XGBoost, and TSNE
Korean BERT pre-trained cased (KoBERT)
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI …