Skip to content
View hppRC's full-sized avatar
🏠
sleepy
🏠
sleepy

Block or report hppRC

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Datacenter Scale Distributed Inference Serving Framework

Rust 3,649 278 Updated Apr 12, 2025

GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings

Python 41 2 Updated Mar 6, 2024

A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…

Python 431 51 Updated Mar 28, 2025

Negima is a Python package to extract phrases in Japanese text by using the part-of-speeches based rules you defined.

Python 14 3 Updated Aug 20, 2018

Wikipediaを用いた日本語の固有表現抽出データセット

136 9 Updated Sep 2, 2023

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,098 1,116 Updated Apr 11, 2025

A daily digest web app that scrapes and summarizes blogs, Reddit threads, GitHub trending, and Hacker-News-trending articles all in one place.

Python 231 41 Updated Apr 8, 2025

A blazing fast inference solution for text embeddings models

Rust 3,411 240 Updated Apr 9, 2025
Jupyter Notebook 5 2 Updated Feb 8, 2025

Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"

Python 101 7 Updated Jan 30, 2025

A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.

Shell 728 101 Updated Dec 17, 2024

Simple RL training for reasoning

Python 3,427 253 Updated Apr 10, 2025

AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)

Python 6 Updated Jan 13, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 9,318 657 Updated Apr 10, 2025

Supercharge Your Model Training

Python 5,333 437 Updated Apr 12, 2025

Preferred Generation Benchmark

Python 78 8 Updated Mar 26, 2025
Python 1 Updated Dec 29, 2024

Bringing BERT into modernity via both architecture changes and scaling

Python 1,314 109 Updated Mar 25, 2025

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python 730 80 Updated Apr 1, 2025

A modern alternative to ls

Rust 14,928 278 Updated Apr 10, 2025
Jupyter Notebook 32 2 Updated Jun 6, 2024
Python 9 Updated Sep 3, 2024

Scripts for creating a Japanese-English parallel corpus and training NMT models

Python 16 Updated Nov 9, 2021

YAST - Yet Another SPLADE or Sparse Trainer

Python 16 Updated Feb 17, 2025

Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.

Python 578 17 Updated Apr 3, 2025

A Python package for intrinsic dimension estimation

Python 85 18 Updated Feb 10, 2025

Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https://arxiv.org/abs/2309.08351)

Python 26 5 Updated Apr 17, 2024
Next