Stars
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".
[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
AnyLoc: Universal Visual Place Recognition (RA-L 2023)
Official implementation of "DepthLab: From Partial to Complete"
[NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
Official Open Source code for "Scaling Language-Image Pre-training via Masking"
Real-time dense scene reconstruction with SLAM3R
[CVPR 2023 - Highlight] Accelerated Coordinate Encoding (ACE): Learning to Relocalize in Minutes using RGB and Poses
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899
A framework to easily use 32 (and growing) different image matching methods
Convolutional Neural Networks for Denoising Gyroscopes of Low-Cost IMUs
RoNIN: Robust Neural Inertial Navigation in the Wild
AdaLAM is a fully handcrafted realtime outlier filter integrating several best practices into a single efficient and effective framework. It detects inliers by searching for significant local affin…
Codes of MVSFormer++: Revealing the Devil in Transformer’s Details for Multi-View Stereo (ICLR2024)
SNAP: Self-supervised Neural Maps for Visual Positioning and Semantic Understanding (NeurIPS 2023)