-
HUST
- Viet Nam
-
06:54
(UTC +07:00) - tuongtranngoc.github.io
- in/tuong-tran-ngoc-885482154
Lists (13)
Sort Name ascending (A-Z)
C++ programming
Citation-Recommendation
ComputerVision
Object detection, Segmentation, Pose Estimation algorithmsData Structure and Algorithm
Data Structure and AlgorithmDeployment
Document AI
Document-AI algorithms include Visual Information Extraction, Table Structure Recognition, Layout Analysis, Document Classification, Document VQA, etc.Export model
Facial-anti-spoofing
Stars
Recipes to scale inference-time compute of open models
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Create and modify Word documents with Python
A suite of image and video neural tokenizers
Accepted by CVPR Workshop 2024
A roadmap to learn Kubernetes from scratch (Beginner to Advanced level)
Understand kubernetes step by step. A simple repo for beginners 🔥
A set of exercises to prepare for Certified Kubernetes Application Developer exam by Cloud Native Computing Foundation
OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
High-resolution models for human tasks.
Medical SAM 2: Segment Medical Images As Video Via Segment Anything Model 2
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …
Modeling, training, eval, and inference code for OLMo
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
[ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
SPECTER: Document-level Representation Learning using Citation-informed Transformers
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
A citation recommendation system that allows users to find relevant citations for their paper drafts. The tool is backed by Semantic Scholar's OpenCorpus dataset.
Code for ECIR 2022 paper Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
PaddleSlim is an open-source library for deep model compression and architecture search.
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.