-
Tencent
- China
-
11:52
(UTC +08:00)
Lists (17)
Sort Name ascending (A-Z)
- All languages
- AppleScript
- C
- C#
- C++
- CMake
- CSS
- Clojure
- CoffeeScript
- Cuda
- D
- Dart
- Dockerfile
- F#
- Go
- HTML
- Haskell
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Lua
- MATLAB
- MDX
- Makefile
- Markdown
- Nix
- PLpgSQL
- Perl
- Python
- Rich Text Format
- Ruby
- Rust
- SCSS
- Scala
- Scheme
- Shell
- Stan
- Swift
- Tcl
- TeX
- TypeScript
- Typst
- V
- VHDL
- Vala
- Verilog
- Vim Script
- Vue
- XSLT
Starred repositories
This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training data, instruction fine-tuning data, and In-Context learning …
Deep learning software for colorizing black and white images with a few clicks.
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Synthetic data generation pipelines for Pixmo-docs.
💥 Blazing fast terminal file manager written in Rust, based on async I/O.
Python tool for converting files and office documents to Markdown.
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
ETL, Analytics, Versioning for Unstructured Data
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
轻量、灵活、易上手的Python剪映草稿生成及导出工具,构建全自动化视频剪辑/混剪流水线
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Automatic colorization using deep neural networks. "Colorful Image Colorization." In ECCV, 2016.
Simple package to extract text with coordinates from programmatic PDFs
Tool to parse wiki tables from the HTML dump of Wikipedia
difPy - Python package for finding duplicate and similar images
🎁 5,400,000+ Unsplash images made available for research and machine learning
A python library to define and validate data types in Docling.
Zotero Plugins Collection | Zotero 插件合集 | Awesome Zotero Plugins
A plugin template for Zotero.
ImageBind One Embedding Space to Bind Them All
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
A large scale camera-taken table detection and recognition dataset.