Skip to content
View zhaowei-wang-nlp's full-sized avatar
🀄
真中
🀄
真中

Highlights

  • Pro

Block or report zhaowei-wang-nlp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"

Python 289 27 Updated Nov 19, 2024

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,000 1,294 Updated Feb 1, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,415 1,319 Updated Feb 11, 2025

Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"

Python 29 2 Updated Sep 8, 2023

🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.

Python 10,523 1,000 Updated Feb 14, 2025

s1: Simple test-time scaling

Python 5,273 593 Updated Feb 13, 2025

Witness the aha moment of VLM with less than $3.

Python 2,419 179 Updated Feb 14, 2025

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,412 177 Updated Jan 23, 2025

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 223 6 Updated Feb 13, 2025

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,679 132 Updated Sep 19, 2023

Entropy Based Sampling and Parallel CoT Decoding

Python 3,307 320 Updated Nov 13, 2024

Open Thoughts: Fully Open Data Curation for Thinking Models

Python 968 69 Updated Feb 12, 2025

Fully open reproduction of DeepSeek-R1

Python 19,837 1,693 Updated Feb 14, 2025

The MATH Dataset (NeurIPS 2021)

Python 1,016 91 Updated Aug 5, 2024

Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"

Python 66 8 Updated Jul 4, 2021

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 6,464 567 Updated Feb 14, 2025

Reading List of Memory Augmented Multimodal Research, including multimodal context modeling, memory in vision and robotics, and external memory/knowledge augmented MLLM.

11 Updated Sep 5, 2024

Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits

Jupyter Notebook 19 Updated Dec 15, 2022

The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

Python 140 16 Updated Jan 18, 2025
Python 188 32 Updated May 3, 2024

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 172 13 Updated Oct 28, 2024

Adaptable tools to make reinforcement learning and evolutionary computation algorithms.

Python 56 1 Updated May 9, 2022

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

Python 85 8 Updated Oct 10, 2023

[ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videos

Python 258 8 Updated Jan 15, 2025

Development kit for the data of the Places365-Standard and Places365-Challenge

MATLAB 125 45 Updated Aug 15, 2017

Large Concept Models: Language modeling in a sentence representation space

Python 1,911 157 Updated Jan 29, 2025

Sky-T1: Train your own O1 preview model within $450

Python 2,549 274 Updated Feb 14, 2025

iNaturalist competition details

Python 745 110 Updated May 26, 2021
Next