lyrain2001


The public repository for the Proteomic Data Commons UI and APIs

HTML 14 6 Updated May 23, 2024

Code for finetuning TabPFN on one downstream tabular dataset.

Python 20 1 Updated Feb 11, 2025

Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

Python 639 65 Updated Sep 19, 2024
Python 1,287 182 Updated Nov 20, 2024

Modeling, training, eval, and inference code for OLMo

Python 5,297 564 Updated Mar 4, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 637 54 Updated Dec 16, 2024

Everything you need to build state-of-the-art foundation models, end-to-end.

Python 7,577 534 Updated Mar 4, 2025

ArcheType uses LLMs to automatically assign custom labels to your tabular data

Jupyter Notebook 13 1 Updated Apr 11, 2024

Decoding Visual Experience and Mapping Semantics through Whole-Brain Analysis Using fMRI Foundation Models

Python 4 1 Updated Dec 2, 2024
Python 2 Updated Dec 11, 2024
Python 290 50 Updated Nov 2, 2023

A novel approach for synthesizing tabular data using pretrained large language models

Python 300 49 Updated Oct 29, 2024

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 835 42 Updated Nov 23, 2024

OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,243 45 Updated Dec 11, 2024

TuneTables is a tabular classifier that implements prompt tuning for frozen prior-fitted networks.

Python 16 3 Updated Dec 4, 2024

⚡ TabPFN: Foundation Model for Tabular Data ⚡

Python 2,832 234 Updated Mar 4, 2025

Recipes to train reward model for RLHF.

Python 1,217 88 Updated Feb 9, 2025

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 3,653 313 Updated Feb 20, 2025

state-of-the-art search over vector embeddings and structured data (SIGMOD '24)

C++ 66 15 Updated Jun 19, 2024

This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes"

Python 487 45 Updated Mar 26, 2024

DSPy: The framework for programming—not prompting—language models

Python 22,257 1,705 Updated Mar 4, 2025

A collection of papers on large language models (LLMs) for table-related tasks, e.g., using LLMs for table QA. A curated list of "Table + LLM" papers.

383 29 Updated Dec 19, 2024

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

1,104 71 Updated Feb 24, 2025

TransGNN, SIGIR 2024

Python 44 6 Updated Jul 11, 2024

In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.

Python 407 25 Updated Feb 13, 2024

[ICLR 2024 spotlight] Making Pre-trained Language Models Great on Tabular Prediction

Python 52 8 Updated Jul 12, 2024

Awesome-LLM-Tabular: a curated list of Large Language Models applied to Tabular Data

369 29 Updated Dec 22, 2024
Python 66 10 Updated May 21, 2024

For calculating Shapley values via linear regression.

Python 67 13 Updated Jun 6, 2021
SAS 2 2 Updated Mar 14, 2022