Skip to content
View Jiaxin-Pei's full-sized avatar

Block or report Jiaxin-Pei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 41,719 4,478 Updated Jan 16, 2025
Jupyter Notebook 31 2 Updated Oct 14, 2024

The official repo for SocKET: Social Knowledge Evaluation Tests

Python 22 1 Updated Oct 24, 2023

A curated list of awesome Active Learning

744 70 Updated Oct 20, 2024

potato: portable text annotation tool

Jupyter Notebook 310 51 Updated Jan 16, 2025

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,741 674 Updated Jan 6, 2025

Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022

Python 13 1 Updated Oct 20, 2022

A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use.

Python 306 77 Updated Dec 13, 2024

An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021.

Python 31 7 Updated Sep 11, 2024

Charsiu: A neural phonetic aligner.

Jupyter Notebook 288 35 Updated Sep 19, 2022

Tools for collecting social media data around focal events

Python 84 15 Updated Mar 29, 2022

Crawl BookCorpus

Python 814 108 Updated Jul 14, 2023

Official repository for the ICWSM '21 paper "More than meets the tie: Examining the Role of Interpersonal Relationships in Social Networks"

Python 12 2 Updated Apr 26, 2023

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,702 161 Updated Aug 18, 2024

A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".

Python 477 123 Updated Jul 2, 2021

计算精神病学在线文献报告讨论会(Computational psychiatry online journal club(CPoJC))

52 9 Updated Aug 31, 2022
4 Updated Sep 2, 2023

Topic Modeling for The New York Times News Dataset

Python 19 12 Updated May 23, 2017

A dataset contains 37 million douban dushu comments

58 6 Updated Dec 1, 2018

Demographic and Economic Data for Tracts and Counties

1 Updated May 29, 2019

For associated data that can be mashed up with ours. This is data like Census demographics, CDC flu rates, and hospital beds

Jupyter Notebook 28 20 Updated May 26, 2020

State-by-state presidential election results, population data from the 2010 census, electoral college makeup, statistical analysis scripts.

R 7 3 Updated Mar 28, 2018

Mapping of US Zipcode, county, and state information from Census data

Ruby 67 38 Updated Sep 6, 2024

County Level Election Results Analysis (2016)

R 14 3 Updated Nov 27, 2016

A simple interface to the Project Gutenberg corpus.

Python 323 60 Updated Jan 12, 2023
Next