Organizations

@CogComp


📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

912 33 Updated Oct 16, 2024

Numbers every LLM developer should know

4,080 139 Updated Jan 16, 2024

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 1,699 300 Updated Oct 15, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 29,573 3,425 Updated Oct 14, 2024

An index of algorithms for reinforcement learning from human feedback (RLHF)

86 1 Updated Apr 17, 2024

Inference code for Llama models

Python 56,044 9,523 Updated Aug 18, 2024

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

Python 3,394 675 Updated Dec 14, 2022

SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.

Python 134 11 Updated Oct 22, 2022

My 500 LED xmas tree

Python 653 96 Updated Dec 18, 2021

Unsupervised text tokenizer focused on computational efficiency

C++ 954 101 Updated Mar 29, 2024

Must-read Papers on pre-trained language models.

3,319 436 Updated Nov 6, 2022

Curated collection of papers for the NLP practitioner 📖👩‍🔬

1,075 91 Updated Aug 5, 2020

Giant Language Model Test Room

TypeScript 456 110 Updated Jan 18, 2024

NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)

Python 63 5 Updated Dec 8, 2022

📖 A collection of pure bash alternatives to external processes.

Shell 36,488 3,276 Updated Nov 28, 2023

Learning to search in PyTorch

Python 111 12 Updated Feb 18, 2020

Python 176 32 Updated Jul 31, 2020

Codebase for testing whether hidden states of neural networks encode discrete structures.

Python 379 77 Updated Mar 15, 2024

With Holoviews, your data visualizes itself.

Python 2,696 401 Updated Oct 16, 2024

This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.

Python 1,790 250 Updated Jul 24, 2024

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 5,272 328 Updated Jul 21, 2024

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

C++ 13,175 1,164 Updated Jul 29, 2024

A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and Transferability of Contextual Representations" (NAACL 2019).

Python 210 30 Updated Oct 20, 2021

Seamless operability between C++11 and Python

C++ 15,628 2,097 Updated Oct 12, 2024

Papers from the computer science community to read and discuss.

Shell 87,533 5,730 Updated Oct 2, 2024

Best practice and tips & tricks to write scientific papers in LaTeX, with figures generated in Python or Matlab.

Python 3,623 251 Updated May 17, 2023

Performs string manipulation tasks by learning from the provided example(s), instead of having to program them out explicitly.

Roff 546 17 Updated May 13, 2020

A toolkit for processing Vietnamese texts

Java 16 3 Updated Oct 20, 2022

🤖 My collection of highly opinionated and amazing configs

Shell 260 20 Updated Feb 29, 2024

A general-purpose neural semantic parser for mapping natural language queries into machine executable code

Python 460 111 Updated Nov 12, 2022