shruti-singh

Shruti Singh shruti-singh

PhD student @ IIT Gandhinagar | NLP for Scientific Texts

29 followers · 59 following

Achievements

Highlights

Stars

NirDiamant / RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 16,234 1,611 Updated May 11, 2025

xhluca / bm25s

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,161 67 Updated May 20, 2025

McGill-NLP / medal

Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain

Python 271 44 Updated Oct 18, 2023

benchopt / benchmark_bci

Benchmark for Brain Computer Interface methods

Python 16 7 Updated Feb 1, 2025

yuzhimanhua / Awesome-Scientific-Language-Models

A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery (EMNLP'24)

576 32 Updated Feb 26, 2025

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 49,893 7,203 Updated Apr 20, 2025

allenai / marg-reviewer

Code/data for MARG (multi-agent review generation)

Python 43 5 Updated Nov 14, 2024

p-lambda / dsir

DSIR large-scale data selection framework for language model training

Python 249 19 Updated Apr 7, 2024

Future-House / paper-qa

High accuracy RAG for answering questions from scientific documents with citations

Python 7,369 723 Updated May 21, 2025

yuzhimanhua / SciMult

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)

Python 11 Updated Aug 24, 2024

OpenPipe / OpenPipe

Turn expensive prompts into cheap fine-tuned models

TypeScript 2,593 143 Updated May 25, 2024

microsoft / LLMLingua

[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Python 5,102 294 Updated Mar 11, 2025

kagisearch / pyllms

Minimal Python library to connect to LLMs (OpenAI, Anthropic, Google, Groq, Reka, Together, AI21, Cohere, Aleph Alpha, HuggingfaceHub), with a built-in model performance benchmark.

Python 770 52 Updated May 22, 2025

soldni / pyllms

Forked from kagisearch/pyllms

Minimal Python library to connect to LLMs (OpenAI, Anthropic, AI21, Cohere, Aleph Alpha, HuggingfaceHub, Google PaLM2, with a built-in model performance benchmark.

Python 1 Updated Oct 1, 2023