- Stanford University
- Palo Alto, California
- https://hejiecui.com/
- @HennyJieCC
Stars
BiomedParse: A Foundation Model for Joint Segmentation, Detection, and Recognition of Biomedical Objects Across Nine Modalities
A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/
A reading list on LLM-based Synthetic Data Generation 🔥
A framework for prompt tuning using Intent-based Prompt Calibration
Evaluate your LLM's response with Prometheus and GPT-4 💯
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Landmark Attention: Random-Access Infinite Context Length for Transformers
A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.
Synthetic Patient Population Simulator
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, covering incremental pre-training (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
A code repository that contains all the code for fine-tuning some of the popular LLMs on medical data
Tests for long context window evaluation
LlamaIndex is the leading framework for building LLM-powered agents over your data.
LOFT: A 1 Million+ Token Long-Context Benchmark
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A framework for few-shot evaluation of language models.
For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code.
YaRN: Efficient Context Window Extension of Large Language Models
Scalable toolkit for efficient model alignment
Library for clinical NLP with spaCy.
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
A high-throughput and memory-efficient inference and serving engine for LLMs
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
calflops is designed to calculate FLOPs, MACs, and parameters for various neural networks, such as Linear, CNN, RNN, GCN, and Transformer models (BERT, LLaMA, and other large language models)