Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
CODEOWNERS		CODEOWNERS
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
requirements.txt		requirements.txt
sample.py		sample.py

Repository files navigation

XGen

Official research release for the family of XGen models (7B) by Salesforce AI Research:

Title: Long Sequence Modeling with XGen: A 7B LLM Trained on 8K Input Sequence Length

Authors: Salesforce AI Research.

Models

Model cards are published on the HuggingFace Hub:

XGen-7B-4K-Base with support for 4K sequence length.
XGen-7B-8K-Base with support for 8K sequence length.
XGen-7B-8k-Instruct with instruction-finetuning (for research purpose only).

The tokenization uses the OpenAI Tiktoken library, which can be installed the package via pip:

pip install tiktoken

The models can be used as auto-regressive samplers as follows:

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("Salesforce/xgen-7b-8k-base", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("Salesforce/xgen-7b-8k-base", torch_dtype=torch.bfloat16)
inputs = tokenizer("The world is", return_tensors="pt")
sample = model.generate(**inputs, max_length=128)
print(tokenizer.decode(sample[0]))

Citation

@misc{XGen,
  title={Long Sequence Modeling with XGen: A 7B LLM Trained on 8K Input Sequence Length},
  author={Salesforce AI Research},
  howpublished={Salesforce AI Research Blog},
  year={2023},
  url={https://blog.salesforceairesearch.com/xgen-7b/}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

XGen

Models

Citation

About

Releases

Packages

Languages

License

josegron/xgen

Folders and files

Latest commit

History

Repository files navigation

XGen

Models

Citation

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages