zqgong/Llama2_keras3

llama2 keras3

This repository is a multi-backend (PyTorch, TensorFlow, JAX) implementation of LLaMA using Keras 3.
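
Keras 3 picks its backend from the `KERAS_BACKEND` environment variable, which has to be set before `keras` is imported. A minimal sketch, assuming only the three backends named above; `select_backend` is a hypothetical helper and not part of this repository:

```python
import os

# Backend identifiers as Keras 3 spells them (PyTorch is "torch").
VALID_BACKENDS = ("jax", "tensorflow", "torch")


def select_backend(name: str) -> str:
    """Set KERAS_BACKEND. Hypothetical helper; call it before `import keras`."""
    if name not in VALID_BACKENDS:
        raise ValueError(f"unknown backend {name!r}, expected one of {VALID_BACKENDS}")
    os.environ["KERAS_BACKEND"] = name
    return name


select_backend("jax")
# import keras  # import only after the variable is set, with Keras 3 and JAX installed
```

The same effect can be had from the shell, for example `KERAS_BACKEND=torch python performence.py`.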

Based on LLaMA-Lite.

Implements a KV cache in simple code to speed up autoregressive generation.
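
The idea behind a KV cache can be sketched without any backend installed. The example below is an illustration in plain Python, not the code from this repository: single-head attention, with hypothetical names (`KVCache`, `full_recompute`) and random weights. Each decoding step projects only the newest token and reuses the stored keys and values, and the loop checks that this matches recomputing everything from scratch.

```python
import math
import random


def matvec(weight, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weight]


def softmax(scores):
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys, values):
    """Scaled dot-product attention of one query over all cached positions."""
    scale = 1.0 / math.sqrt(len(query))
    scores = [scale * sum(q * k for q, k in zip(query, key)) for key in keys]
    weights = softmax(scores)
    return [sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))]


class KVCache:
    """Stores the key/value projections of every token seen so far."""

    def __init__(self):
        self.keys = []
        self.values = []

    def step(self, x, wq, wk, wv):
        # Only the newest token is projected; earlier K/V are reused.
        self.keys.append(matvec(wk, x))
        self.values.append(matvec(wv, x))
        return attend(matvec(wq, x), self.keys, self.values)


def full_recompute(tokens, wq, wk, wv):
    """Baseline without a cache: re-project every token at every step."""
    keys = [matvec(wk, t) for t in tokens]
    values = [matvec(wv, t) for t in tokens]
    return attend(matvec(wq, tokens[-1]), keys, values)


rng = random.Random(0)
dim = 4


def random_matrix():
    return [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(dim)]


wq, wk, wv = random_matrix(), random_matrix(), random_matrix()
tokens = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(5)]

cache = KVCache()
for t, x in enumerate(tokens):
    cached = cache.step(x, wq, wk, wv)
    baseline = full_recompute(tokens[: t + 1], wq, wk, wv)
    assert all(math.isclose(a, b, abs_tol=1e-12) for a, b in zip(cached, baseline))
```

Without the cache, step `t` projects `t` tokens; with it, each step projects one, so the projection work over a sequence drops from quadratic to linear in its length.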

Easy to convert to TFLite.
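
One possible conversion route is TensorFlow's `tf.lite.TFLiteConverter`. This is a sketch under assumptions: the TensorFlow backend is active, `model` is an already built Keras model, and `convert_to_tflite` and the output path are hypothetical names. The repository may do the conversion differently.

```python
def convert_to_tflite(model, out_path="llama2.tflite"):
    """Hypothetical helper: serialize a Keras model to a TFLite flatbuffer."""
    # Imported lazily so the module loads even where TensorFlow is absent.
    import tensorflow as tf

    converter = tf.lite.TFLiteConverter.from_keras_model(model)
    tflite_bytes = converter.convert()  # returns the serialized model as bytes
    with open(out_path, "wb") as f:
        f.write(tflite_bytes)
    return len(tflite_bytes)
```

Calling `convert_to_tflite(model)` writes the file and returns its size in bytes. Whether every layer converts cleanly depends on the ops the model uses.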

Inference

  • Get the TinyLlama model weights from Hugging Face (HF).
  • You can also try the Llama 2 weights from Meta on HF.

python performence.py
