# Official Implementation of "LogLLM: Log-based Anomaly Detection Using Large Language Models"
Statistics of the datasets used in the experiments:
| Dataset | # Log Messages | # Log Sequences | Training: # Log Sequences | Training: # Anomalies | Training: Anomaly Ratio | Testing: # Log Sequences | Testing: # Anomalies | Testing: Anomaly Ratio |
|---|---|---|---|---|---|---|---|---|
| HDFS | 11,175,629 | 575,061 | 460,048 | 13,497 | 2.93% | 115,013 | 3,341 | 2.90% |
| BGL | 4,747,963 | 47,135 | 37,708 | 4,009 | 10.63% | 9,427 | 817 | 8.67% |
| Liberty | 5,000,000 | 50,000 | 40,000 | 34,144 | 85.36% | 10,000 | 651 | 6.51% |
| Thunderbird | 10,000,000 | 99,997 | 79,997 | 837 | 1.05% | 20,000 | 29 | 0.15% |
Experimental results on the HDFS, BGL, Liberty, and Thunderbird datasets. The best results are indicated in bold:
| Method | Log Parser | HDFS Prec. | HDFS Rec. | HDFS F1 | BGL Prec. | BGL Rec. | BGL F1 | Liberty Prec. | Liberty Rec. | Liberty F1 | Thunderbird Prec. | Thunderbird Rec. | Thunderbird F1 | Avg. F1 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DeepLog | ✔ | 0.835 | 0.994 | 0.908 | 0.166 | 0.988 | 0.285 | 0.751 | 0.855 | 0.800 | 0.017 | 0.963 | 0.033 | 0.506 |
| LogAnomaly | ✔ | 0.886 | 0.893 | 0.966 | 0.176 | 0.985 | 0.299 | 0.684 | 0.876 | 0.768 | 0.025 | 0.963 | 0.050 | 0.521 |
| PLELog | ✔ | 0.893 | 0.979 | 0.934 | 0.595 | 0.880 | 0.710 | 0.795 | 0.874 | 0.832 | 0.826 | 0.704 | 0.760 | 0.809 |
| FastLogAD | ✔ | 0.721 | 0.893 | 0.798 | 0.167 | **1.000** | 0.287 | 0.151 | **0.999** | 0.263 | 0.008 | 0.931 | 0.017 | 0.341 |
| LogBERT | ✔ | 0.989 | 0.614 | 0.758 | 0.165 | 0.989 | 0.283 | 0.909 | 0.615 | 0.734 | 0.143 | 0.500 | 0.222 | 0.499 |
| LogRobust | ✔ | 0.961 | **1.000** | 0.980 | 0.696 | 0.968 | 0.810 | 0.695 | 0.979 | 0.813 | 0.318 | **1.000** | 0.482 | 0.771 |
| CNN | ✔ | 0.966 | **1.000** | 0.982 | 0.698 | 0.965 | 0.810 | 0.580 | 0.914 | 0.709 | 0.900 | 0.670 | 0.766 | 0.817 |
| NeuralLog | ✘ | 0.971 | 0.988 | 0.979 | 0.792 | 0.884 | 0.835 | 0.875 | 0.926 | 0.900 | 0.794 | 0.931 | 0.857 | 0.893 |
| RAPID | ✘ | **1.000** | 0.859 | 0.924 | **0.874** | 0.399 | 0.548 | 0.911 | 0.611 | 0.732 | 0.200 | 0.207 | 0.203 | 0.602 |
| LogLLM | ✘ | 0.994 | **1.000** | **0.997** | 0.861 | 0.979 | **0.916** | **0.992** | 0.926 | **0.958** | **0.966** | 0.966 | **0.966** | **0.959** |
- Create a conda environment.

  ```bash
  conda install --yes --file requirements.txt  # you may need to downgrade torch via pip to match your CUDA version
  ```
- Download the open-source LLM Meta-Llama-3-8B and the BERT model bert-base-uncased. The expected local layout is shown below (a programmatic download sketch follows it).
  ```
  ├── Meta-Llama-3-8B
  │   ├── config.json
  │   ├── generation_config.json
  │   ├── LICENSE
  │   ├── model-00001-of-00004.safetensors
  │   ├── model-00002-of-00004.safetensors
  │   ├── model-00003-of-00004.safetensors
  │   ├── model-00004-of-00004.safetensors
  │   ├── model.safetensors.index.json
  │   ├── special_tokens_map.json
  │   ├── tokenizer.json
  │   └── tokenizer_config.json
  ├── bert-base-uncased
  │   ├── config.json
  │   ├── model.safetensors
  │   ├── tokenizer.json
  │   ├── tokenizer_config.json
  │   └── vocab.txt
  ```
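  If you prefer to fetch both checkpoints programmatically, the sketch below uses `huggingface_hub`. The repo IDs and local directory names are assumptions chosen to match the layout above; Meta-Llama-3-8B is gated, so you must accept its license on Hugging Face and log in with `huggingface-cli login` first.

  ```python
  from huggingface_hub import snapshot_download

  # Local directory names chosen to match the layout above (assumption).
  snapshot_download(repo_id="meta-llama/Meta-Llama-3-8B",
                    local_dir="Meta-Llama-3-8B")
  snapshot_download(repo_id="google-bert/bert-base-uncased",
                    local_dir="bert-base-uncased")
  ```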
- Prepare training and testing data.
  - Download the BGL/HDFS_v1/Thunderbird datasets from here. Download the Liberty dataset from here.
  - For BGL, Thunderbird, and Liberty, set the following variables in sliding_window.py under the *prepareData* directory:

    ```python
    data_dir = # e.g., r'/mnt/public/gw/SyslogData/BGL'
    log_name = # e.g., 'BGL.log'
    ```

    For Liberty, you should additionally enable (uncomment):

    ```python
    start_line = 40000000
    end_line = 45000000
    ```

    For Thunderbird, you should enable (uncomment):

    ```python
    start_line = 160000000
    end_line = 170000000
    ```

    Then run

    ```bash
    python prepareData/sliding_window.py
    ```

    from the root directory to generate the training and testing data, which will be saved in {data_dir}. (A sketch of the windowing logic appears after this list.)
  - For HDFS, set the following variables in session_window.py under the *prepareData* directory:

    ```python
    data_dir = # e.g., r'/mnt/public/gw/SyslogData/HDFS_v1'
    log_name = # e.g., 'HDFS.log'
    ```

    Then run

    ```bash
    python prepareData/session_window.py
    ```

    from the root directory to generate the training and testing data, which will be saved in {data_dir}. (A sketch of the session grouping also appears after this list.)
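  As referenced above, here is a minimal sketch of the window grouping for BGL-style logs. The window size, step, message separator, and the convention that a BGL line is anomalous unless its first token is `-` are assumptions about what `sliding_window.py` does, not its actual code:

  ```python
  import pandas as pd

  def sliding_window(log_path, window_size=100, step=100):
      """Group consecutive log lines into windows of `window_size` lines.

      A window is labeled 1 (anomalous) if it contains at least one
      anomalous line; for BGL-style logs a line is an alert message
      when its first token is not '-'.
      """
      with open(log_path, errors='ignore') as f:
          lines = [line.rstrip('\n') for line in f]
      contents, labels = [], []
      for i in range(0, max(len(lines) - window_size + 1, 1), step):
          window = lines[i:i + window_size]
          # Drop the leading label token, keep the message body.
          msgs = [line.split(' ', 1)[-1] for line in window]
          labels.append(int(any(not line.startswith('-') for line in window)))
          contents.append(' ; '.join(msgs))  # separator is an assumption
      return pd.DataFrame({'Content': contents, 'Label': labels})
  ```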
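  Likewise, a sketch of the session grouping for HDFS: messages are grouped by the block ID embedded in each line, and each session takes its label from the `anomaly_label.csv` file shipped with HDFS_v1. The column names and separator are assumptions:

  ```python
  import re
  from collections import defaultdict

  import pandas as pd

  def session_window(log_path, label_csv):
      """Group HDFS log lines into sessions keyed by block ID."""
      block_re = re.compile(r'blk_-?\d+')
      sessions = defaultdict(list)
      with open(log_path, errors='ignore') as f:
          for line in f:
              match = block_re.search(line)
              if match:
                  sessions[match.group()].append(line.strip())
      # anomaly_label.csv maps each BlockId to 'Normal' or 'Anomaly'.
      labels = pd.read_csv(label_csv).set_index('BlockId')['Label']
      rows = [(' ; '.join(msgs), int(labels.get(blk) == 'Anomaly'))
              for blk, msgs in sessions.items()]
      return pd.DataFrame(rows, columns=['Content', 'Label'])
  ```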
- Train our proposed deep model. This step can be skipped by directly using our fine-tuned model (`ft_model_[dataset_name]`).
  - Set the following variables in train.py:

    ```python
    Bert_path = # e.g., r"/mnt/public/gw/LLM_model/bert-base-uncased"
    Llama_path = # e.g., r"/mnt/public/gw/LLM_model/Meta-Llama-3-8B"
    dataset_name = # e.g., 'BGL'
    data_path = # e.g., r'/mnt/public/gw/SyslogData/{}/train.csv'.format(dataset_name)
    ```
  - Run

    ```bash
    python train.py
    ```

    from the root directory to obtain the fine-tuned model.
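  For orientation, the two backbones pointed to by `Bert_path` and `Llama_path` load with standard `transformers` calls; a minimal sketch under that assumption follows (the actual model assembly, projector, and fine-tuning logic live in train.py and the accompanying model code):

  ```python
  import torch
  from transformers import AutoModel, AutoModelForCausalLM, AutoTokenizer

  Bert_path = r"/mnt/public/gw/LLM_model/bert-base-uncased"  # example path
  Llama_path = r"/mnt/public/gw/LLM_model/Meta-Llama-3-8B"   # example path

  # BERT encodes individual log messages into semantic vectors.
  bert_tokenizer = AutoTokenizer.from_pretrained(Bert_path)
  bert = AutoModel.from_pretrained(Bert_path)

  # Llama 3 serves as the large language model backbone.
  llama_tokenizer = AutoTokenizer.from_pretrained(Llama_path)
  llama = AutoModelForCausalLM.from_pretrained(
      Llama_path, torch_dtype=torch.bfloat16, device_map="auto")
  ```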
- Evaluate on the test dataset.
  - Set the following variables in eval.py:

    ```python
    Bert_path = # e.g., r"/mnt/public/gw/LLM_model/bert-base-uncased"
    Llama_path = # e.g., r"/mnt/public/gw/LLM_model/Meta-Llama-3-8B"
    dataset_name = # e.g., 'BGL'
    data_path = # e.g., r'/mnt/public/gw/SyslogData/{}/test.csv'.format(dataset_name)
    ```
  - We provide the test file for the BGL dataset, which can be accessed here.
  - Run

    ```bash
    python eval.py
    ```

    from the root directory.
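The Precision/Recall/F1 values reported in the results table can be reproduced from predicted and ground-truth labels with `scikit-learn`; a small sketch (variable names are illustrative):

```python
from sklearn.metrics import precision_recall_fscore_support

def report(y_true, y_pred):
    """Precision, recall, and F1 for the anomalous class (label 1)."""
    prec, rec, f1, _ = precision_recall_fscore_support(
        y_true, y_pred, average='binary', pos_label=1)
    print(f"Precision: {prec:.3f}  Recall: {rec:.3f}  F1: {f1:.3f}")

# Example: report(test_df['Label'], predictions)
```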