LLMDFA: Analyzing Dataflow in Code with Large Language Models

LLMDFA is an LLM-powered data-flow analysis framework. Specifically, it instantiates bottom-up summary-based data-flow analysis by interpreting intra-procedural data-flow facts with LLMs. With the specified sources/sinks and data-flow transfer functions (i.e., rules of propagating data-flow facts), LLMDFA can support various forms of bug detection in a context- and path-sensitive manner. Notably, the analysis is totally compilation-free.

Installation

Clone the repository:

git clone [email protected]:chengpeng-wang/LLMDFA.git
cd LLMDFA

Install the required dependencies:

conda create --name llmdfa python=3.10
conda activate llmdfa
pip install -r requirements.txt

Ensure you have the Tree-sitter library and language bindings installed:
```
cd lib
python build.py
```

Configure your key:

export OPENAI_API_KEY=sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

Quick Start

Analyze a demo case using LLMDFA

First, you need to synthesize the source/sink extractor with the following command:
```
cd src
sh run_extractor_synthesizer.sh
```
After that, you can obtain synthesized extractors in the file src/TSAgent/TS_synthesis_extractor.py. We also provide the manually crafted extractors in src/TSAgent/TS_manual_extractor.py. If you want to use the manually crafted ones, you can just overwrite src/TSAgent/TS_synthesis_extractor.py with the content of src/TSAgent/TS_manual_extractor.py.

Then you can run the following commands to detect XSS bugs using LLMDFA powered by gpt-4o-mini as a demo, which contains 10 cases.
```
cd src
python run_llmdfa.py --bug-type xss --model-name gpt-4o-mini \
    -syn-parser -fscot -syn-solve \
    --solving-refine-number 3 \
    --analysis-mode single    
```
Then you can obtain the bug report in the directory log/gpt-4o-mini/synparser_fscot_synsolver. Also, the console output indicates the numbers of TPs and FPs. Here is an example:
```
CWE369_Divide_by_Zero__int_database_modulo_81 

{'input_token_cost': 14237, 'output_token_cost': 973, 'analysis_result': {'TPs': 1, 'FPs': 0}, 'ground_truth': {'TPs': 1, 'FPs': 2}, 'single time cost': 16.143609285354614} 
```
In the above example, the case CWE369_Divide_by_Zero__int_database_modulo_81 contains 1 true positive and 2 cases that are easily reported as false positives. The above console output indicates that LLMDFA reported one true positive without false positives. The input and output costs are 14237 and 973, respectively. The time cost is 16.143609285354614 seconds.

If you want to detect all the XSS bugs in the Juliet Test Suite, you can just change the value of analysis-mode to all.

If you want to change the bug types, you can reset the value of bug-type to osci and dbz.

If you want to change the LLMs, you can just change the value of model-name to the name of the models you want to use, such as gpt-3.5-turbo and gpt-4-turbo. The output reports of LLMDFA are stored in separate directories in log when using different LLMs.
Run ablations of LLMDFA

You can run the three ablations of LLMDFA as follows:
- NoSynExt: Utilize LLMs to identify sources and sinks instead of applying synthesized source/sink extractors
```
cd src
python run_llmdfa.py --bug-type xss --model-name gpt-4o-mini \
    -fscot -syn-solve \
    --solving-refine-number 3 \
    --analysis-mode single    
```
- NoCoT: Directly ask LLMs to summarize intra-procedural data-flow paths without chain-of-thought prompting
```
cd src
python run_llmdfa.py --bug-type xss --model-name gpt-4o-mini \
    -syn-parser -syn-solve \
    --solving-refine-number 3 \
    --analysis-mode single    
```
- NoSynVal: Validate path feasibility with LLMs instead of invoking SMT solvers
```
cd src
python run_llmdfa.py --bug-type xss --model-name gpt-4o-mini \
    -syn-parser -fscot \
    --solving-refine-number 3 \
    --analysis-mode single    
```
The bug reports of the ablations are located in log/gpt-4o-mini/nosynparser_fscot_synsolver, log/gpt-4o-mini/synparser_nofscot_synsolver, and log/gpt-4o-mini/synparser_fscot_nosynsolver, respectively.
Run end-to-end prompting-based analyses as baselines

You can run the following command to apply end-to-end prompting-based analyzers:
```
cd src
python run_baseline.py --bug-type xss --model-name gpt-4o-mini --analysis-mode single    
```

Remark on Dataset

To avoid the leakage of ground truth to LLMs, we obfuscate the code in the Juliet Test Suite. Specifically, we remove the comments and rename the functions. Also, we concatenate multiple Java files belonging to the same test case into a single file for convenience in prompting, even though the resulting file may not be compilable.

More Programming Languages

LLMDFA is language-agnostic. To migrate the current implementations to other programming languages or extract more syntactic facts, please refer to the grammar files in the corresponding Tree-sitter libraries and refactor the code in the directory src/TSAgent. Basically, you only need to change the node types when invoking find_nodes.

Here are the links to grammar files in Tree-sitter libraries targeting mainstream programming languages:

Contributing

Contributions are welcome! Please open an issue or submit a pull request for any improvements or bug fixes.

Contact

For any questions or suggestions, please contact [email protected] or [email protected].

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
benchmark		benchmark
lib		lib
src		src
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLMDFA: Analyzing Dataflow in Code with Large Language Models

Installation

Quick Start

Remark on Dataset

More Programming Languages

Contributing

Contact

About

Releases

Packages

Contributors 2

Languages

chengpeng-wang/LLMDFA

Folders and files

Latest commit

History

Repository files navigation

LLMDFA: Analyzing Dataflow in Code with Large Language Models

Installation

Quick Start

Remark on Dataset

More Programming Languages

Contributing

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages