RegexHarvester

RegexHarvester is a command-line tool written in Go that searches the files in a directory for text matching a regular expression and extracts every match. This makes it useful for data mining and text processing tasks.

Features

Pattern Matching: Utilizes regular expressions to find and extract specific patterns from files.
Directory Scanning: Scans all files within a specified directory (recursively).
File Extension Filtering: Only processes files with a specified extension.
Unique Matches: Ensures that only unique matches are returned.
Sorting: Sorts the results alphabetically for easy readability.

Installation

To install RegexHarvester, you need to have Go installed on your system. Follow these steps:

Clone the repository:

git clone https://github.com/toxyl/regex-harvester.git

Navigate to the project directory:
```
cd regex-harvester
```
Build the project:
```
go build
```
Run the executable:
```
./regex-harvester
```

Usage

Command Line Arguments

RegexHarvester requires three positional command-line arguments, in this order:

File Extension: The extension of the files you want to process (e.g., eml).
Directory: The directory containing the files you want to scan.
Regular Expression: The regular expression pattern you want to match.
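
As a rough sketch of how three positional arguments like these might be validated, the snippet below checks the argument count and compiles the pattern before doing any work. It assumes Go's standard `regexp` package. The helper `parseArgs` and the usage message are hypothetical and not taken from the RegexHarvester source.

```go
package main

import (
	"errors"
	"fmt"
	"os"
	"regexp"
)

// parseArgs validates the three positional arguments (extension, directory,
// pattern) and compiles the pattern.
// Hypothetical helper for illustration; not taken from RegexHarvester.
func parseArgs(args []string) (ext, dir string, re *regexp.Regexp, err error) {
	if len(args) != 3 {
		return "", "", nil, errors.New("usage: regex-harvester <extension> <directory> <regex>")
	}
	re, err = regexp.Compile(args[2])
	if err != nil {
		return "", "", nil, fmt.Errorf("invalid regular expression: %w", err)
	}
	return args[0], args[1], re, nil
}

func main() {
	ext, dir, re, err := parseArgs(os.Args[1:])
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	fmt.Printf("extension=%s directory=%s pattern=%s\n", ext, dir, re)
}
```

Compiling the pattern up front means a malformed expression is reported before any file is read.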

Example

./regex-harvester eml /emails/ '\bfoo(bar|)\b'

This command will:

Scan all .eml files in the /emails/ directory (recursively).
Extract and print all unique matches of the pattern \bfoo(bar|)\b, that is, occurrences of foo and foobar as whole words.
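
When writing patterns, note that square brackets and parentheses behave very differently: `[bar|]` is a character class that matches a single character (b, a, r, or |), while `(bar|)` is a group that matches either bar or nothing. The sketch below compares the two using Go's standard `regexp` package. The helper `matches` and the sample text are made up for illustration.

```go
package main

import (
	"fmt"
	"regexp"
)

// matches returns every non-overlapping match of pattern in text.
// Hypothetical helper for illustration.
func matches(pattern, text string) []string {
	return regexp.MustCompile(pattern).FindAllString(text, -1)
}

func main() {
	text := "foo foob foobar fooz"

	// Character class: "foo" followed by exactly one of b, a, r, |.
	fmt.Println(matches(`\bfoo[bar|]\b`, text)) // prints [foob]

	// Group with an empty alternative: "foo" optionally followed by "bar".
	fmt.Println(matches(`\bfoo(bar|)\b`, text)) // prints [foo foobar]
}
```

Single quotes around the pattern on the command line keep the shell from interpreting the backslashes, brackets, and pipe.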

Output

The output will be a list of unique matches, sorted alphabetically, printed line by line.
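
One way to produce such a list is to collect the matches in a set and sort the remaining values. The sketch below is an assumption about how this could be done, not the project's actual code: `uniqueSorted` is a hypothetical helper, and it uses `sort.Strings`, which orders by byte value, so uppercase letters sort before lowercase ones.

```go
package main

import (
	"fmt"
	"sort"
)

// uniqueSorted drops duplicate matches and returns the rest in ascending
// byte-wise order. Hypothetical helper for illustration.
func uniqueSorted(matches []string) []string {
	seen := make(map[string]struct{}, len(matches))
	out := make([]string, 0, len(matches))
	for _, m := range matches {
		if _, ok := seen[m]; ok {
			continue // already recorded
		}
		seen[m] = struct{}{}
		out = append(out, m)
	}
	sort.Strings(out)
	return out
}

func main() {
	// Print one match per line, as the tool's output is described.
	for _, m := range uniqueSorted([]string{"foobar", "foo", "foobar"}) {
		fmt.Println(m)
	}
}
```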

Contributing

Contributions are welcome! If you have any ideas, suggestions, or bug reports, please open an issue or submit a pull request.

License

This project is released into the public domain under the Unlicense.
