
Marple

Summary

Collect links to profiles with a given username from 10+ search engines (see the full list below).

Features:

  • multiple engines
  • proxy support
  • CSV file export
  • plugins
    • PDF metadata extraction
    • social media info extraction

Quick Start

./marple.py soxoj

Results:

https://t.me/soxoj
Contact @soxoj - Telegram

https://github.com/soxoj
soxoj - GitHub

https://coder.social/soxoj
soxoj - Coder Social

https://gitmemory.com/soxoj
soxoj

...

PDF files
https://codeby.net/attachments/v-0-0-1-social-osint-fundamentals-pdf.45770
Social OSINT fundamentals - Codeby.net
/Creator: Google

...

Links: total collected 111 / unique with username in URL 97 / reliable 38 / documents 3

Installation

All you need is Python 3. And pip. And the requirements, of course.

pip3 install -r requirements.txt
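
If you are starting from scratch, clone the repository first (a standard Git workflow, nothing marple-specific):

git clone https://github.com/soxoj/marple
cd marple
pip3 install -r requirements.txt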

Options

You can specify a 'junk threshold' with the -t or --threshold option (default 300) to get more or fewer results, traded off against reliability.

The junk score of a result is a sum of penalties based on the length of the link URL and on the characters adjacent to the username inside the URL.

You can also increase the number of results requested from each search engine with the --results-count option (default 1000). Currently this limit only applies to Google.
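
For intuition, here is a minimal sketch of such a scoring function in Python. The name junk_score and the exact penalty values are illustrative assumptions for this README, not marple's actual implementation:

def junk_score(url: str, username: str) -> int:
    # Illustrative scoring (assumption): longer URLs and characters
    # glued to the username make a result look less reliable.
    score = len(url)
    lowered, name = url.lower(), username.lower()
    pos = lowered.find(name)
    if pos == -1:
        return score + 1000  # username not in URL at all: heavy penalty
    # Check the characters immediately before and after the username
    before = lowered[pos - 1] if pos > 0 else ''
    after = lowered[pos + len(name)] if pos + len(name) < len(lowered) else ''
    for ch in (before, after):
        if ch.isalnum():
            score += 100  # e.g. /soxoj123 only loosely matches 'soxoj'
    return score

# Results with junk_score(url, username) <= threshold (default 300)
# would then be kept as reliable.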

Other options:

  -h, --help            show this help message and exit
  -t THRESHOLD, --threshold THRESHOLD
                        Threshold to discard junk search results
  --results-count RESULTS_COUNT
                        Count of results parsed from each search engine
  --no-url-filter       Disable filtering results by usernames in URLs
  --plugin {socid_extractor,metadata,maigret}
                        Additional plugins to analyze links
  -v, --verbose         Display junk score for each result
  -d, --debug           Display all the results from sources and debug messages
  -l, --list            Display only list of all the URLs
  --proxy PROXY         Proxy string (e.g. https://user:pass@1.2.3.4:8080)
  --csv CSV             Save results to the CSV file
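
Putting several options together, e.g. a stricter junk threshold, the metadata plugin, and CSV export (the output filename soxoj.csv is arbitrary):

./marple.py soxoj -t 100 --plugin metadata --csv soxoj.csv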

Supported sources

Name        Method    Requirements
Google      scraping  None, works out of the box; frequent captcha
DuckDuckGo  scraping  None, works out of the box
Yandex      XML API   Register and get USER/API tokens
Aol         scraping  None, scrapes with pagination
Ask         scraping  None, scrapes with pagination
Bing        scraping  None, scrapes with pagination
Startpage   scraping  None, scrapes with pagination
Yahoo       scraping  None, scrapes with pagination
Qwant       scraping  Check if search is available in your exit IP country; scrapes with pagination
Dogpile     scraping  None, scrapes with pagination
Torch       scraping  Tor proxies (socks5://localhost:9050 by default); scrapes with pagination
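
For Torch, a local Tor daemon must already be listening on the SOCKS port. Assuming the --proxy option also accepts SOCKS5 strings (the help above only shows an HTTPS example), pointing marple at a non-default Tor port such as Tor Browser's 9150 would look like:

./marple.py soxoj --proxy socks5://localhost:9150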

Development & testing

$ python3 -m pytest tests

TODO

  • [x] Proxy support
  • [ ] Additional search engines
  • [ ] Engine-specific filters