q - Treating Text as a Database

Overview

Have you ever stared at a text file on the screen, hoping it would have been a database so you could ask anything you want about it? I had that feeling many times, and I've finally understood that it's not the database that I want. It's the language - SQL.

SQL is a declarative language for data, and as such it allows me to define what I want without caring about how exactly it's done. This is the reason SQL is so powerful, because it treats data as data and not as bits and bytes (and chars).

The goal of this tool is to provide a bridge between the world of text files and of SQL.

"q allows performing SQL-like statements on tabular text data, including joins and subqueries"

You can use this gitter chat room for contacting me directly. I'm trying to be available at the chat room as much as possible.

Highlights

Seamless multi-table SQL support, including joins. filenames are just used instead of table names (use - for stdin)
Automatic column name and column type detection (Allows working more naturally with the data)
Multiple parsing modes - relaxed and strict. Relaxed mode allows to easily parse semi-structured data, such as log files.
Standard installation - RPM, Homebrew (Mac). Debian package coming soon.
Support for quoted fields
Full UTF-8 support (and other encodings)
Handling of gzipped files
Output delimiter matching and selection
Output beautifier
man page when installed through the RPM package

Examples and Tutorial

Quick usage example: sudo find /tmp -ls | q "select c5,c6,sum(c7)/1024.0/1024 as total from - group by c5,c6 order by total desc"

Full examples and a beginner's tutorial can be found here

Installation

Current stable version is 1.3.0.

No special requirements other than python >= 2.5 are needed.

Mac Users

Just run brew install q.

Thanks @stuartcarnie for the initial homebrew formula

Manual installation (very simple, since there are no dependencies)

Download the main q executable from here into a folder in the PATH.
Make the file executable.

For Windows machines, also download q.bat here into the same folder and use it to run q.

RPM-Base Linux distributions

Download the RPM here here.

Install using rpm -ivh <rpm-name>.

RPM Releases also contain a man page. Just enter man q.

Debian-based Linux distributions

Debian packaing is in progress. In the mean time install manually. See the section below.

Usage

q's basic usage is very simple:q <flags> <query>, but it has lots of features under the hood and in the flags that can be passed to the command.

Simplest execution is q "SELECT * FROM myfile" which prints the entire file.

Complete information can be found here

Implementation

Some implementation details can be found here

Limitations

No checks and bounds on data size
Spaces in file names are not supported yet. I'm working on it.

Future Ideas

Faster reuse of previous data loading
Allow working with external DB
Real parsing of the SQL, allowing smarter execution of queries.
Smarter batch insertion to the database
Full Subquery support (will be possible once real SQL parsing is performed)
Provide mechanisms beyond SELECT - INSERT and CREATE TABLE SELECT and such.

Rationale

Some information regarding the rationale for this tool and related philosophy can be found here

Change log

History of changes can be found here

Contact

Any feedback/suggestions/complaints regarding this tool would be much appreciated. Contributions are most welcome as well, of course.

Harel Ben-Attia, [email protected], @harelba on Twitter

Name		Name	Last commit message	Last commit date
Latest commit History 134 Commits
.gitignore		.gitignore
.qrc		.qrc
AUTHORS		AUTHORS
CHANGELOG.markdown		CHANGELOG.markdown
EXAMPLES.markdown		EXAMPLES.markdown
IMPLEMENTATION.markdown		IMPLEMENTATION.markdown
INSTALL.markdown		INSTALL.markdown
LICENSE		LICENSE
RATIONALE.markdown		RATIONALE.markdown
README.markdown		README.markdown
THANKS		THANKS
USAGE.markdown		USAGE.markdown
create-rpm		create-rpm
exampledatafile		exampledatafile
group-emails-example		group-emails-example
q		q
q.bat		q.bat
q.manpage.1.ronn		q.manpage.1.ronn
q.spec.template		q.spec.template
run-tests		run-tests

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

q - Treating Text as a Database

Overview

Highlights

Examples and Tutorial

Installation

Mac Users

Manual installation (very simple, since there are no dependencies)

RPM-Base Linux distributions

Debian-based Linux distributions

Usage

Implementation

Limitations

Future Ideas

Rationale

Change log

Contact

About

Releases

Packages

License

boriel/q

Folders and files

Latest commit

History

Repository files navigation

q - Treating Text as a Database

Overview

Highlights

Examples and Tutorial

Installation

Mac Users

Manual installation (very simple, since there are no dependencies)

RPM-Base Linux distributions

Debian-based Linux distributions

Usage

Implementation

Limitations

Future Ideas

Rationale

Change log

Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages