Skip to content

Lightweight network packet aggregation and efficient storage and querying of flows

License

Notifications You must be signed in to change notification settings

open-ch/goProbe

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

goProbe

This package comprises:

  • goProbe - A lightweight, concurrent, network packet aggregator
  • goDB - A small, high-performance, columnar database
  • goQuery - Query front-end used to read out data acquired by goProbe and stored by goDB
  • goConvert - Helper binary to convert goProbe-flow data stored in csv files

As the name suggests, all components are written in Google go.

Introduction

Today, targeted analyses of network traffic patterns have become increasingly difficult due to the sheer amount of traffic encountered. To enable them, traffic needs to be captured and examined and broken down to key descriptors which yield a condensed explanation of the underlying data.

The NetFlow standard was introduced to address this reduction. It uses the concept of flows, which combine packets based on a set of shared packet attributes. NetFlow information is usually captured on one device and collected in a central database on another device. Several software probes are available, implementing NetFlow exporters and collectors.

goProbe deviates from traditional NetFlow as flow capturing and collection is run on the same device and the flow fields reduced. It was designed as a lightweight, standalone system, providing both optimized packet capture and a storage backend tailored to the flow data.

goProbe

goProbe captures packets using libpcap and gopacket and extracts several attributes which are used to classify the packet in a flow-like data structure:

  • Source and Destination IP
  • IP Protocol
  • Destination Port (if available)
  • Application Layer Protocol

Available flow counters are:

  • Bytes sent and received
  • Packet sent and received

In summary: a goProbe-flow is not a NetFlow-flow.

The flow data is written out to a custom colum store called goDB, which was specifically designed to accomodate goProbe's data. Each of the above attributes is stored in a column file

Usage

Capturing is performed concurrently by goProbe on multiple interfaces which are specified as arguments to the program, which is started as follows (as root):

/usr/local/goProbe/bin/goProbe <iface 1> <iface 2> ... <iface n>

The capturing probe can be run as a daemon via

/etc/init.d/goprobe.init {start|stop|status|restart|reload|force-reload}

By default, the interface eth0 is specified. If you want to perform capturing on other interfaces, change the respective line in goprobe.init (the variable DAEMON_ARGS stores the interfaces).

Update: version 1.05 supports the interface specification via a configuration file which should be saved as /usr/local/goProbe/etc/goprobe.conf and include a space-delimited list of interfaces on which capturing should be performed. In order for the changes to take effect the reload target should be used.

goDB

The flow records are stored block-wise on a five minute basis in their respective attribute files. The database is partitioned on a per day basis, which means that for each day, a new folder is created which holds the attribute files for all flow records written throughout the day.

Blocks are compressed using lz4 compression, which was chosen to enable both swift decompression and good data compression ratios.

goDB is a package which can be imported by other go applications.

goQuery

goQuery is the query front which is used to access and aggregate the flow information stored in the database. The following query types are supported:

  • Top talkers: show data traffic volume of all unique IP pairs
  • Top Applications (port/protocol): traffic volume of all unique destination port-transport protocol pairs, e.g., 443/TCP
  • Top Applications (layer 7): traffic volume by application layer protocol, e.g. SSH, HTTP, etc.

Usage

For a comprehensive help on how to use goQuery type /usr/local/goProbe/bin/goQuery -h

Example Output

root@analyzer# /usr/local/goProbe/bin/goQuery -i eth0 -n 10 -c proto=TCP talk_conv
Your query: talk_conv
Conditions: proto=TCP
Sort by:    accumulated data volume (sent and received)
Interface:  eth0
Query produced 779 hits and took 33.66236ms 

                                  sip                   dip    packets       %   data vol.       %
                       215.142.239.52        215.142.238.87    46.13 M   58.33    28.92 GB   89.17
                       215.142.239.52       215.142.226.100    17.74 M   22.43     1.46 GB    4.50
                       215.142.239.52       215.142.238.100    14.08 M   17.80     1.16 GB    3.57
                       215.142.239.52       215.142.226.131   320.68 k    0.41   182.69 MB    0.55
     fd03:ca0:7:c08:a0ae:118:867:d7e3   fd03:ca0:8:c0c::165    98.98 k    0.13   173.35 MB    0.52
                       215.142.239.52       215.142.238.167   266.83 k    0.34   139.72 MB    0.42
                       215.142.239.52        215.142.228.16    23.89 k    0.03   124.34 MB    0.37
    fd03:ca0:7:c08:d910:9c63:f26f:3d2   fd03:ca0:8:c0c::165    80.99 k    0.10   119.30 MB    0.36
   fd03:ca0:7:c08:28d4:b3ee:c16c:9ba4   fd03:ca0:8:c0c::165    49.06 k    0.06    72.62 MB    0.22
   fd03:ca0:7:c08:4c6e:86bb:aec1:1072   fd03:ca0:8:c0c::165    15.91 k    0.02    23.83 MB    0.07

Overall packets: 79.07 M , Overall data volume: 32.44 GB

Converting data

If you use goConvert, you need to make sure that the data which you are importing is temporally ordered and provides a column which stores UNIX timestamps. An example csv file may look as follows:

# HEADER: bytes_rcvd,bytes_sent,dip,dport,l7_proto,packets_rcvd,packets_sent,proto,sip,tstamp
...
40,72,172.23.34.171,8080,158,1,1,6,10.11.72.28,1392997558
40,72,172.23.34.171,49362,158,1,1,6,10.11.72.28,1392999058
...

You must abide by this structure, otherwise the conversion will fail.

Logging Facilities

Both goProbe and goDB write to the Syslog facility. However, the log output is passed to syslog via UDP packets to destination port 514. You will have to make sure that your syslog daemon supports logging via UDP. On most platforms uncommenting the following in /etc/rsyslog.conf should suffice:

$ModLoad imudp
$UDPServerRun 514

Changes should take effect after rebooting the machine.

Installation

Before running the installer, make sure that you have the following dependencies installed:

  • yacc
  • bison
  • curl

The package itself was designed to work out of the box. Thus, you do not even need the go environment. All of the dependencies are downloaded during package configuration. To install the package, go to the directory into which you cloned this repository and run the following commands:

sudo -s
make all

Above command runs the following targets:

  • make clean: removes all dependencies and compiled binaries
  • make configure: downloads the dependencies, configures them and applies patches (if necessary)
  • make compile: compiles dependencies, goProbe and goQuery
  • make install: set up package as a binary tree. The binaries and used libraries are placed in /usr/local/goProbe per default. The init script can be found under /etc/init.d/goprobe.init. It is also possible to install a cronjob used to clean up outdated database entries. It is not installed by default. Uncomment the line in the Makefile if you need this feature. The cronjob can be found in /etc/cron.d/goprobe.cron

By default, goConvert is not compiled. If you wish to do so, add the following line to the install target in the Makefile:

go build -a -o goConvert $(PWD)/addon/gocode/src/OSAG/convert/DBConvert.go

The binary will reside in the directory specified in the above command.

Supported Operating Systems

goProbe is currently set up to run on Linux based systems. Tested versions include:

  • Ubuntu 14.04
  • Debian 7

Support for Mac OS X will follow eventually.

Authors & Contributors

Lennart Elsen and Fabian Kohn, Open Systems AG

This software was developed at Open Systems AG in close collaboration with the Distributed Computing Group at the Swiss Federal Institute of Technology.

License

See the LICENSE file for usage conditions.

About

Lightweight network packet aggregation and efficient storage and querying of flows

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published