This project adds a basic high availability and consistent hash layer to InfluxDB.
NOTE: influx-proxy must be built with Go 1.14+ with Go module support, don't implement udp.
We used InfluxDB Relay before, but it doesn't support some demands. We use grafana for visualizing time series data, so we need add datasource for grafana. We need change the datasource config when influxdb is down. We need transfer data across idc, but Relay doesn't support gzip. It's inconvenient to analyse data with connecting different influxdb. Therefore, we made InfluxDB Proxy. More details please visit https://github.com/shell909090/influx-proxy.
Forked from the above InfluxDB Proxy, after many improvements and optimizations, InfluxDB Proxy v1 has released, which no longer depends on python and redis, and supports more features.
Since the InfluxDB Proxy v1 is limited by the only ONE
database and the KEYMAPS
configuration, we refactored InfluxDB Proxy v2 with high availability and consistent hash, which supports multiple databases and tools to rebalance, recovery, resync and cleanup.
- Support gzip.
- Support query.
- Support some cluster influxql.
- Filter some dangerous influxql.
- Transparent for client, like cluster for client.
- Cache data to file when write failed, then rewrite.
- Support multiple databases to create and store.
- Support database sharding with consistent hash.
- Support tools to rebalance, recovery, resync and cleanup.
- Load config file and no longer depend on python and redis.
- Support precision query parameter when writing data.
- Support influxdb-java, influxdb shell and grafana.
- Support authentication and https.
- Support health status query.
- Support database whitelist.
- Support version display.
- Golang >= 1.14 with Go module support
$ git clone https://github.com/chengshiwen/influx-proxy.git
$ cd influx-proxy
$ make
$ ./bin/influx-proxy -config proxy.json
$ # build current platform
$ make build
$ # build linux amd64
$ make linux
The architecture is fairly simple, one InfluxDB Proxy instance and two consistent hash circles with two InfluxDB instances respectively. The Proxy should point HTTP requests with db and measurement to the two circles and the four InfluxDB servers.
The setup should look like this:
┌──────────────────┐
│ writes & queries │
└──────────────────┘
│
▼
┌──────────────────┐
│ │
│ InfluxDB Proxy │
│ (only http) │
│ │
└──────────────────┘
│
▼
┌──────────────────┐
│ db,measurement │
│ consistent hash │
└──────────────────┘
| |
┌─┼──────────────┘
│ └────────────────┐
▼ ▼
Circle 1 Circle 2
┌────────────┐ ┌────────────┐
│ │ │ │
│ InfluxDB 1 │ │ InfluxDB 3 │
│ InfluxDB 2 │ │ InfluxDB 4 │
│ │ │ │
└────────────┘ └────────────┘
The configurations in proxy.json
are the following:
circles
: circle listname
: circle name,required
backends
: backend list belong to the circle,required
name
: backend name,required
url
: influxdb addr or other http backend which supports influxdb line protocol,required
username
: influxdb username, with encryption if auth_encrypt is enabled, default isempty
which means no authpassword
: influxdb password, with encryption if auth_encrypt is enabled, default isempty
which means no authauth_encrypt
: whether to encrypt auth (username/password), default isfalse
listen_addr
: proxy listen addr, default is:7076
db_list
: database list permitted to access, default is[]
data_dir
: data dir to save .dat .rec, default isdata
tlog_dir
: transfer log dir to rebalance, recovery, resync or cleanup, default islog
hash_key
: backend key for consistent hash, including "idx", "exi", "name" or "url", default isidx
, once changed rebalance operation is necessaryflush_size
: default is10000
, wait 10000 points writeflush_time
: default is1
, wait 1 second write whether point count has bigger than flush_size configcheck_interval
: default is1
, check backend active every 1 secondrewrite_interval
: default is10
, rewrite every 10 secondsconn_pool_size
: default is20
, create a connection pool which size is 20write_timeout
: default is10
, write timeout until 10 secondsidle_timeout
: default is10
, keep-alives wait time until 10 secondsusername
: proxy username, with encryption if auth_encrypt is enabled, default isempty
which means no authpassword
: proxy password, with encryption if auth_encrypt is enabled, default isempty
which means no authauth_encrypt
: whether to encrypt auth (username/password), default isfalse
write_tracing
: enable logging for the write, default isfalse
query_tracing
: enable logging for the query, default isfalse
https_enabled
: enable https, default isfalse
https_cert
: the ssl certificate to use when https is enabled, default isempty
https_key
: use a separate private key location, default isempty
The following commands are forbid.
ALTER
GRANT
REVOKE
KILL
SELECT INTO
Multiple queries
delimited by semicolon;
Only support match the following commands.
select from
show from
show measurements
show series
show field keys
show tag keys
show tag values
show retention policies
show stats
show databases
create database
drop database
delete from
drop series from
drop measurement
on clause
(thedb
parameter takes precedence when the parameter is set in/query
http endpoint)
There are three tools for benchmarking InfluxDB, which can also be applied to InfluxDB Proxy:
- influx-stress is a stress tool for generating artificial load on InfluxDB.
- influxdb-comparisons contains code for benchmarking InfluxDB against other databases and time series solutions.
- tsbs (Time Series Benchmark Suite) is a tool for comparing and evaluating databases for time series data.
MIT.