We can help you get Chronicle up and running in your organisation, we suggest you invite us in for consultancy, charged on an ad-hoc basis, we can discuss the best options tailored to your individual requirements. - Contact Us
Or you may already be using Chronicle and just want some help - find out more..
Replicate your Key Value Store across your network, with consistency, durability and performance.
<dependency>
<groupId>net.openhft</groupId>
<artifactId>chronicle-map</artifactId>
<version><!--replace with the latest version--></version>
</dependency>
Click here to get the Latest Version Number
- Should I use Chronicle Queue or Chronicle Map
- What is the difference between SharedHashMap and Chronicle Map
- Overview
- JavaDoc
- Getting Started Guide
- Simple Construction
- Maven Download
- Snapshot Download
- Key Value Object Types
- Off Heap and How to improve performance
- Sharing Data Between Two or More Maps
- Entries
- Size of Space Reserved on Disk
- Chronicle Map Interface
- [Oversized Entries Support] (https://github.com/OpenHFT/Chronicle-Map/blob/master/README.md#oversized-entries-support)
- Serialization
- Close
- TCP / UDP Replication
- TCP / UDP Background.
- How to setup UDP Replication
- TCP/IP Throttling
- Replication How it works
- Multiple Processes on the same server with Replication
- Identifier for Replication
- Bootstrapping
- Identifier
- Port
- Heart Beat Interval
- Multi Chronicle Maps - Network Distributed
- Stateless Client
- How to speed up the Chronicle Map Stateless Client
- Questions/Answers
- Known Issues
- Stackoverflow
- [Development Tasks - JIRA] (https://higherfrequencytrading.atlassian.net/browse/HCOLL)
- [Use Case Which include Chronicle Map] (http://openhft.net/products/chronicle-engine/)
- Hello World - A map which stores data off heap
- Sharing the map between two ( or more ) processes on the same computer
- Replicating data between process on different servers with TCP/IP Replication
- [Replicating data between process on different servers with UDP] (https://github.com/OpenHFT/Chronicle-Map/blob/master/README.md#example--replicating-data-between-process-on-different-servers-using-udp)
- Creating a Chronicle Set and adding data to it
- Chronicle Map with Large Data
- [Lock Contention] (https://github.com/OpenHFT/Chronicle-Map/blob/master/README.md#lock-contention)
- Better to use small keys
- ConcurrentHashMap v ChronicleMap
Chronicle Map implements the java.util.concurrent.ConcurrentMap
, however unlike the standard
java map, ChronicleMap is able to share your entries accross processes:
If you compare HashMap
, ConcurrentHashMap
and ChronicleMap
, most of the maps in your system
are likely to be HashMap. This is because HashMap
is lightweight and synchronized HashMap works
well for lightly contended use cases. By contention we mean, how many threads on average are trying
to use a Map. One reason you can't have many contended resources, is that you only have so many
CPUs and they can only be accessing so many resources at once (ideally no more than one or two
per thread at a time).
ConcurrentHashMap
scales very well when highly contended. It uses more memory but if you only
have a few of them, this doesn't matter. They have higher throughput than the other two solutions,
but also it creates the highest garbage. If garbage pressure is an issue for you, you may want
to consider ChronicleMap
One of the main differences between chronicle and ConcurrentHashMap, is how you go about creating an instance see the getting started guide below for details.
If you have;
- lots of small key-values
- you want to minimise garbage produced, and medium lived objects.
- you need to share data between JVMs
- you need persistence
Chronicle queue is designed to send every update. If your network can't do this something has to give. You could compress the data but at some point you have to work within the limits of your hardware or get more hardware. Chronicle Map on the other hand sends the latest value only. This will naturally drop updates and is a more natural choice for low bandwidth connections.
What is the difference between SharedHashMap and Chronicle Map
SharedHashMap was the old name given to ChronicleMap, Since the last release of SharedHashMap we have added a lot of new features to ChronicleMap, most of these are listed in this readme.
To download the JAR which contains Chronicle Map, we recommend you use maven, which will download it
from Maven Central, once you have installed maven, all you have to do is
add the following to your projects pom.xml
:
<dependency>
<groupId>net.openhft</groupId>
<artifactId>chronicle-map</artifactId>
<version><!--replace with the latest version--></version>
</dependency>
To get the latest version number Click Here
when you add ( the above dependency ) to your pom maven will usually attempt to download the release artifacts from:
http://repo1.maven.org/maven2/net/openhft/chronicle-map
If you want to try out the latest pre-release code, you can download the snapshot artifact manually from:
https://oss.sonatype.org/content/repositories/snapshots/net/openhft/chronicle-map/
a better way is to add the following to your setting.xml, to allow maven to download snapshots :
<repository>
<id>Snapshot Repository</id>
<name>Snapshot Repository</name>
<url>https://oss.sonatype.org/content/repositories/snapshots</url>
<snapshots>
<enabled>true</enabled>
</snapshots>
</repository>
and define the snapshot version in your pom.xml, for example:
<dependency>
<groupId>net.openhft</groupId>
<artifactId>chronicle-map</artifactId>
<version><!--replace with the latest version--></version>
</dependency>
Unlike HashMap which will support any heap object, Chronicle Map only works with objects that it can store off heap, so the objects have to be : (one of the following )
- AutoBoxed primitives - for good performance.
- Strings - for good performance.
- implements Serializable
- implements Externalizable ( with a public default constructor )
- implements our custom interface BytesMarshallable ( with a public default constructor ) - use this for best performance.
or value objects that are created through, a directClass interface, for example :
ChronicleMap<String, BondVOInterface> chm = ChronicleMapBuilder
.of(String.class, BondVOInterface.class)
.create();
Object graphs can also be included as long as the outer object supports Serializable, Externalizable or BytesMarshallable.
Creating an instance of Chronicle Map is a little more complexed than just calling a constructor. To create an instance you have to use the ChronicleMapBuilder.
import net.openhft.chronicle.map.*
.....
try {
String tmp = System.getProperty("java.io.tmpdir");
String pathname = tmp + "/shm-test/myfile.dat";
File file = new File(pathname);
ChronicleMapBuilder<Integer, CharSequence> builder =
ChronicleMapBuilder.of(Integer.class, CharSequence.class);
ConcurrentMap<Integer, CharSequence> map = builder.createPersistedTo(file);
} catch (IOException e) {
e.printStackTrace();
}
Chronicle Map stores its data off the java heap, If you wish to share this off-heap memory between processes on the same server, you must provide a "file", this file must be the same "file" for all the instances of Chronicle Map on the same server. The name and location of the "file" is entirely up to you. For the best performance on many unix systems we recommend using tmpfs.
Since this file is memory mapped, if you were to create another instance of the Chronicle Map, pointing to the same file, both Chronicle Maps use this file as a common memory store, which they both read and write into. The good thing about this is the two ( or more instances of the Chronicle Map ) don't have to be running in the same java process. Ideally and for best performance, the two processes should be running on the same server. Since the file is memory mapped, ( in most cases ) the read and writes to the file are hitting the disk cache. Allowing the chronicle map to exchange data between processes by just using memory and in around 40 nanoseconds.
ConcurrentMap<Integer, CharSequence> map1, map2;
// this could be on one process
map1 = ChronicleMapBuilder.of(Integer.class, CharSequence.class).createPersistedTo(file);
// this could be on the other process
map2 = ChronicleMapBuilder.of(Integer.class, CharSequence.class).createPersistedTo(file);
Note: In order to share data between map1 and map2, the file has to point to the same file location on your server.
One of the differences with Chronicle Map against ConcurrentHashMap, is that it can't be resized, unlike the ConcurrentHashMap, Chronicle Map is not limited to the available on heap memory. Resizing is a very expensive operation for Hash Maps, as it can stall your application, so as such we don't do it. When you are building a Chronicle Map you can set the maximum number of entries that you are ever likely to support, its ok to over exaggerate this number. As the Chronicle Map is not limited to your available memory, At worst you will end up having a very large file on disk.
You set the maximum number of entries by the builder:
ConcurrentMap<Integer, CharSequence> map =
ChronicleMapBuilder.of(Integer.class, CharSequence.class)
constantValueSizeBySample("a long sample string"),
.entries(1000) // set the max number of entries here
.create();
In this example above we have set 1000 entries.
We have optimised chronicle, So that you can have situations where you either don't use;
-
all the entries you have allowed for. This works best on Unix where the disk space and memory used reflect the number of actual entries, not the number you allowed for.
-
all the space you allow for each entry. This helps if you have entries which are multiple cache lines (128 bytes +), only the lines you touch sit in your CPU cache and if you have multiple pages (8+ Kbytes) only the pages you touch use memory or disk. The CPU cache usage matters as it can be 10000x smaller than main memory.
The size of each entry depends on the type of the Keys and Values, as some types are larger than others. For example, if an entry uses Integers for both the keys and values, both the key and value each take exactly 4 bytes. There is also some other overhead which is internal to chronicle, such as its internal multi-map, which has an overhead of
- 6 bytes per entry for segment size of < 64k,
- 9 bytes per entry for segment size of < 16m but > 64k
- 12 bytes per entry for segment size of > 16m entries per segment.
Also, if you create a replicated map, there is an additional 10 bytes per entry.
We suggest you don't configure size for constant-sized keys or values, instead you can use the builder methods .constantKeySizeBySample(sampleKey) and .constantValueSizeBySample(sampleValue), For common types like Integer we suggest you don't use these methods, for example ChronicleMap knows that Integer is 4 bytes long, Long is 8, etc.
In linux, if you looked at the size of the 'file', it will report the used entry size so if you have just added one entry, it will report the size of this entry, but Windows will report the reserved size, as it reserves the disk space eagerly ( in fact windows also reserves the memory eagerly as well ) in other words number-of-entries x entry-size.
so on linux, if your type
# It shows you the extents.
ls -l <file>
# It shows you how much is actually used.
du <file>
To illustrate this with an example - On Ubuntu we can create a 100 TB chronicle map. Both top
and
ls -l
say the process virtual size / file size is 100 TB, however the resident memory via du
says the size is 71 MB after adding 10000 entries. You can see the size actually used with du.
As a pure java library, the same chronicle map java byte code can be run on Windows, Linux and Mac OSX. However these operating systems work with memory mapped files differently, these differences effect how chronicle is able to map memory to a file, and hence this can impact the total number of entries that you are able to configure.
- Windows allocates memory and disk eagerly, Windows will fail if more than 4 GB is allocated in a single memory mapping, ( calculated as 4GB = 2^20 * 4 KB pages). Windows doesn't fail when a memory mapped region is mapped, rather it will fail when it is used up. This limitation doesn't apply to newer or server based versions of Windows. Eager memory allocation means you can't map more than free memory, but it should reduce jitter when you use it. In the future we may support multiple mappings to avoid this limitation, but there is no immediate plan to do so.
- Linux allocates memory and disk lazily. Linux systems see a performance degradation at around 200% of main memory.
- Mac OSX allocates memory lazily and disk eagerly.
Chronicle Map allocates head room which is a waste on Windows (Linux's sparse allocation means the head room has little impact).
- on Windows we recommend you use map sizes of less than 4 GB each, and less than 50% main memory in total.
- on Linux we recommend you use small to large maps of less than double main memory. e.g. if you have a 128 GB server, we recommend you have less than 256 GB of maps on the server.
- on Mac OSX, we have no specific recommendations.
The Chronicle Map interface adds a few methods above an beyond the standard ConcurrentMap, the ChronicleMapBuilder can also be used to return the ChronicleMap, see the example below :
ChronicleMap<Integer, CharSequence> map =
ChronicleMapBuilder.of(Integer.class, CharSequence.class).create();
One way to achieve good performance is to focus on unnecessary object creation, as this reduces the amount of work that has to be carried out by the Garbage Collector. As such ChronicleMap supports the following methods :
V getUsing(K key, V value);
V acquireUsing(K key, V value);
- [
ReadContext<K, V> getUsingLocked(@NotNull K key, @NotNull V usingValue);
] (https://github.com/OpenHFT/Chronicle-Map#off-heap-storage-and-how-using-a-proxy-object-can-improve-performance) - [
WriteContext<K, V> acquireUsingLocked(@NotNull K key, @NotNull V usingValue);
] (https://github.com/OpenHFT/Chronicle-Map#acquireusinglocked)
These methods let you provide the object which the data will be written to, but the value use to be mutable. For example
CharSequence using = new StringBuilder();
CharSequence myResult = map.getUsing("key", using);
// at this point "using" and "myResult" will both point to the same object
The map.getUsing()
method is similar to map.get()
, but because Chronicle Map stores its data off
heap, if you were to call get("key"), a new object would be created each time, map.getUsing() works
by reusing the heap memory which was used by the original Object "using". This technique provides
you with better control over your object creation.
Exactly like map.getUsing()
, map.acquireUsing()
will give you back a reference to an value
based on
a key, but unlike map.getUsing()
if there is not an entry in the map for this key the entry
will be added and the value return will we the same value which you provided. ( The section below
covers both map.getUsing()
and map.acquireUsing()
in more detail )
Chronicle Map stores its data off heap. There are some distinct advantages in using off heap data storage
- When the off heap data is backed by a memory mapped file, the entire map and its contents are automatically persisted. So if you have to restart your system, you won’t lose the content of your map.
- Off heap data structures are not visited by the garbage collector, with on heap maps like HashMap the garbage collector has to scan your entire object graph ( in fact just the live objects ) to remove garbage, if you are able to keep your on heap footprint low, the garbage collector has a lot less work to do, this in turn improves your performance.
- Your Chronicle map and your data contained within it is serialised so it can be stored in off heap memory. When this memory is shared between processes on the same server, the chronicle map is able to distribute entries between processes with extremely low overhead, as both processes are sharing the same off heap memory space. Since your objects are already stored off heap ( as a series of bytes ), replicating entries over the network adds relatively low over head.
For more information on the benefit's of off heap memory, see our article on - [On heap vs off heap memory usage] (http://vanillajava.blogspot.co.uk/2014/12/on-heap-vs-off-heap-memory-usage.html)
One of the downsides of an Off Heap Map is that whenever you wish to get a value ( on heap ) from an entry which is off heap, for example calling :
Value v = get(key)
that entry has to be deserialised onto the java heap so that you can use its value just like any other java object. So if you were to call get(key) ten times,
for(int i=1;i<=10;i++) {
Value v = get(key)
}
this would create 10 separate instances of the value. As each time get() is called the map has to
first create a Object to store the result in and then deserialise the value stored in the off
heap entry. If you want to get the value back on heap so that you can use it like a normal java
value, There is nothing we can do about the deserialisation as this has to occur every time,
( since the value may have been changed by another thread ), but we don’t have to create the
object each time. This is why we create getUsing(key,using)
. By reducing the number of objects
you create, you reduce the amount of work that the garbage collector has to carry out,
this in turn may, improve your overall performance. So back to getUsing(key,using)
, If you wish to reuse
and existing object ( in this case the ‘using’ value ), you can instead call :
Value using = new Value();
for(int i=1;i<=10;i++) {
Value bond = map.getUsing(key,using); // this won’t create a new value each time.
assert using == bond; // this will always be the same instance and the Vaule type can not be immutable
}
We can get a further performance improvement if we don't deserialize the whole object, but only deserialize only the bytes that we are actually interested in,
Lets assume that we had the following interface :
public interface LongValue {
long getValue();
void setValue(long value);
}
It is possible to use chronicle as an off heap proxy that can go directly to the memory location off heap and just get back the long that you require. This is especially use full if the value object has a lot of fields in it like the BondVOInterface
public interface BondVOInterface {
...
long getIssueDate();
void setIssueDate(long issueDate); /* time in millis */
long getMaturityDate();
void setMaturityDate(long maturityDate); /* time in millis */
double getCoupon();
void setCoupon(double coupon);
String getSymbol();
void setSymbol(@MaxSize(20) String symbol);
...
}
If for example you want to just do the following
BondVOInterface bond = map.get(key);
bond.getCoupon()
lets say that it is only the coupon
field that we are interested in, then its better not to have to
deserialise the whole object that implements the BondVOInterface
. The ChronicleMapBuilder
will look a the types of keys
and values that you use, If the value type is a simple accessor/mutator interface that is exposing a non nested pojo, which uses simple types
like CharSequence
and primitives with corresponding get..() and
set..() methods, Chronicle is able to generate off heap poxies so the whole object is not desrialized each
time it is accessed, The off heap poxies are able to read
and write into
the off heap data structures directly, this reduced serialisation can give you a big performance boost.
Below we show you how you can work directly with the off heap entries.
ChronicleMap<CharSequence, BondVOInterface> map = ChronicleMapBuilder
.of(String.class, BondVOInterface.class)
.keySize(10)
.create();
notice that the
.of(CharSequence.class, BondVOInterface.class)
value class, in our case BondVOInterface.class
is an interface
rather than a class
, now
like before, we can
use the getUsing(key,using)
method, but this time we have to create the ‘using’ instance slightly
differently, we have to call either the map.newValueInstance()
or map.newKeyInstance()
method.
BondVOInterface using = map.newValueInstance();
the call to getUsing(key,value)
is the same as we had in the earlier example
BondVOInterface using = map.newValueInstance();
BondVOInterface bond = map.getUsing(key,using);
assert using == bond; // this will always be the same instance
getUsing(key,using)
won’t create any on heap objects, and it won’t deserialize the ‘bond’ entry from off heap to on
heap, all it does is sets the bond as a proxy to the off heap memory, this proxy object was created by
map.newValueInstance()
, it allows us access to the fields of our
entry, directly into the off heap storage.
As above, ideally you would reuse the using
variable.
BondVOInterface using = map.newValueInstance();
for(int i=1;i<=10;i++) {
Value bond = map.getUsing(key,using); // this won’t create a new value each time.
double coupon = bond.getCoupon()
assert using == bond; // this will always be the same instance
}
so when you call :
bond.getCoupon()
its only the coupon that gets deserialized.
Just like any other concurrent map, chronicle map uses segment locking, if you wish to obtain a read lock when calling getUsing(key,using) you can do this :
BondVOInterface bond = map.getUsing(key,value);
try (ReadContext<?, BondVOInterface> context = map.getUsingLocked(key,bond)) {
long issueDate = bond.getIssueDate();
String symbol = bond.getSymbol();
// add your logic here ( the lock will ensure this bond can not be changed by another thread )
} // the lock is released here because close() is called
To ensure that you can read the 'issueDate' and 'symbol' can be read atomically, these values must be read while the segment read lock is in place.
when you call map.getUsingLocked(key,using) we return a ReadContext, the ReadContext extends AutoCloseable so will automatically unlock the segment when the try block is exited.
If you wish not to use a try block you must manually release the segment lock by calling
context.close() // the lock will get released when this is called
Just like getUsing(), acquireUsing() will also recycle the value you pass it, the following code is a pattern that you will often come across,
V acquireUsing(key,value) {
Lock l = ...; // the segment lock of the map
l.lock();
try {
V v = map.getUsing(key,value)
if (v == null) {
map.put(key,value); // create a new entry
return value;
} else
return v; // return the value of the existing entry
} finally {
l.unlock();
}
}
acquireUsing(key,value)
offers this functionality from a single method call, an be cause it
implemented inside the chronicle map we are able to reduce the hash lookups and make it more
efficient.
If you are only accessing ChronicleMap from a single thread. If you are not doing replication
and don't care about atomic reads. Then its simpler ( and faster ) to use acquireUsing() otherwise we
recommend you use acquireUsingLocked(key,value)
Since the acquireUsing() method can end up creating an entry, the acquireUsingLocked(key,value) method must hold a segment write lock, this is unlike getUsing(key,using) which only holds a segment read lock.
BondVOInterface bond = ... // create your instance
try (WriteContext<?, BondVOInterface> context = map.acquireUsingLocked("one", bond)) {
assert bond == context.value();
long issueDate = bond.getIssueDate();
String symbol = bond.getSymbol();
if (condition) // based on your own business logic
context.removeEntry();
}
If after you have read the 'issueDate' and 'symbol' and you wish to remove the entry based on some business logic, its more efficient to use the 'context' to remove the entry, as the contents is already aware when the entry is in memory.
It is possible for the size of your entry to be 64 times as large as the maximum entry size, we refer to this type of entry as an oversized entry. Oversized entries are there to cater for the case where only a small percentage of your entries are larger than the others, in this case your large entry will span across a number of entries. The alternative would be to increase your maximum entry size to be similar to the size of the largest entry, but this approach is wasteful of memory, especially when most entries are no where near the max entry size.
Chronicle Map stores your data into off heap memory, so when you give it a Key or Value, it will serialise these objects into bytes.
If you are using simple auto boxed objects based on the primitive types, Chronicle Map will automatically handle the serialisation for you.
For anything other than the standard object, the Objects either have to :
- implement "java.io.Serializable" ( which we don't recommend as this can be slow )
- we also support "java.io.Externalizable", we recommend this over Serializable as its usually faster.
- or for the best performance implement net.openhft.lang.io.serialization.BytesMarshallable, an example of how to do this can be found at "IntValue$$Native"
- alternatively, you could write a "Custom Marshaller", the custom marshaller can be implemented for a single type or a number of types.
Chronicle Map supports importing and exporting all the entries into a JSON encoded file.
void getAll(File toFile) throws IOException;
void putAll(File fromFile) throws IOException;
Its only the entries of your map that are exported, not the configuration of your map. So care must be taken to populate the data in to a map of the correct Key/Value type and with enough available entries. When importing data :
- entries that are in the map but not in the JSON file will remain untouched.
- entries that are in the map and in the JSON file will be updated.
- entries that are not in the map but are in the JSON file will be added.
In other words importing data into a Chronicle Map works like map.putAll(<JSON entries>)
.
When Importing data if you are also writing to the map at the same time, the last update will win. In other words a write lock is not held for the entire import process. Importing and exporting the map, is ideal if you wish to:
- Bulk load data from one chronicle map into another.
- migrate data between versions of chronicle map.
WARNING : The current version only supports Chronicle Maps that contained serialized KEYS and
VALUES, future versions will support a binary encoding of objects that are net.openhft.lang.io .serialization.BytesMarshallable
.
Unlike ConcurrentHashMap, chronicle map stores its data off heap, often in a memory mapped file. Its recommended that you call close() once you have finished working with a Chronicle Map.
map.close()
This is especially important when working with chronicle map replication, as failure to call close may prevent you from restarting a replicated map on the same port. In the event that your application crashes it may not be possible to call close(). Your operating system will usually close dangling ports automatically, so although it is recommended that you close() when you have finished with the map, its not something that you must do, its just something that we recommend you should do.
If you call close too early before you have finished working with the map, this can cause your JVM to crash. Close MUST BE the last thing that you do with the map.
Chronicle Map supports both TCP and UDP replication
TCP/IP is a reliable protocol, what this means is unless you have a network failure or hardware outage the data is guaranteed to arrive. TCP/IP provides point to point connectivity. So in effect ( over simplified ), if the message was sent to 100 hosts, The message would have to be sent 100 times. With UDP, the message is only sent once. This is ideal if you have a large number of hosts and you wish to broadcast the same data to each off them. However, one of the big drawbacks with UDP is its not a reliable protocol. This means, if the UDP message is Broadcast onto the network, The hosts are not guaranteed to receive it, so they can miss data. Some solutions attempt to build resilience into UDP, but arguably, this is in effect reinventing TCP/IP.
In reality on a good quality wired LAN, when using UDP, you will rarely miss messages, this is a risk that we suggest you don't take. We suggest that whenever you use UDP replication you use it in conjunction with a throttled TCP replication, therefore if a host misses a message over UDP, they will later pick it up via TCP/IP.
We are careful not to swamp your network with too much TCP/IP traffic, We do this by providing a throttled version of TCP replication. This works because Chronicle Map only broadcasts the latest update of each entry.
Chronicle Map provides multi master hash map replication, What this means, is that each remote hash-map, mirrors its changes over to another remote hash map, neither hash map is considered the master store of data, each hash map uses timestamps to reconcile changes. We refer to in instance of a remote hash-map as a node. A node can be connected to up to 128 other nodes. The data that is stored locally in each node becomes eventually consistent. So changes made to one node, for example by calling put() will be replicated over to the other node. To achieve a high level of performance and throughput, the call to put() won’t block, With concurrentHashMap, It is typical to check the return code of some methods to obtain the old value for example remove(). Due to the loose coupling and lock free nature of this multi master implementation, this return value is only the old value on the nodes local data store. In other words the nodes are only concurrent locally. Its worth realising that another node performing exactly the same operation may return a different value. However reconciliation will ensure the maps themselves become eventually consistent.
If two ( or more nodes ) receive a change to their maps for the same key but different values, say by a user of the maps, calling the put(key,value). Then, initially each node will update its local store and each local store will hold a different value, but the aim of multi master replication is to provide eventual consistency across the nodes. So, with multi master when ever a node is changed it will notify the other nodes of its change. We will refer to this notification as an event. The event will hold a timestamp indicating the time the change occurred, it will also hold the state transition, in this case it was a put with a key and value. Eventual consistency is achieved by looking at the timestamp from the remote node, if for a given key, the remote nodes timestamp is newer than the local nodes timestamp, then the event from the remote node will be applied to the local node, otherwise the event will be ignored. Since none of the nodes is a primary, each node holds information about the other nodes. For this node its own identifier is referred to as its 'localIdentifier', the identifiers of other nodes are the 'remoteIdentifiers'. On an update or insert of a key/value, this node pushes the information of the change to the remote nodes. The nodes use non-blocking java NIO I/O and all replication is done on a single thread. However there is an edge case, If two nodes update their map at the same time with different values, we had to deterministically resolve which update wins, because of eventual consistency both nodes should end up locally holding the same data. Although it is rare two remote nodes could receive an update to their maps at exactly the same time for the same key, we had to handle this edge case, its therefore important not to rely on timestamps alone to reconcile the updates. Typically the update with the newest timestamp should win, but in this example both timestamps are the same, and the decision made to one node should be identical to the decision made to the other. This dilemma is resolved by using a node identifier, the node identifier is a unique 'byte' value that is assigned to each node, So when the time stamps are the same if the remoteNodes identifier is smaller than the local nodes identifier, this update will be accepted otherwise it will be ignored.
On a server if you have a number of java processes and then within each java process you create a instance of the same Chronicle map which binds to the same underline 'file', they exchange data via shared memory rather than TCP or UDP replication. So if a Chronicle Map which is not performing TCP Replication is updated, this update can be picked up by another Chronicle Map, this other Chronicle Hash Map could be a TCP replicated Chronicle Map, In this example the TCP replicated Chronicle Map would then push the update to the remote nodes.
Likewise, If the TCP replicated Chronicle Map was to received an update from a remote node, then
this update would be immediately available to all the Chronicle Maps on the server.
If all you are doing is replicating your chronicle maps on the same server you don't have to set up TCP and UDP replication. You also don't have to set the identifiers.
If however you wish to replicate data between 2 or more servers, then ALL of the Chronicle Maps including those not actively participating in TCP or UDP replication must have the identifier set. The identifier must be unique to each server. Each Chronicle Map on the same server must have the same identifier. The reason that all Chronicle Maps must have the identifier set, is because the memory is laid out slightly differently when using replication, so even if a Map is not actively performing TCP or UDP replication its self, if it wishes to replicate with one that is, it must have its memory laid out the same way to be compatible.
If the identifiers are not set up uniquely then the updates will be ignored, as for example a Chronicle Map set up with the identifiers equals '1', will ignore all events which contain the remote identifier of '1', in other words Chronicle Map replication is set up to ignore updates which have originated from itself. This is to avoid the circularity of events.
When setting up the identifier you can use values from 1 to 127. ( see the section above for more information on identifiers and how they are used in replication. )
The identifier is setup on the builder as follows.
TcpTransportAndNetworkConfig tcpConfig = ...
map = ChronicleMapBuilder
.of(Integer.class, CharSequence.class)
.replication(identifier, tcpConfig)
.create();
When a node is connected over the network to an active grid of nodes. It must first receive any data that it does not have from the other nodes. Eventually, all the nodes in the grid have to hold a copy of exactly the same data. We refer to this initial data load phase as bootstrapping. Bootstrapping by its very nature is point to point, so it is only performed over TCP replication. For architectures that wish to use UDP replication it is advised you use TCP Replication as well. A grid which only uses UDP replication will miss out on the bootstrapping, possibly leaving the nodes in an inconsistent state. To avoid this, if you would rather reduce the amount of TCP traffic on your network, we suggest you consider using a throttle TCP replication along with UDP replication. Bootstrapping is not used when the nodes are on the same server, so for this case, TCP replication is not required.
Each map is allocated a unique identifier
Server 1 has:
.replication((byte) 1, tcpConfigServer1)
Server 2 has:
.replication((byte) 2, tcpConfigServer2)
Server 3 has:
.replication((byte) 3, tcpConfigServer3)
If you fail to allocate a unique identifier replication will not work correctly.
Each map must be allocated a unique port, the port has to be unique per server, if the maps are running on different hosts they could be allocated the same port, but in our example we allocated them different ports, we allocated map1 port 8076 and map2 port 8077. Currently we don't support data forwarding, so it important to connect every remote map, to every other remote map, in other words you can't have a hub configuration where all the data passes through a single map which every other map is connected to. So currently, if you had 4 servers each with a Chronicle Map, you would require 6 connections.
In our case we are only using 2 maps, this is how we connected map1 to map 2.
TcpTransportAndNetworkConfig.of(8076, new InetSocketAddress("localhost", 8077))
.heartBeatInterval(1, SECONDS);
you could have put this instruction on map2 instead, like this
TcpTransportAndNetworkConfig.of(8077, new InetSocketAddress("localhost", 8076))
.heartBeatInterval(1, SECONDS);
even though data flows from map1 to map2 and map2 to map1 it doesn't matter which way you connected this, in other words its a bidirectional connection.
Below is example how to set up tcpConfig for 3 host
String hostServer1 = "localhost"; // change this to your host
int serverPort1 = 8076; // change this to your port
InetSocketAddress inetSocketAddress1 = new InetSocketAddress(hostServer1, serverPort1);
String hostServer2 = "localhost"; // change this to your host
int serverPort2= 8077; // change this to your port
InetSocketAddress inetSocketAddress2 = new InetSocketAddress(hostServer2, serverPort2);
String hostServer3 = "localhost"; // change this to your host
int serverPort3 = 8078; // change this to your port
InetSocketAddress inetSocketAddress3 = new InetSocketAddress(hostServer3, serverPort3);
// this is to go on server 1
TcpTransportAndNetworkConfig tcpConfigServer1 =
TcpTransportAndNetworkConfig.of(serverPort1);
// this is to go on server 2
TcpTransportAndNetworkConfig tcpConfigServer2 = TcpTransportAndNetworkConfig
.of(serverPort2, inetSocketAddress1);
// this is to go on server 3
TcpTransportAndNetworkConfig tcpConfigServer3 = TcpTransportAndNetworkConfig
.of(serverPort3, inetSocketAddress1, inetSocketAddress2);
We set a heartBeatInterval, in our example to 1 second
heartBeatInterval(1, SECONDS)
A heartbeat will only be send if no data is transmitted, if the maps are constantly exchanging data no heartbeat message is sent. If a map does not receive either data of a heartbeat the connection is dropped and re-established.
Chronicle Map TCP Replication lets you distribute a single Chronicle Map, to a number of servers
across your network. Replication is point to point and the data transfer is bidirectional, so in the
example of just two servers, they only have to be connected via a single tcp socket connection and
the data is transferred both ways. Which is great, however what if you wanted to replicate more than
just one chronicle map, what if you were going to replicate two chronicle maps across your network,
unfortunately with just TCP replication you would have to have two tcp socket connections, which is
not ideal. This is why we created the ReplicationHub
. The ReplicationHub
lets you replicate numerous
Chronicle Maps via a single point to point socket connection.
The ReplicationHub
encompasses TCP replication, where each map has to be given a
unique identifier, but when using the ReplicationHub
we use a channel to identify the map,
rather than the identifier, As the identifier is used to identify the host/server which broadcasts the
update. Put simply:
- Each host mustbe given a unique identifier.
- Each map must be given a unique channel.
int maxEntrySize = 1024;
byte identifier= 2;
ReplicationHub replicationHub = ReplicationHub.builder()
.maxEntrySize(maxEntrySize)
.tcpTransportAndNetwork(tcpConfig)
.createWithId(identifier);
In this example above the ReplicationHub
is given the identifier of 2.
In addition to specifying the identifier we also have to set the maximum entry size, this sets
the size of the memory buffers within the ReplicationChannel
. This has to be set manually, with channels you
are able to attach additional maps to a ReplicationChannel
once its up and running, so the maximum size of each
entry in the map can not be known in advance and we don’t currently support automatic resizing
of buffers.
When creating the ReplicationChannel
you should attach your tcp or udp configuration
int maxEntrySize = 1024;
byte identifier = 1;
ReplicationHub replicationHub = ReplicationHub.builder()
.maxEntrySize(maxEntrySize)
.tcpTransportAndNetwork(tcpConfig)
.createWithId(identifier);
Attaching a ReplicationChannel
to the replication to the map:
short channel = (short) 2;
ChronicleMap<Integer, CharSequence> map = ChronicleMapBuilder.of(Integer.class, CharSequence.class)
.entries(1000)
.instance()
.replicatedViaChannel(replicationHub.createChannel(channel))
.create();
The chronicle channel is use to identify which map is to be replicated to which other map on the remote node, in the example above this is assigned to '(short) 1', so for example if you have two maps, lets call them map1 and map2, you could assign them with chronicle channels 1 and 2 respectively. Map1 would have the chronicle channel of 1 on both servers. You should not confuse the Chronicle Channels with the identifiers, the identifiers are unique per replicating node ( in this case which host, the reason we say replicating node rather than host as it is possible to have more than one replicating node per host if each of them had a different TCP/IP port ) ), where as the chronicle channels are used to identify which map you are referring. No additional socket connection is made per chronicle channel that you use, so we allow up to 32767 chronicle channels.
If you inadvertently got the chronicle channels around the wrong way, then chronicle would attempt to replicate the wrong maps data. The chronicle channels don't have to be in order but they must be unique for each map you have.
// server 1 with identifier = 1
{
ChronicleMapBuilder<CharSequence, CharSequence> smallStringToStringMapBuilder =
ChronicleMapBuilder.of(CharSequence.class, CharSequence.class)
.entries(1000);
byte identifier = (byte) 1;
TcpTransportAndNetworkConfig tcpConfig = TcpTransportAndNetworkConfig
.of(8086, new InetSocketAddress("localhost", 8087))
.heartBeatInterval(1, SECONDS);
hubOnServer1 = ReplicationHub.builder()
.tcpTransportAndNetwork(tcpConfig)
.createWithId(identifier);
// this demotes favoriteColour
short channel1 = (short) 1;
favoriteColourServer1 = smallStringToStringMapBuilder.instance()
.replicatedViaChannel(hubOnServer1.createChannel(channel1)).create();
favoriteColourServer1.put("peter", "green");
// this demotes favoriteComputer
short channel2 = (short) 2;
favoriteComputerServer1 = smallStringToStringMapBuilder.instance()
.replicatedViaChannel(hubOnServer1.createChannel(channel2)).create();
favoriteComputerServer1.put("peter", "dell");
}
// server 2 with identifier = 2
{
ChronicleMapBuilder<CharSequence, CharSequence> smallStringToStringMapBuilder =
ChronicleMapBuilder.of(CharSequence.class, CharSequence.class)
.entries(1000);
byte identifier = (byte) 2;
TcpTransportAndNetworkConfig tcpConfig = TcpTransportAndNetworkConfig
.of(8087).heartBeatInterval(1, SECONDS);
hubOnServer2 = ReplicationHub.builder()
.tcpTransportAndNetwork(tcpConfig)
.createWithId(identifier);
// this demotes favoriteColour
short channel1 = (short) 1;
favoriteColourServer2 = smallStringToStringMapBuilder.instance()
.replicatedViaChannel(hubOnServer2.createChannel(channel1)).create();
favoriteColourServer2.put("rob", "blue");
// this demotes favoriteComputer
short channel2 = (short) 2;
favoriteComputerServer2 = smallStringToStringMapBuilder.instance()
.replicatedViaChannel(hubOnServer2.createChannel(channel2)).create();
favoriteComputerServer2.put("rob", "mac");
favoriteComputerServer2.put("daniel", "mac");
}
// allow time for the recompilation to resolve
for (int t = 0; t < 2500; t++) {
if (favoriteComputerServer2.equals(favoriteComputerServer1) &&
favoriteColourServer2.equals(favoriteColourServer1))
break;
Thread.sleep(1);
}
assertEquals(favoriteComputerServer1, favoriteComputerServer2);
Assert.assertEquals(3, favoriteComputerServer2.size());
assertEquals(favoriteColourServer1, favoriteColourServer2);
Assert.assertEquals(2, favoriteColourServer1.size());
favoriteColourServer1.close();
favoriteComputerServer2.close();
favoriteColourServer2.close();
favoriteColourServer1.close();
}
A stateless client is an instance of a ChronicleMap
or a ChronicleSet
that does not hold any
data
locally, all the Map or Set operations are delegated via a Remote Procedure Calls ( RPC ) to
another ChronicleMap
or ChronicleSet
which we will refer to as the server. The server
holds all your data, the server can not it’s self be a stateless client. Your stateless client must
be connected to the server via TCP/IP.
The stateless client delegates all your method calls to
the remote server. The stateless client operations will block, in other words the stateless
client waits for the server to send a response before continuing to the next operation. The stateless
client could be consider to be a ClientProxy to ChronicleMap
or ChronicleSet
running
on another host.
Below is an example of how to configure a stateless client.
final ChronicleMap<Integer, CharSequence> serverMap;
final ChronicleMap<Integer, CharSequence> statelessMap;
// server
{
ChronicleMapBuilder.of(Integer.class, CharSequence.class)
.replication((byte) 2, TcpTransportAndNetworkConfig.of(8076))
.create();
serverMap.put(10, "EXAMPLE-10");
}
// stateless client
{
statelessMap = ChronicleMapBuilder.of(Integer.class, CharSequence.class)
.statelessClient(new InetSocketAddress("localhost", 8076))
.create();
Assert.assertEquals("EXAMPLE-10", statelessMap.get(10));
Assert.assertEquals(1, statelessMap.size());
}
serverMap.close();
statelessMap.close();
When used with a stateless client, Each state-full server has to be configured with TCP replication, when you set up TCP Replication you must define a port for the replication to run on, the port you choose is up to you, but you should pick a free port that is not currently being used by another application. In this example we choose the port 8076
// sets the server to run on localhost:8076
.replication((byte) 2, TcpTransportAndNetworkConfig.of(8076))
On the "stateless client" we connect to the server via TCP/IP on localhost:8076 :
.statelessClient(new InetSocketAddress("localhost", 8076))
but in your example you should choose the host of the state-full server and the port you allocated it.
.statelessClient(new InetSocketAddress(<host of state-full server>, <port of state-full server>))
the ".statelessClient(..)" returns an instance of StatelessClientConfig
, which has only a few
of its own configurations, such as the create()
method, which can be used to create a new
stateless client.
If you don’t add this line, a normal state-full ChronicleMap
will be created. For this example
we ran both
the client an the server on the same host ( hence the “localhost" setting ),
but in a real life example the stateless client will typically be on a different server than the
state-full host. If you are aiming to create a stateless client and server on the same host, its
better not to do this, as the stateless client connects to the server via TCP/IP. It better to
share the maps via memory as this will give you better performance ( read more about
this at [Sharing Data Between Two or More Maps](https://github
.com/OpenHFT/Chronicle-Map#sharing-data-between-two-or-more-maps).
click here
When calling the stateless client, you will get better throughput if you invoke your requests from a
number of threads, this is because by default when you make a method call to a ChronicleMap
stateless client, your method call is wrapped into an event which is sent over tcp and processed
by the server,
your stateless client will block until an acknowledgement has been received from the server that
your event was processed.
When you are calling methods that return a value like get() this
blocking adds no additional overhead, because you have to wait for the return value anyway,
In some cases you could get better performance if you don't have to wait for the acknowledgement, This maybe the case when you are calling the put()
method, but the problem with this method is it returns the old value even though you may not use it.
For in memory data-structures like a HashMap this isn’t a big problem. But in a distributed environment things are a bit more complicated. Blocking for an old value that you don’t require adds additional network overhead and additional processing overhead of serialising the old value on the server and deserialising it on the client
So if you don’t require the old value and don’t wish to block until your put()
has been received by the server, then you may wish to consider using the following configuration :
.putReturnsNull(true)
and also for the remove()
method
.removeReturnsNull(true)
statelessClientMap = ChronicleMapBuilder.of(Integer.class, CharSequence.class)
.putReturnsNull(true)
.removeReturnsNull(true)
.statelessClient(new InetSocketAddress("localhost", 8076))
.create();
For the very best performance you should also set these properties on the server as well
ChronicleMapBuilder.of(Integer.class, CharSequence.class)
.replication((byte) 2, TcpTransportAndNetworkConfig.of(8076))
.putReturnsNull(true)
.removeReturnsNull(true)
.create();
The throughput and latency performance for different configurations.
Tested using a test called BGChronicleTest.
On one machine (i7 3.5 GHz) we have two persisted replicas run as
-Dreplicas=2 eg.BGChronicleTest server
On another machine (Dual Xeon 8 core 2.6 GHz) connected via a pair of Solarflare SFN5121T 10 Gig-E with onload enabled.
-Dreplicas=2 -Dclients={see below} -DmaxRate=30000 -DreadRatio=2 eg.BGChronicleTest client
2 clients Throughput test messages per seconds: 58,864
Latency test at 30,000 msg/sec 50% / 90% / 99% // 99.9% / 99.99% / worst latency was 33 / 80 / 111 // 120 / 148 / 3,921 us
4 clients Throughput test messages per seconds: 94,006
Latency test at 30,000 msg/sec 50% / 90% / 99% // 99.9% / 99.99% / worst latency was 32 / 94 / 106 // 131 / 177 / 2,153 us
8 clients Throughput test messages per seconds: 162,961
Latency test at 30,000 msg/sec 50% / 90% / 99% // 99.9% / 99.99% / worst latency was 35 / 94 / 117 // 140 / 167 / 1,685 us
16 clients Throughput test messages per seconds: 267,097
Latency test at 30,000 msg/sec 50% / 90% / 99% // 99.9% / 99.99% / worst latency was 38 / 97 / 122 // 149 / 174 / 2,771 us
24 clients Throughput test messages per seconds: 253,052
Latency test at 30,000 msg/sec 50% / 90% / 99% // 99.9% / 99.99% / worst latency was 40 / 99 / 121 // 151 / 243 / 3,669 us
its always important to close ChronicleMap
's and ChronicleSet
's when you have finished with them
serverMap.close();
statelessMap.close();
Chronicle map lets you assign a map larger than your available memory, If you were to create more entries than the available memory, chronicle map will page the segments that are accessed least to disk, and load the recently used segments into available memory. This feature lets you work with extremely large maps, it works brilliantly on Linux but unfortunately, this paging feature is not supported on Windows, if you use more memory than is physically available on windows you will experience the following error :
Java frames: (J=compiled Java code, j=interpreted, Vv=VM code)
j sun.misc.Unsafe.compareAndSwapLong(Ljava/lang/Object;JJJ)Z+0
j net.openhft.lang.io.NativeBytes.compareAndSwapLong(JJJ)Z+13
j net.openhft.lang.io.AbstractBytes.tryLockNanos8a(JJ)Z+12
j net.openhft.lang.io.AbstractBytes.tryLockNanosLong(JJ)Z+41
j net.openhft.collections.AbstractVanillaSharedHashMap$Segment.lock()V+12
It will throw this exception :
Caught: java.lang.IllegalStateException: VanillaShortShortMultiMap is full
java.lang.IllegalStateException: VanillaShortShortMultiMap is full
at net.openhft.collections.VanillaShortShortMultiMap.nextPos(VanillaShortShortMultiMap.java:226)
at net.openhft.collections.AbstractVanillaSharedHashMap$Segment.put(VanillaSharedHashMap.java:834)
at net.openhft.collections.AbstractVanillaSharedHashMap.put0(VanillaSharedHashMap.java:348)
at net.openhft.collections.AbstractVanillaSharedHashMap.put(VanillaSharedHashMap.java:330)
Chronicle Map doesn't resize automatically. It is assumed you will make the virtual size of the map larger than you need and it will handle this reasonably efficiently. With the default settings you will run out of space between 1 and 2 million entries.
You should set the .entries(..) and .entrySize(..) to those you require.
If you put() and entry that is much larger than the max entry size set via entrySize(), the code will error. To see how to set the entry size the example below sets the entry size to 10, you should pick a size that is the size in bytes of your entries :
ChronicleMap<Integer, String> map =
ChronicleMapBuilder.of(Integer.class, String.class)
.entrySize(10).create();
This example will throw an java.lang.IllegalArgumentException because the entrySize is too small.
@Test
public void test() throws IOException, InterruptedException {
ChronicleMap<Integer, String> map =
ChronicleMapBuilder.of(Integer.class, String.class)
.entrySize(10).create();
String value = new String(new char[2000]);
map.put(1, value);
Assert.assertEquals(value, map.get(1));
}
If the entry size is dramatically too small ( like in the example below ), you will get a malloc_error_break :
@Test
public void test() throws IOException, InterruptedException {
ChronicleMap<Integer, String> map =
ChronicleMapBuilder.of(Integer.class, String.class)
.entrySize(10).create();
String value = new String(new char[20000000]);
map.put(1, value);
Assert.assertEquals(value, map.get(1));
}
This simple chronicle map, works just like ConcurrentHashMap but stores its data off-heap. If you want to use Chronicle Map to share data between java process you should look at the next exampl
Map<Integer, CharSequence> map = ChronicleMapBuilder.of(Integer.class,
CharSequence.class).create();
map.put(1, "hello world");
System.out.println(map.get(1));
Lets assume that we had two server, lets call them server1 and server2, if we wished to share a map between them, this is how we could set it up
// --- RUN ON ONE JAVA PROCESS ( BUT ON THE SAME SERVER )
{
File file = new File("a-new-file-on-your-sever");
Map<Integer, CharSequence> map1 = ChronicleMapBuilder.of(Integer.class, CharSequence.class)
.createPersistedTo(file); // this has to be the same file as used by map 2
map1.put(1, "hello world");
}
// --- RUN ON THE OTHER JAVA PROCESS ( BUT ON THE SAME SERVER )
{
File file = new File("a-new-file-on-your-sever"); // this has to be the same file as used by map 1
Map<Integer, CharSequence> map1 = ChronicleMapBuilder.of(Integer.class, CharSequence.class)
.createPersistedTo(file);
System.out.println(map1.get(1));
}
Lets assume that we had two server, lets call them server1 and server2, if we wished to share a map between them, this is how we could set it up
import org.junit.Assert;
import org.junit.Test;
import java.io.IOException;
import java.net.InetSocketAddress;
import java.util.Map;
import java.util.concurrent.TimeUnit;
public class YourClass {
@Test
public void test() throws IOException, InterruptedException {
Map map1;
Map map2;
// ---------- SERVER1 1 ----------
{
// we connect the maps via a TCP/IP socket connection on port 8077
TcpTransportAndNetworkConfig tcpConfig = TcpTransportAndNetworkConfig
.of(8076, new InetSocketAddress("localhost", 8077))
.heartBeatInterval(1L, TimeUnit.SECONDS);
ChronicleMapBuilder<Integer, CharSequence> map1Builder =
ChronicleMapBuilder.of(Integer.class, CharSequence.class)
.entries(20000L)
.replication((byte) 1, tcpConfig);
map1 = map1Builder.create();
}
// ---------- SERVER2 on the same server as ----------
{
TcpTransportAndNetworkConfig tcpConfig =
TcpTransportAndNetworkConfig.of(8077)
.heartBeatInterval(1L, TimeUnit.SECONDS);
ChronicleMapBuilder<Integer, CharSequence> map2Builder =
ChronicleMapBuilder.of(Integer.class, CharSequence.class)
.entries(20000L)
.replication((byte) 2, tcpConfig);
map2 = map2Builder.create();
// we will stores some data into one map here
map2.put(5, "EXAMPLE");
}
// ---------- CHECK ----------
// we are now going to check that the two maps contain the same data
// allow time for the recompilation to resolve
int t = 0;
for (; t < 5000; t++) {
if (map1.equals(map2))
break;
Thread.sleep(1);
}
Assert.assertEquals(map1, map2);
Assert.assertTrue(!map1.isEmpty());
}
}
This example is the same as the one above, but it uses a slow throttled TCP/IP connection to fill in updates that may have been missed when sent over UDP. Usually on a good network, for example a wired LAN, UDP won’t miss updates. But UDP does not support guaranteed delivery, we recommend also running a TCP connection along side to ensure the data becomes eventually consistent. Note : It is possible to use Chronicle without the TCP replication and just use UDP ( that’s if you like living dangerously ! )
import org.junit.Assert;
import org.junit.Test;
import java.io.IOException;
import java.net.Inet4Address;
import java.net.InetSocketAddress;
import java.util.Map;
import java.util.concurrent.TimeUnit;
public class YourClass {
@Test
public void test() throws IOException, InterruptedException {
Map map1;
Map map2;
int udpPort = 1234;
// ---------- SERVER1 1 ----------
{
// we connect the maps via a TCP socket connection on port 8077
TcpTransportAndNetworkConfig tcpConfig = TcpTransportAndNetworkConfig
.of(8076, new InetSocketAddress("localhost", 8077))
.heartBeatInterval(1L, TimeUnit.SECONDS)
// a maximum of 1024 bits per millisecond
.throttlingConfig(ThrottlingConfig.throttle(1024, TimeUnit.MILLISECONDS));
UdpTransportConfig udpConfig = UdpTransportConfig
.of(Inet4Address.getByName("255.255.255.255"), udpPort);
ChronicleMapBuilder<Integer, CharSequence> map1Builder =
ChronicleMapBuilder.of(Integer.class, CharSequence.class)
.entries(20000L)
.replication(SingleChronicleHashReplication.builder()
.tcpTransportAndNetwork(tcpConfig)
.udpTransport(udpConfig)
.createWithId((byte) 1));
map1 = map1Builder.create();
}
// ---------- SERVER2 2 on the same server as ----------
{
TcpTransportAndNetworkConfig tcpConfig =
TcpTransportAndNetworkConfig.of(8077)
.heartBeatInterval(1L, TimeUnit.SECONDS)
.throttlingConfig(ThrottlingConfig.throttle(1024, TimeUnit.MILLISECONDS));
UdpTransportConfig udpConfig = UdpTransportConfig
.of(Inet4Address.getByName("255.255.255.255"), udpPort);
ChronicleMapBuilder<Integer, CharSequence> map2Builder =
ChronicleMapBuilder.of(Integer.class, CharSequence.class)
.entries(20000L)
.replication(SingleChronicleHashReplication.builder()
.tcpTransportAndNetwork(tcpConfig)
.udpTransport(udpConfig)
.createWithId((byte) 2));
map2 = map2Builder.create();
// we will stores some data into one map here
map2.put(5, "EXAMPLE");
}
// ---------- CHECK ----------
// we are now going to check that the two maps contain the same data
// allow time for the recompilation to resolve
int t = 0;
for (; t < 5000; t++) {
if (map1.equals(map2))
break;
Thread.sleep(1);
}
Assert.assertEquals(map1, map2);
Assert.assertTrue(!map1.isEmpty());
}
}
This project also provides the Chronicle Set, ChronicleSet
is built on Chronicle Map, so the builder
configuration are almost identical to ChronicleMap
( see above ), this example shows how to create
a simple off heap set
Set<Integer> set = ChronicleSetBuilder.of(Integer.class).create();
set.add(1);
set.remove(1)
and just like map it support shared memory and TCP replication.
There are general principles we can give direction on - for specific advise we believe consulting to be the most productive solution.
We want the Map to be reasonably general purpose, so in broad terms we can say
- the key and values have to be self contained, ideally trees, rather than graphs.
- ideally values are similar lengths, however we support varying lengths.
- ideally you want to use primitives and use object recycling for performance, though this is not a requirement.
- ideally you have some idea as to the maximum number of entries, though it is not too important if the maximum entries is above what you need.
- if for example you are working with, market depth, this can be supported via an array of nested types.
- we support code generation of efficient custom serializes - See the examples where you provide an interface as the data type, the map will generate the implementation.
Are you finding that when using Chronicle Map the serialisation is slow, but you have a large object graph, if so - here are some steps that you can take to improve performance ( you may find that just one of these steps gives you the performance you require )
- consider replacing java.io.Serializable with java.io.Externalizable ( java.io.Externalizable can be much faster, but requires you do the the serialisation your self)
- consider using net.openhft.lang.io.serialization.BytesMarshallable, working with this is like using java .io.Externalizable but in addition is supports some compressed types, which can give you slightly better performance.
- try where possible to de-normalize your object graph, if you are able to limit your graph down to just a few types you can then look at using our off heap interface proxy objects and have a map instance per type of object ( much like in a database where you would have a table per type ), you could then set up foreign key relationships between the maps, by using for example integers or longs. This works especially well as integers and longs have a good hash distribution.
Generally speaking ChronicleMap
is slower then ConcurrentHashMap for a small number of entries, but
for a large number of entries ConcurrentHashMap doesn't scale as well as Chronicle Map, especially
when you start running low on heap. ConcurrentHashMap quickly becomes unusable whereas Chronicle Map
can still work when it is 20 times the size of a ConcurrentHashMap with an Out of Memory Error.
For example with a heap of 3/4 of say 32 GB main memory, you might get say 100 million entries but
when using most of the heap you might see 20-40 second gc pauses with ChronicleMap
you could have
1000 million entries and see < 100 ms pauses (depending on your disk subsystem and how fast you
write your data)
Chronicle Map makes heavy use of the OS to perform the memory management and writing to disk. How it behaves is very dependant on how you tune the kernel and what hardware you are using. You may get bad behaviour when the kernel forces a very large amount of data to disk after letting a lot of uncommited data build up. In the worst case scenario the OS will stop the process for tens of seconds at a time ( even up to 40 seconds) rather than let the program continue. However, to get into that state you have to be loading a lot of data which exceeds main memory with very little rest (e.g. cpu processing). There are good use cases for bulk data loads, but you have to be careful how this is done if you also want good worst case latency characteristics. (the throughput should be much the same)
When you create a Chronicle Map, it has many segments. By default it has a minimum of 128, but one for every 32 K entries. e.g. for 500M entries you can expect ~16K segments (being the next power of 2). With so many segments, the chances of a perfect hash distribution is low and so the Chronicle Map allows for double what you asked for but is designed to do this with almost no extra main memory (only extra virtual memory). This means when you ask for 500M * 256 bytes entries you actually get 1 BN possible entries (assuming a perfect hash distribution between segments) There is a small overhead per entry of 16 - 24 bytes adding another 20 GB.
So while the virtual memory is 270 GB, it is expected that for 500 M entries you will be trying to use no more than 20 GB (overhead/hash tables) + ~120 GB (entries)
When ChronicleMap
has exhausted all the memory on your server, its not going to be so fast, for a
random access pattern you are entirely dependant on how fast your underlying disk is. If your home
directory is an HDD and its performance is around 125 IOPS (I/Os per second). Each lookup takes two
memory accesses so you might get around 65 lookups per second. For 100-200K operations you can
expect around 1600 seconds or 25-50 minutes. If you use an SSD, it can get around 230 K IOPS, or
about 115 K ChronicleMap
lookups per second.
If you see the following warning :
WARNING:net.openhft.lang.io.AbstractBytes tryLockNanosLong0
WARNING: Thread-2, to obtain a lock took 0.129 seconds
It's likely you have lock contention, this can be due to :
- a low number of segments and
- the machine was heavily over utilised, possibly with the working data set larger than main memory.
- you have a large number of threads, greater than the number of cores you have, doing nothing but hit one map.
It’s not possible to fully disable locking, locking is done a a segment basis. So, If you set a large number of actual segments, this will reduce your lock contention.
See the example below to see how to set the number of segments :
ChronicleMap<Long, String> map = ChronicleMapBuilder.of(Long.class, String.class)
.entries(100)
.actualSegments(100) // set your number of segments here
.create();
Reducing lock contention will make this warning message go away, but this message maybe more of a symptom of a general problem with what the system is doing, so you may experience a delay anyway.
If you put() a small number of large entries into Chronicle Map, you are unlikely to see any performance gains over a standard map, So we recommend you use a standard ConcurrentHashMap, unless you need Chronicle Maps other features.
Chronicle Map gives better performance for smaller keys and values due to the low overhead per entry. It can use 1/5th the memory of ConcurrentHashMap. When you have larger entries, the overhead per entry doesn't matter so much and the relative waste per entry starts to matter. For Example, Chronicle Map assumes every entry is the same size and if you have 10kB-20kB entries the 10K entries can be using 20 kB of virtual memory or at least 12 KB of actual memory (since virtual memory turns into physical memory in multiples of a page)
As the ChronicleMap
gets larger the most important factor is the use of CPU cache rather than main
memory, performance is constrained by the number of cache lines you have to touch to update/read an
entry. For large entries this is much the same as ConcurrentHashMap. In this case, ChronicleMap
is
not worse than ConcurrentHashMap but not much better.
For large key/values it is not total memory use but other factors which matter such as;
- how compact each entry is. Less memory used makes better use of the L3 cache and memory bus which is often a bottleneck in highly concurrent applications.
- reduce the impact on GCs. The time to perform GC and its impact is linear. Moving the bulk of your data off heap can dramatically improve throughput not to mention worst case latency.
- Large data structures take a long time to reload and having a persisted store significantly reduces restart times.
- data can be shared between processes. This gives you more design options to share between JVMS and support short lived tasks without having to use TCP.
- data can be replicated across machines.
ConcurrentHashMap ( CHM ) outperforms ChronicleMap
( CM ) on throughput. If you don't need
the extra features SharedHashMap gives you, it is not worth the extra complexity it brings.
i.e. don't use it just because you think it is cool. The test can be found in
ChronicleMapTest
under testAcquirePerf() and testCHMAcquirePerf()
Chronicle Map out performs ConcurrentHashMap on memory consumption, and worst case latencies. It can be used to reduce or eliminate GCs.
The following performance test consists of string keys of the form "u:0123456789" and an int counter. The update increments the counter once in each thread, creating an new entry if required.
Number of entries | Chronicle* Throughput | Chronicle RSS | HashMap* Throughput | HashMap Worst GC pause | HashMap RSS |
---|---|---|---|---|---|
10 million | 30 Mupd/s | ½ GB | 155 Mupd/s | 2.5 secs | 9 GB |
50 million | 31 Mupd/s | 3⅓ GB | 120 Mupd/s | 6.9 secs | 28 GB |
250 million | 30 Mupd/s | 14 GB | 114 Mupd/s | 17.3 secs | 76 GB |
1000 million | 24 Mupd/s | 57 GB | OOME | 43 secs | NA |
2500 million | 23 Mupd/s | 126 GB | Did not test | NA | NA |
*HashMap refers to ConcurrentHashMap, Chronicle refers to Chronicle Map
Key : RSS - Resident memory size. How much main memory was used. Mupd/s - Million write operations per second. i.e. put(key, value);
Notes:
ChronicleMap
was tested with a 32 MB heap, CHM was test with a 100 GB heap.- The
ChronicleMap
test had a small minor GC on startup of 0.5 ms, but not during the test. This is being investigated. ChronicleMap
was tested "writing" to a tmpfs file system.
Chronicle Map also supports persistence. In this regard there is no similar class in the JDK.
Number of entries | Chronicle Throughput | Chronicle RSS |
---|---|---|
10 million | 28 Mupd/s | ½ GB |
50 million | 28 Mupd/s | 9 GB |
250 million | 26 Mupd/s | 24 GB |
1000 million | 1.3 Mupd/s | 85 GB |
Notes:
- Persistence was performed at a PCI-SSD which supports up to 230K IOPS and 900 MB/s write speed. This test didn't test the card to it's limit until the last test.
- The kernel tuning parameters for write back are important here. This explains the suddern drop off and this is being investigated.
The sysctl parameters used were approximately 10x the defaults to allow as many operations to be performed in memory as possible.
vm.dirty_background_ratio = 50
vm.dirty_expire_centisecs = 30000
vm.dirty_ratio = 90
vm.dirty_writeback_centisecs = 5000
I'm searching for a Map implementation that is backed by either direct off-heap or a memory mapped file. I want to use it as write-once, read-n-times kind of cache.
The latest version of chronicle map has been optimised for this use case. Ie for heavy read to write ratios. Note: heavy writers will see about the same performance.
The backing of to off-heap is only needed to prevent OOM-situations. Otherwise the process can use a big part of the available memory for the map.
Chronicle Map can be larger than main memory. In fact you might want to reduce the heap size to give chronicle map more memory in extreme cases. Ie because the map can't utilise on heap memory much.
I stumbled over chronicle-map and it looks like it could do most of those things. Correct me if i'm wrong but
// would create the off-heap version
ChronicleMapBuilder.of(A.class, B.class).create();
Yes.
// the memory-mapped file version ?
ChronicleMapBuilder.of(A.class, B.class).file(mapFile).create();
Yes.
But both versions write the complete map at every single moment to off-heap, right ?
Every key/value is off heap. There is no cache of objects on heap.
Is there something in chronicle map which can have part of the map-content still in on-heap memory.
There are data structures which are on heap to manahe the off heap data. This is typically less than 100 kB.
Either serialized or deserialized. Performance wise do you think that would even matter ?
It matter a lot but you can get the performance close to on heap objects without the long GC pause times. Ie only the bit on the heap makes any difference.
Last but not least... do you think chronicle is suitable for this simple purpose ?
Actually this is all it does really. We try to make it fast by keeping it simple. Eg it is not a cache and doesn't have expiries etc. There is no background thread managing it. In fact it is so simple that many methods are completely in lined. Ie if you look at a crash dump the chronicle methods have been completely in lined and don't appear in the stack trace any more. (The still appear in java stack traces)
Or is there a lot of overhead for other functionality which i do not intend to use (yet) !?
You can share the data between processes on the same machine but this doesn't add overhead. You can add replication between machines. But this uses extra classes which are not used in the simple case ie it is as if they were not there, only an option.