Merge of hdtCat #74
I noticed that when two identical files are merged with hdtCat, the triples are duplicated. The output file is larger than expected, and hdtSearch returns twice as many results on it as on the original file.
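To make the expected behavior concrete, here is a minimal sketch of a merge of two sorted triple lists that emits each triple only once. It is not the hdtCat implementation and uses no hdt-java API: the class `SortedTripleMerge`, the method `mergeDistinct`, and the representation of a triple as a `long[3]` of IDs are all hypothetical, and it assumes both inputs already use IDs from one shared dictionary.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SortedTripleMerge {

    // Hypothetical helper, not part of hdt-java.
    // Each triple is a long[3] of (subject, predicate, object) IDs.
    // Assumes both lists are sorted in that order, contain no duplicates,
    // and use IDs from the same dictionary.
    static List<long[]> mergeDistinct(List<long[]> a, List<long[]> b) {
        List<long[]> out = new ArrayList<>();
        int i = 0;
        int j = 0;
        while (i < a.size() && j < b.size()) {
            // Lexicographic comparison of the two ID arrays (Java 9+).
            int c = Arrays.compare(a.get(i), b.get(j));
            if (c < 0) {
                out.add(a.get(i++));
            } else if (c > 0) {
                out.add(b.get(j++));
            } else {
                // Same triple in both inputs: keep one copy, advance both.
                out.add(a.get(i));
                i++;
                j++;
            }
        }
        // Copy whatever remains of the longer input.
        while (i < a.size()) out.add(a.get(i++));
        while (j < b.size()) out.add(b.get(j++));
        return out;
    }

    public static void main(String[] args) {
        List<long[]> file = List.of(
                new long[] {1, 1, 2},
                new long[] {1, 2, 3},
                new long[] {2, 1, 1});
        // Merging a file with itself should not change the triple count.
        System.out.println(mergeDistinct(file, file).size()); // prints 3
    }
}
```

With this kind of merge, combining a file with itself yields the same number of triples as the original, which is the result I would expect from hdtCat on identical inputs.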
Hi, Salut
Guys, love this work. Could you list the major changes to the original code/API? Are there any breaking changes?
There are basically no breaking changes, only additions on the side .... Enjoy!
Alright, cool, merged! Would you mind extending the README to document the new feature a bit?
Cool, thank you! Somehow there were two README files. I changed that ..... Also, the description of hdtCat is in the hdt-java-cli README, which I have already updated!
This branch contains an implementation of hdtCat, an algorithm and command-line tool that merges two HDT files without decompressing them. This makes it possible to merge HDT files and to serialize large RDF files to HDT with a low memory footprint. On a 16 GB machine we were able to generate an HDT file with 5 billion triples. The code does not currently work on Windows; this is a known issue.