Add diffBit into k-HDTCat #184

Merged
merged 1 commit into from
Dec 12, 2022


ate47 (Contributor) commented on Dec 12, 2022

This pull request adds the method diffBitCatHDT(HDTs, Bitmaps):

public static HDT diffBitCatHDT(List<String> hdtFileNames, List<? extends Bitmap> deleteBitmaps, HDTOptions hdtFormat, ProgressListener listener) throws IOException;

It is implemented by adding a few things to k-HDTCat (#181).

First, it creates 3 bitmaps per HDT, one each for subjects, predicates and objects (s, p, o), to mark which components are deleted.
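To make the bitmap step concrete, here is a minimal sketch of how the three component bitmaps could be derived for one input. It is not the code from this PR: it assumes a component counts as deleted once no surviving triple references it, it uses `java.util.BitSet` in place of the library's `Bitmap`, it uses 0-based IDs, and it ignores the shared section. The class and method names are hypothetical.

```java
import java.util.BitSet;

public class ComponentBitmapsSketch {
    /**
     * Returns {deletedSubjects, deletedPredicates, deletedObjects} for one input.
     * Each triple is {subjectId, predicateId, objectId} with 0-based IDs (assumption).
     */
    public static BitSet[] deletedComponents(int[][] triples, BitSet deletedTriples,
                                             int nSubjects, int nPredicates, int nObjects) {
        BitSet[] used = { new BitSet(nSubjects), new BitSet(nPredicates), new BitSet(nObjects) };
        for (int t = 0; t < triples.length; t++) {
            if (deletedTriples.get(t)) {
                continue; // a deleted triple keeps none of its components alive
            }
            for (int role = 0; role < 3; role++) {
                used[role].set(triples[t][role]);
            }
        }
        int[] sizes = { nSubjects, nPredicates, nObjects };
        BitSet[] deleted = new BitSet[3];
        for (int role = 0; role < 3; role++) {
            // deleted = every ID in [0, size) that no surviving triple uses
            deleted[role] = (BitSet) used[role].clone();
            deleted[role].flip(0, sizes[role]);
        }
        return deleted;
    }
}
```

With triples (0,0,0), (0,1,1), (1,0,1) and only the third one deleted, subject 1 is the only component that ends up flagged.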

Then, during the dictionary merge phase, it skips the elements flagged in those bitmaps.
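The merge-with-skip idea can be sketched as a k-way merge over sorted dictionary sections that never emits a flagged entry. This is an illustration under assumptions, not the k-HDTCat implementation: sections are plain `List<String>` values, `BitSet` replaces `Bitmap`, indexes are 0-based, and terms are compared with `String.compareTo` rather than the byte order an HDT dictionary uses. All names are hypothetical.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;
import java.util.PriorityQueue;

public class DiffMergeSketch {
    /** Merges sorted sections, skipping flagged entries and de-duplicating terms. */
    public static List<String> merge(List<List<String>> sections, List<BitSet> deleted) {
        // Heap entries are {input, index}, ordered by the term they point at.
        PriorityQueue<int[]> heap = new PriorityQueue<>(
                (a, b) -> sections.get(a[0]).get(a[1]).compareTo(sections.get(b[0]).get(b[1])));
        for (int input = 0; input < sections.size(); input++) {
            pushNextSurvivor(heap, sections, deleted, input, 0);
        }
        List<String> merged = new ArrayList<>();
        while (!heap.isEmpty()) {
            int[] head = heap.poll();
            String term = sections.get(head[0]).get(head[1]);
            // The same term may survive in several inputs; keep a single copy.
            if (merged.isEmpty() || !merged.get(merged.size() - 1).equals(term)) {
                merged.add(term);
            }
            pushNextSurvivor(heap, sections, deleted, head[0], head[1] + 1);
        }
        return merged;
    }

    // Advances past deleted entries and queues the next surviving one, if any.
    private static void pushNextSurvivor(PriorityQueue<int[]> heap, List<List<String>> sections,
                                         List<BitSet> deleted, int input, int from) {
        List<String> section = sections.get(input);
        int index = from;
        while (index < section.size() && deleted.get(input).get(index)) {
            index++;
        }
        if (index < section.size()) {
            heap.add(new int[] { input, index });
        }
    }
}
```

A term is dropped from the output only when it is flagged in every input that contains it; if another input still holds an unflagged copy, that copy is emitted.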

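When building the triples, it passes over the mapped shared/subject entries twice: first to find the shared elements, then the subject elements. The second pass is needed because an element that was shared can now become a subject-only element, for example once every triple using it as an object is deleted.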
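The two passes can be illustrated with a small ID-assignment sketch. It is not the PR's code; it only assumes the usual HDT layout in which shared terms take the first subject IDs and subject-only terms follow. The two `BitSet` inputs, which record whether each merged term is still used as a subject or as an object after the deletions, are hypothetical, as are the class and method names.

```java
import java.util.BitSet;

public class TwoPassIdsSketch {
    /**
     * Assigns 1-based subject IDs to n sorted terms; 0 means the term is no longer a subject.
     * stillSubject / stillObject say how each term is used after the deletions.
     */
    public static long[] assignSubjectIds(BitSet stillSubject, BitSet stillObject, int n) {
        long[] ids = new long[n];
        long next = 1;
        // Pass 1: terms that remain both subject and object stay in the shared section.
        for (int i = 0; i < n; i++) {
            if (stillSubject.get(i) && stillObject.get(i)) {
                ids[i] = next++;
            }
        }
        // Pass 2: subject-only terms, including former shared terms that lost
        // all of their object occurrences, are numbered after the shared ones.
        for (int i = 0; i < n; i++) {
            if (stillSubject.get(i) && !stillObject.get(i)) {
                ids[i] = next++;
            }
        }
        return ids;
    }
}
```

A single pass would interleave shared and subject-only IDs in term order, which breaks the layout assumed here whenever a formerly shared term sorts before a term that is still shared.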

This method is faster than the current HDTDiff. Since k-HDTCat is also faster than HDTCat, combining the two methods can greatly reduce the time spent on update (Cat+Diff) operations.

D063520 (Contributor) commented on Dec 12, 2022

It's great, this will speed up things again! Nice contribution!

D063520 merged commit ff0bcde into rdfhdt:master on Dec 12, 2022