Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Malformed lines #34

Open
lironT74 opened this issue Sep 30, 2021 · 1 comment
Open

Malformed lines #34

lironT74 opened this issue Sep 30, 2021 · 1 comment

Comments

@lironT74
Copy link

Hi @joaopalotti,
It seems that for malformed lines, such as:
image
trec_eval throws:
trec_eval.get_results: Malformed line 790
While trec_tools does not.

I am not sure that this is a bad thing but perhaps a warning will suite here?
Unfortunately I am a bit swamped lately so I am not available to offer a fix myself.
Thank you!

@joaopalotti
Copy link
Owner

Hi @lironT74, thank you very much for identifying this mismatch between trec_eval and trectools.
While ago, I created a set of validation scripts for CLEF eHealth. Their goal was to verify this type of error and other similar problems such as non-sequential document rank or non-decreasing score for a given topic.

Having this kind of check integrated into trectools would be amazing!

I will leave this issue with a help-wanted flag and hope to get help from the amazing IR community!

joaopalotti added a commit that referenced this issue Jan 9, 2023
Add format validation when loading a run (#34)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants