Issues: IBM/data-prep-kit
Issues list
[Bug] Parallel KFR execution of the same simple pipeline
bug
Something isn't working
#1109
opened Mar 7, 2025 by
roytman
2 tasks done
[Feature] Ability to inject XML/JATs into parquet for extended training
enhancement
New feature or request
#1107
opened Mar 6, 2025 by
touma-I
1 of 2 tasks
[Feature] Investigate new approach for further simplification by eliminating python runtime, ray runtime and spark runtime
enhancement
New feature or request
#1105
opened Mar 6, 2025 by
touma-I
1 of 2 tasks
Bring in cookbooks, recipes, and scripts for post-processing applications to DPK
enhancement
New feature or request
#1104
opened Mar 5, 2025 by
shahrokhDaijavad
1 of 2 tasks
[Feature] Enable in-memory chaining and parallel execution of transforms
enhancement
New feature or request
#1102
opened Mar 5, 2025 by
touma-I
2 tasks done
[Bug] S3 creds are printed to the logs
bug
Something isn't working
#1100
opened Mar 5, 2025 by
revit13
1 of 2 tasks
[Bug] HuggingFace token is exposed in kfp pipelines
bug
Something isn't working
#1098
opened Mar 5, 2025 by
revit13
2 tasks done
[Feature] Analyze transforms on the inner and convert to outer
enhancement
New feature or request
#1096
opened Mar 4, 2025 by
touma-I
1 of 2 tasks
[Bug] Wrong/ambiguous explanation in the ededup README file
bug
Something isn't working
sprint-Mar-7
#1086
opened Mar 2, 2025 by
roytman
1 of 2 tasks
Update the main README table with the list of new GneissWeb Transforms
enhancement
New feature or request
sprint-Mar-7
#1069
opened Feb 26, 2025 by
shahrokhDaijavad
1 of 2 tasks
On-boarding Multi-lingual transforms to DPK
enhancement
New feature or request
sprint-Mar-21
#1065
opened Feb 25, 2025 by
shahrokhDaijavad
1 of 2 tasks
[Bug] HAP transform crashes if "contents" is empty
bug
Something isn't working
sprint-Mar-7
#1048
opened Feb 13, 2025 by
burn2l
2 tasks done
[Feature] New DPK transform to get the distributions of quality metrics
enhancement
New feature or request
sprint-Mar-21
#1045
opened Feb 11, 2025 by
Hajar-Emami
1 of 2 tasks
[Feature] Filter both the parquet and arrow files and update the metadata simultaneously
enhancement
New feature or request
Pending
#1044
opened Feb 11, 2025 by
Hajar-Emami
1 of 2 tasks
[Feature] Enable crawling of websites that require credentials via SSO or 2FA
enhancement
New feature or request
Pending
#1040
opened Feb 11, 2025 by
touma-I
1 of 2 tasks
[Bug] Error running lang_id and code_quality kfp pipelines
bug
Something isn't working
sprint-Mar-7
#1038
opened Feb 11, 2025 by
revit13
1 of 2 tasks
Rep_removal for large data files crashes on 16GB memory
bug
Something isn't working
#1035
opened Feb 10, 2025 by
shahrokhDaijavad
1 of 2 tasks
[Feature] Enabling gneissweb_classification transform by using multiple fasttext classifiers simultaneously
enhancement
New feature or request
sprint-Mar-7
#1034
opened Feb 10, 2025 by
Hajar-Emami
1 of 2 tasks
[Feature] Update PII sample notebook to use simple APIs
enhancement
New feature or request
sprint-Mar-7
#1032
opened Feb 10, 2025 by
sujee
2 tasks done
On-boarding Multimodal transforms to DPK
enhancement
New feature or request
sprint-Apr-11
#1020
opened Feb 6, 2025 by
shahrokhDaijavad
1 of 2 tasks
Improve performance of the Readability transform
enhancement
New feature or request
sprint-Mar-7
#1015
opened Feb 5, 2025 by
shahrokhDaijavad
1 of 2 tasks
Consistency of defined configuration parameters with the CLI Options in all transforms READMEs and Notebooks
enhancement
New feature or request
#1002
opened Jan 30, 2025 by
shahrokhDaijavad
2 tasks done
[Feature] how to find which DPK 'modules' are installed
enhancement
New feature or request
#996
opened Jan 29, 2025 by
sujee
1 of 2 tasks
[Bug] Unable to access quay.io/dataprep1/data-prep-kit/doc_chunk-ray:latest
bug
Something isn't working
#995
opened Jan 29, 2025 by
touma-I
2 tasks done