N3C-CodeSets repository has three CSVs:
- codeset.csv has a list of all TermHub codesets and their metadata
- codeset_item.csv has one entry for every concept in the codesets
- researcher.csv has one entry for every unique researcher_id referenced in the codeset.csv metadata (codeset_created_by and container_created_by fields)
To regenerate these CSV files: clone this repository, delete the existing CSV files, and execute populateCodesets.py. If you do not delete the existing CSV files, they will be appended to.